Wealthy - Site Reliability Engineer - Kubernetes

2 weeks ago


Bangalore, Karnataka, India Wealthy.in Full time

Job Description :


- Design, implement, and maintain reliable containerized applications using Kubernetes on GCP.

- Develop and optimize SLIs, SLOs, and SLAs for critical systems and services.

- Create and maintain automation for deployment, scaling, and management of applications and infrastructure.

- Implement and manage observability solutions, including monitoring, logging, and alerting systems.

- Conduct capacity planning and performance optimization for Kubernetes clusters and GCP resources.

- Collaborate with development teams to improve application reliability, scalability, and performance.

- Implement and maintain disaster recovery and business continuity plans.

- Continuously improve system reliability through chaos engineering and proactive testing.

- Optimize resource utilization and cost management within GCP.

- Implement security best practices for Kubernetes clusters and GCP services.

- Stay up-to-date with the latest developments in site reliability engineering, Kubernetes, and GCP services.

- Document processes, runbooks, and best practices for maintaining system reliability.

- Serve as the primary point of contact for interactions with Google Cloud support and other service providers.

- Collaborate with Google Cloud representatives to optimize our use of GCP services and stay informed about new features and best practices.

- Manage relationships with other cloud and service providers, ensuring optimal integration and utilization of their services.

- Rapidly learn and adapt to new technologies and tools as needed.

- Proactively explore and experiment with new technologies and methodologies to improve system reliability and efficiency.

- Implement and manage GitOps workflows using ArgoCD for Kubernetes deployments.

- Design, develop, and maintain Helm charts for streamlined application deployments.

Requirements :

- Strong experience with Kubernetes, including deployment, scaling, and management of containerized applications.

- Extensive knowledge of Google Cloud Platform (GCP) services and best practices.

- Solid understanding of containerization technologies, particularly Docker.

- Experience with monitoring and observability tools (e.g, Prometheus, Grafana, Alerting).

- Strong knowledge of version control systems, particularly Git.

- Deep understanding of networking concepts, including VPNs, VPCs, and gateways.

- Familiarity with database administration, particularly cloud-native database solutions.

- Strong analytical skills with a focus on system reliability and performance optimization.

- Excellent communication skills, including the ability to effectively interact with external service providers and translate technical concepts for non-technical stakeholders.

- Experience in vendor management or working closely with cloud service providers.

- Ability to work in a collaborative team environment.

- Demonstrated ability to quickly learn and adapt to new technologies and tools.

- Familiarity with CI/CD concepts and ability to work with various CI/CD tools.

- Natural curiosity and a tinkerer's mindset, with a passion for understanding how systems work at a deep level.

- History of personal projects or contributions to open-source projects (preferred).

- Ability to think creatively and approach problems from multiple angles.

- Proficiency with ArgoCD and GitOps principles for managing Kubernetes deployments.

- Strong experience with Helm for packaging and deploying Kubernetes applications.

- Certifications such as Certified Kubernetes Administrator (CKA) or Google Cloud Professional DevOps Engineer are a plus.

(ref:hirist.tech)

  • Bangalore, Karnataka, India Yogy HR Solutions Full time

    Site Reliability Engineer Exp : 6-12 years Location : Bangalore Mandatory, Hybrid model from Bangalore location. Notice Period : Immediate to 30 days preferred. Roles and Responsibilities :- Work with development partners to shape the architecture, design, and implementation of new and existing systems to enhance their reliability, performance, efficiency,...


  • Bangalore, Karnataka, India Tranzeal Incorporated Full time

    Site Reliability Engineer Responsibilities : - Maintain and scale production services and servers across multiple data centers for complex cloud-based applications. - Improve scalability, reliability, capacity, and performance of production systems. - Write automation code using tools like Terraform and Ansible to provision and operate infrastructure at...

  • DevOps Engineer

    3 weeks ago


    Bangalore, Karnataka, India Catalyst IQ Full time

    We are a pioneering social sports betting platform in India focused on a free-to-play model. We're dedicated to enhancing user experience, fostering community engagement, and developing a loyal fan base.Overview :Join our team as a Site Reliability Engineer (SRE) / DevOps Engineer. Ensure the reliability, scalability, and performance of our social sports...


  • Bangalore, Karnataka, India Altruist Technologies Full time

    Need banking domain experience.Role : Site Reliability EngineerJob Description :1. Run and monitor the production systems environment for high availability, reliability, and performance by taking a holistic view of systems health.2. Build software tools/automation scripts for periodic system admin tasks, spot failure patterns monitor service health, server...


  • Bangalore, Karnataka, India Squareroot Consulting Pvt Ltd. Full time

    Site Reliability EngineerLocation : Bangalore, IndiaDomain : CybersecurityBudget : 30 to 50 Lacks - We are looking for a hands-on devops engineer leading the design, implementation of devops/SRE practice for our infrastructure for data privacy.- The successful candidate will have experience implementing advanced DevOps & SRE techniques such as Auto...


  • Bangalore, Karnataka, India Talpro Full time

    About Us :Talpro is leading the way in transforming the talent acquisition landscape. We deliver innovative, sustainable, and cost-effective recruitment solutions tailored to today's business needs. Our mission is to offer comprehensive hiring strategies that address immediate recruitment demands while laying the groundwork for long-term success.Job...


  • Bangalore, Karnataka, India Kiash Solutions LLp Full time

    Considering candidates that can onboard us in 0-15 days at our Bengaluru office. Skill : Lead Software Engineer - Devops Cloud. Note : Experience : 6.5+ Years. Mandatory Skill are : Devops + Cloud+ Migration + Ansible+ Terraform & Kubernetes +CI/CD, lead experience is needed. Location : Bangalore. Job Type - FTE. Notice Period - Immediate/ 15 Days. Budget -...


  • Bangalore, Karnataka, India Innovation Consulting Services Full time

    We are seeking a highly skilled and motivated Senior Site Reliability Engineer (SRE) - Disaster Recovery Specialist. The ideal candidate will be responsible for designing, implementing, and maintaining systems and processes that ensure the reliability, scalability, and disaster resilience of our infrastructure and applications. This role requires a good...


  • Bangalore, Karnataka, India TETRAHED INC Full time

    Job Description :We are looking for a talented and experienced DevOps/Site Reliability Engineer (SRE) with a strong proficiency in Python.Skill & Experience :- Collaborate with development teams to design, develop, and maintain infrastructure for our highly available and scalable applications.- Automate processes using Python scripting to streamline the...


  • Bangalore, Karnataka, India Arting Digital Full time

    Posting title : Site Reliability Engineer Experience : 7+ Years Location : Bangalore Work mode : WFO Primary skills : Cloud Monitoring & Operations (GCP & Azure), Python, ServiceNowQualification : Any Engineering/ Computers degreeRoles & Responsibilities : Daily Operations & Monitoring : - Actively monitor systems, applications, and infrastructure across...


  • Bangalore, Karnataka, India Groww Full time

    About Groww :- We are a passionate group of people focused on making financial services accessible to every Indian through a multi-product platform.- Each day, we help millions of customers take charge of their financial journey.- Customer obsession is in our DNA.- Every product, every design, every algorithm down to the tiniest detail is executed keeping...


  • Bangalore, Karnataka, India NOMISO INDIA PRIVATE LIMITED Full time

    About Company : Nomiso is a product and services engineering company. We are a team of Software Engineers, Architects, Managers, and Cloud Experts with expertise in Technology and Delivery Management. Our mission is to Empower and Enhance the lives of our customers, through efficient solutions for their complex business problems. At Nomiso we encourage...


  • Bangalore, Karnataka, India Cyitechsearch Full time

    We are hiring for Site Reliability EngineerSkills :- Develop and provide operational support for full-stack software applications.- Relevant industry certifications, such as through the Site Reliability Engineering (SRE) Foundation.- Five years' experience as a site reliability engineer or similar role.- Collaborate with development operations staff to...


  • Bangalore, Karnataka, India Watson Search Partner Full time

    About the SRE :Site reliability engineers (SREs) combine engineering experience and an innate drive to improve existing systems and processes, with the creativity to develop novel solutions to evolving challenges that can hamper reliability, performance and availability of critical platform services and applications. SRE builds solutions (Process + tools) to...


  • Bangalore, Karnataka, India Flairchase Full time

    Job Description : Responsibilities :- Design, build, and ship the systems and infrastructure that power our applications.- Design and development of new features.- Minimizing and hardening microservices and public-facing API gateway attack surface.- Continuous delivery using tools such as Tekton, Travis, Jenkins, Ansible, and Kubernetes.- Observability,...

  • Cloud Engineer

    3 weeks ago


    Bangalore, Karnataka, India Sampoorna Consultants Pvt. Ltd Full time

    Job Description : The Cloud Engineer role is a hands-on role primarily responsible for the deployment, monitoring and uptime/SLA of a multi-tenanted, large scale SaaS Service.This role also has a Site Reliability Engineer (SRE) function to it, requiring an ability to use automation to push updates at scale to the SaaS service and perform automated...


  • Bangalore, Karnataka, India Coders Brain Technology Private Limited Full time

    Role : Lead Platform Engineer (Azure Kubernetes Services)Seeking a seasoned Lead Platform Engineer to drive the maintenance and expansion of our Azure Kubernetes Services (AKS) environment. This critical role is central to our IT services and cloud platform strategy, overseeing the architecture, design, development, implementation, and on-call production...


  • Bangalore, Karnataka, India SMARTWORK IT SERVICES Full time

    Role and Responsibilities :- Maintain and scale production services and servers across multiple data centers for complex, data-intensive cloud services.- Improve scalability, service reliability, capacity, and performance of cloud infrastructure.- Write automation code for provisioning and operating infrastructure at a massive scale.- Work with development...


  • Bangalore, Karnataka, India Cricbuzz Full time

    About the job :We are looking for a highly skilled and motivated Web Server Site Reliability Engineer to join our team. As a Web Server Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our web server infrastructure and CDN services.Experience : 3 - 5 yearsResponsibilities :- Design, implement,...


  • Bangalore, Karnataka, India Lytx, Inc Full time

    Job Description :Why Lytx : Join our dynamic and passionate team of driven, low-ego engineers who are at the forefront of designing and supporting cutting-edge IoT infrastructure. As we rapidly grow and transition to the cloud, we're diving into the exciting realms of "Operations as Code," "Infrastructure as Code," and innovative infrastructure...