Site Reliability Specialist

1 month ago


Bangalore, India Wealthy Full time

Job Title: Site Reliability Specialist

About the Role: Wealthy is seeking a highly skilled Site Reliability Specialist to join our team. As a Site Reliability Specialist, you will be responsible for designing, implementing, and maintaining reliable containerized applications using Kubernetes on GCP.

Key Responsibilities:

  • Develop and optimize SLIs, SLOs, and SLAs for critical systems and services.
  • Create and maintain automation for deployment, scaling, and management of applications and infrastructure.
  • Implement and manage observability solutions, including monitoring, logging, and alerting systems.
  • Conduct capacity planning and performance optimization for Kubernetes clusters and GCP resources.
  • Collaborate with development teams to improve application reliability, scalability, and performance.
  • Implement and maintain disaster recovery and business continuity plans.
  • Continuously improve system reliability through chaos engineering and proactive testing.
  • Optimize resource utilization and cost management within GCP.
  • Implement security best practices for Kubernetes clusters and GCP services.
  • Stay up to date with the latest developments in site reliability engineering, Kubernetes, and GCP services.
  • Document processes, runbooks, and best practices for maintaining system reliability.
  • Serve as the primary point of contact for interactions with Google Cloud support and other service providers.
  • Collaborate with Google Cloud representatives to optimize our use of GCP services and stay informed about new features and best practices.
  • Manage relationships with other cloud and service providers, ensuring optimal integration and utilization of their services.
  • Rapidly learn and adapt to new technologies and tools as needed.
  • Proactively explore and experiment with new technologies and methodologies to improve system reliability and efficiency.
  • Implement and manage GitOps workflows using ArgoCD for Kubernetes deployments.
  • Design, develop, and maintain Helm charts for streamlined application deployments.

Requirements:

  • Strong experience with Kubernetes, including deployment, scaling, and management of containerized applications.
  • Extensive knowledge of Google Cloud Platform (GCP) services and best practices.
  • Solid understanding of containerization technologies, particularly Docker.
  • Experience with monitoring and observability tools (e.g., Prometheus, Grafana, and alerting systems).
  • Strong knowledge of version control systems, particularly Git.
  • Deep understanding of networking concepts, including VPNs, VPCs, and gateways.
  • Familiarity with database administration, particularly cloud-native database solutions.
  • Strong analytical skills with a focus on system reliability and performance optimization.
  • Excellent communication skills, including the ability to effectively interact with external service providers and translate technical concepts for non-technical stakeholders.
  • Experience in vendor management or working closely with cloud service providers.
  • Ability to work in a collaborative team environment.
  • Demonstrated ability to quickly learn and adapt to new technologies and tools.
  • Familiarity with CI/CD concepts and ability to work with various CI/CD tools.
  • Natural curiosity and a tinkerer's mindset, with a passion for understanding how systems work at a deep level.
  • History of personal projects or contributions to open-source projects (preferred).
  • Ability to think creatively and approach problems from multiple angles.
  • Proficiency with ArgoCD and GitOps principles for managing Kubernetes deployments.
  • Strong experience with Helm for packaging and deploying Kubernetes applications.
  • Certifications such as Certified Kubernetes Administrator (CKA) or Google Cloud Professional DevOps Engineer are a plus.


  • Bangalore, India Innovation Consulting Services Full time

    Job Title: Senior Site Reliability Engineer - Disaster Recovery Specialist. We are seeking a highly skilled and motivated Senior Site Reliability Engineer (SRE) - Disaster Recovery Specialist to join our team at Innovation Consulting Services. The ideal candidate will be responsible for designing, implementing, and maintaining systems and processes that ensure...


  • Bangalore, India Yogy HR Solutions Full time

    Site Reliability Engineer. We are seeking a highly skilled Site Reliability Engineer to join our team at Yogy HR Solutions. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, performance, and scalability of our cloud-based systems. Key Responsibilities: Collaborate with development partners to design and implement scalable...


  • Bangalore, India Micoworks Full time

    Job Title: Site Reliability Engineer. At Micoworks, we are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the stability, scalability, and performance of our cloud-based services. Key Responsibilities: Design, implement, and maintain scalable and reliable...


  • Bangalore, India Squareroot Consulting Pvt Ltd. Full time

    Job Title: Site Reliability Engineer. We are seeking a highly skilled Site Reliability Engineer to join our team at Squareroot Consulting Pvt Ltd. in Bangalore, India. As a Site Reliability Engineer, you will be responsible for designing and implementing secure and scalable infrastructure as a service, automating infrastructure provisioning, and building tools...


  • Bangalore, India Tranzeal Incorporated Full time

    Hi everyone, one of our direct clients is hiring a Site Reliability Engineer in Bengaluru, Karnataka, India. If anyone is interested, please share your resume. Job Title: Site Reliability Engineer. Location: Bengaluru, Karnataka, India - Onsite. Job Description: Responsible for maintaining and scaling production services and servers across multiple data centers for...


  • Bangalore, India Tranzeal Incorporated Full time

    Job Title: Site Reliability Engineer (SRE). Location: Bangalore. We're hiring a Site Reliability Engineer to join our team in Bangalore! If you have a strong background in maintaining and scaling cloud services and love automating infrastructure at scale, this is for you. Experience with Ansible and Kubernetes is a must-have. Key...


  • Bangalore, India Wealthy Full time

    Job Title: Site Reliability Engineer. Wealthy is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for designing, implementing, and maintaining reliable containerized applications using Kubernetes on GCP. Key Responsibilities: Develop and optimize SLIs, SLOs, and SLAs for critical systems...


  • Bangalore, India Tsworks Full time

    Who We Are: tsworks Technologies India Private Limited (subsidiary of The Software Works, Inc., USA) is a technology product and services company. Our mission is to provide domain expertise, innovative solutions and thought leadership to empower businesses to thrive in a digital world. We value our employees, take pride in providing best value in customer...


  • Bangalore, India Randstad Digital Full time

    Job Title: Site Reliability Engineering. Location: Bangalore. Experience: 6-8 years. Job Description: Summary: As an Application Developer, you will design, build, and configure applications to meet business process and application requirements. You will collaborate with the team to ensure smooth operations and provide solutions to problems. Your...


  • Bangalore, India Integra Connect Full time

    About Integra Connect: Integra Connect delivers a comprehensive, integrated suite of cloud-based technologies and services that enable specialty groups to optimize clinical and financial performance as reimbursement shifts to value-based models. Connected by the Integra Cloud platform, the company’s core applications span population health including...

