Site Reliability Engineering Lead

2 months ago


Bangalore, India Arcesium Full time

The SRE team is responsible for monitoring the stability and availability of mission-critical production systems, managing incidents for quicker resolution, and establishing BAU. The team also builds tools and infrastructure that all development teams will use to monitor and troubleshoot their systems.


What you'll do:

  • Lead reliability engineering projects and drive them to closure.
  • Write code and perform code reviews for best practices and code quality.
  • Contribute to the design/architecture of the system.
  • Automate processes to reduce toil, and find opportunities to improve the observability and availability of the platform.
  • Supervise a team of SREs, ensuring production applications are stable, reliable, and well documented.
  • Own end-to-end availability and performance of mission-critical services.
  • Analyze and debug complex issues across tiers from frontend to mid-tier to infrastructure.
  • Practice sustainable incident response and blameless postmortems.


What you'll need:

  • 5 to 9 years of experience handling systems for large-scale production environments.
  • A self-starter, able to build, drive, and advocate for SRE solutions.
  • Effective cross-functional collaboration skills to develop tools for secure, scalable, and reliable systems.
  • Solid understanding of SRE concepts like SLAs, SLOs, SLIs, error budgets, MTTR, MTTD, etc.
  • Experience with a variety of tools that help manage, understand, and debug large, complex distributed systems.
  • Good programming experience (Python/Go).
  • Hands-on experience with Kubernetes and Docker.
  • Working knowledge of any one of the cloud platforms (AWS, Azure, GCP).
  • Experience with monitoring and logging tools (e.g. Datadog, ELK, Prometheus, Grafana).
  • Good knowledge of Unix systems, networking, web technologies, and databases.
  • Expertise in troubleshooting issues and bugs.
  • Incident Management experience coupled with effective communication skills.
  • Experience in the financial domain (desirable).
  • Prior SRE/DevOps experience desirable.


Arcesium and its affiliates do not discriminate in employment matters on the basis of race, color, religion, gender, gender identity, pregnancy, national origin, age, military service eligibility, veteran status, sexual orientation, marital status, disability, or any other category protected by law. Note that for us, this is more than just legal boilerplate. We are genuinely committed to these principles, which form an important part of our corporate culture, and are eager to hear from extraordinarily well-qualified individuals having a wide range of backgrounds and personal characteristics.



  • Bangalore, India OptOut Full time

    Job Description: At OptOut, we're seeking a highly skilled Site Reliability Engineer Lead to join our team. As a key member of our engineering organization, you will be responsible for leading our SRE & Observability teams and executing on the vision of providing an enterprise-based common Observability Platform leveraged by a global Engineering, Product, and...


  • Bangalore, India Yogy HR Solutions Full time

    Site Reliability Engineer: We are seeking a highly skilled Site Reliability Engineer to join our team at Yogy HR Solutions. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, performance, and scalability of our cloud-based systems. Key Responsibilities: Collaborate with development partners to design and implement scalable...


  • Bangalore, India Yogy HR Solutions Full time

    Job Title: Site Reliability Engineer. We are seeking a highly skilled Site Reliability Engineer to join our team at Yogy HR Solutions. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, performance, and scalability of our cloud-based systems. Key Responsibilities: Collaborate with development partners to design and...


  • Bangalore, India Tyson Foods India Full time

    Job Description – Lead Site Reliability Engineer (Cloud Engineering) The role of Lead Site Reliability Engineer in the Data & Analytics organization is to lead efforts to ensure the reliability, scalability, and performance of our cloud-based systems on platforms like GCP/AWS. The role will play a crucial part in designing and implementing robust, scalable...


  • Bangalore City, India Tyson Foods India Full time

    Job Description – Lead Site Reliability Engineer (Cloud Engineering) The role of Lead Site Reliability Engineer in the Data & Analytics organization is to lead efforts to ensure the reliability, scalability, and performance of our cloud-based systems on platforms like GCP/AWS. The role will play a crucial part in designing and implementing robust, scalable...


  • Bangalore, India Cyitechsearch Full time

    Job Title: Site Reliability Engineer. About the Role: We are seeking a highly skilled Site Reliability Engineer to join our team at Cyitechsearch. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our full-stack software applications. Key Responsibilities: Develop and provide operational...


  • Bangalore, India Micoworks Full time

    Job Title: Site Reliability Engineer. At Micoworks, we are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the stability, scalability, and performance of our cloud-based services. Key Responsibilities: Design, implement, and maintain scalable and reliable...


  • Bangalore, India Squareroot Consulting Pvt Ltd. Full time

    Job Title: Site Reliability Engineer. We are seeking a highly skilled Site Reliability Engineer to join our team at Squareroot Consulting Pvt Ltd. in Bangalore, India. As a Site Reliability Engineer, you will be responsible for designing and implementing secure and scalable infrastructure as a service, automating infrastructure provisioning, and building tools...


  • Bangalore, India Integra Connect Full time

    About Integra Connect: Integra Connect delivers a comprehensive, integrated suite of cloud-based technologies and services that enable specialty groups to optimize clinical and financial performance as reimbursement shifts to value-based models. Connected by the Integra Cloud platform, the company’s core applications span population health including...


  • Bangalore, India Integra Connect Full time

    About IntegraConnect: Integra Connect delivers a comprehensive, integrated suite of cloud-based technologies and services that enable specialty groups to optimize clinical and financial performance as reimbursement shifts to value-based models. Connected by the IntegraCloud platform, the company’s core applications span population health including care...


  • Bangalore, India Tranzeal Incorporated Full time

    Hi everyone, one of our direct clients is hiring a Site Reliability Engineer in Bengaluru, Karnataka, India. If anyone is interested, please share your resume. Job Title: Site Reliability Engineer. Location: Bengaluru, Karnataka, India - Onsite. Job Description: Responsible for maintaining and scaling production services and servers across multiple data centers for...


  • Bangalore, India Wealthy Full time

    Job Title: Site Reliability Engineer. We are seeking a highly skilled Site Reliability Engineer to join our team at Wealthy. As a Site Reliability Engineer, you will be responsible for designing, implementing, and maintaining reliable containerized applications using Kubernetes on GCP. Key Responsibilities: Develop and optimize SLIs, SLOs, and SLAs for critical...


  • Bangalore, India Wealthy Full time

    Job Title: Site Reliability Engineer. Wealthy is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for designing, implementing, and maintaining reliable containerized applications using Kubernetes on GCP. Key Responsibilities: Develop and optimize SLIs, SLOs, and SLAs for critical systems...


  • Bangalore, India CSC Full time

    Role: Site Reliability Engineer. Location: Mumbai/Bangalore. Working Model: Hybrid. Shift: 12-9 PM. Intro: Do you want to be noticed for your work? Make a difference every day? Be impactful? Work with cutting-edge technology? If so, you will fit in perfectly at CSC and especially within the Regulatory Technology Team. The world’s leading...


  • Bangalore, India Integra Connect Full time

    About IntegraConnect: Integra Connect delivers a comprehensive, integrated suite of cloud-based technologies and services that enable specialty groups to optimize clinical and financial performance as reimbursement shifts to value-based models. Connected by the IntegraCloud platform, the company’s core applications span population health including care...

