Site Reliability Engineer

19 hours ago


Chennai, Tamil Nadu, India Zyoin Group Full time
Work Mode: Hybrid (2 days Office)

We are looking for a Site Reliability Engineer (SREs) who will lead the Site Reliability Engineering(SRE) side of each of our products. This position is responsible for making technical decisions, collaborating with development teams and platform engineers, and building and operating highly reliable and scalable products.

Specifically, SREs quantitatively measure and manage system reliability, achieving appropriate risk balance through SLI/SLOs. By automating operations to reduce human error, responding quickly to incidents, conducting root cause analysis, and driving continuous improvement, SREs enhance service resilience. Through these efforts, SREs cultivate a culture within the organization that blends engineering and operational best practices.

In this role, you will act as a leader who identifies technical challenges within development teams, proactively plans solutions, and drives projects to resolution. By closely collaborating with developers and platform engineers, you will promote continuous improvements, ensuring that products remain resilient, scalable, and aligned with business objectives.

Define and implement Service Level Objectives (SLOs) and Service Level Indicators (SLIs) to measure system reliability and performance

Analyze and improve system bottlenecks and conduct capacity planning

Conduct postmortems and root cause analyses to prevent recurrence

Continuously improve the incident management process and optimize on-call operations

Optimize deployment pipelines and CI/CD workflows to improve release efficiency and rollback capabilities

Observability & Monitoring

Design and implement comprehensive monitoring, logging, and tracing strategies using tools like OpenTelemetry, Grafana, Prometheus, and Datadog

Continuously enhance system visibility and root cause analysis capabilities

Work closely with other SREs, platform engineers, and developers to optimize infrastructure and improve reliability

Few years of experience in Site Reliability Engineering, DevOps, or Infrastructure Engineering

Some coding experience is required (does not need to be web applications; experience with batch processing or small automation scripts only is acceptable)

shell(e.g. bash) only experience is not acceptable. Experience with some statically typed(e.g. C, C++, Java, Rust, Go, Scala.. ) Perl, Ruby, Python, PHP, JavaScript…) language is required.

Experience collaborating with development teams to enhance system reliability

Technical leadership experience (mentoring and supporting team members in technical areas)

Proven experience in project management (identifying issues, planning solutions, driving execution, and coordinating stakeholders)

Multiple experiences in the following technical areas:

Experience operating Kubernetes in a production environment

Experience with CI/CD automation tools (e.g., Hands-on experience with observability tools (e.g., Prometheus, OpenTelemetry, Grafana, Datadog)

Familiarity with cloud platforms (AWS or others) and cloud-native architectures

Experience in incident management, disaster recovery, and high availability strategies

Experience fostering SRE best practices within an organization

Proficiency in programming languages such as Go, Python, or Bash for automation and tooling development

Contributions to CNCF projects or open-source communities

Collaboration with global teams in an agile and technically driven environment

Hands-on experience with large-scale distributed systems and cutting-edge cloud-native technologies

  • Chennai, Tamil Nadu, India Ford Global Career Site Full time ₹ 1,04,000 - ₹ 1,30,878 per year

    Be at the Forefront of Mobility's Future: Join Ford as a Site Reliability EngineerEnterprise Technology is the engine driving the future of transportation, and we're looking for a talented Site Reliability Engineer (SRE) to help us redefine mobility. In this role, you'll leverage cutting-edge technology to enhance customer experiences, improve lives, and...


  • Chennai, Tamil Nadu, India Zyoin Group Full time

    Position: Site Reliability Engineer (SRE)Experience: 4 – 10 YearsLocation: Chennai (Hybrid – 2 days in office)Role Overview:We are seeking a Site Reliability Engineer (SRE) responsible for leading reliability practices, ensuring scalable systems, and collaborating with development teams to maintain highly available services.Key Responsibilities- Design,...


  • Chennai, Tamil Nadu, India FIS Full time US$ 1,00,000 - US$ 1,50,000 per year

    Position Type :Full timeType Of Hire :Experienced (relevant combo of work and education)Education Desired :Bachelor of Computer EngineeringTravel Percentage :0%Site Reliability Engineer (SRE) with Mainframe TechnologiesAre you curious, motivated, and forward-thinking? At FIS you'll have the opportunity to work on some of the most challenging and relevant...


  • Chennai, Tamil Nadu, India Zyoin Group Full time

    Position: Site Reliability Engineer (SRE) Experience: 4 – 10 Years Location: Chennai (Hybrid – 2 days in office) Role Overview: We are seeking a Site Reliability Engineer (SRE) responsible for leading reliability practices, ensuring scalable systems, and collaborating with development teams to maintain highly available services. Key Responsibilities ...


  • Chennai, Tamil Nadu, India beBeeReliability Full time ₹ 2,00,00,000 - ₹ 2,50,00,000

    Job Overview:We are seeking an experienced Site Reliability Engineering Lead to oversee the reliability, scalability, and performance of our systems.As a Site Reliability Engineering Lead, you will establish and implement SRE practices, lead a team of engineers, and drive automation, monitoring, and incident response strategies.This position combines...


  • Chennai, Tamil Nadu, India ti Steps Full time US$ 60,000 - US$ 1,20,000 per year

    Site Reliability Engineering (SRE) InternJob Description:Support the SRE team in ensuring the reliability, scalability, and performance of production systems. Learn incident response, monitoring, and automation techniques.Key Responsibilities:Monitor system health and respond to alerts.Help improve system reliability through automation.Participate in root...


  • Chennai, Tamil Nadu, India Grootan Technologies Full time US$ 90,000 - US$ 1,20,000 per year

    About the RoleWe are seeking a skilled Site Reliability Engineer (SRE) with 4 to 5 years of hands-on experience to join our engineering team. In this role, you will be responsible for building and maintaining reliable, scalable, and secure infrastructure to support our applications. You will leverage your expertise in automation, cloud platforms, and...


  • Chennai, Tamil Nadu, India beBeeDevops Full time ₹ 12,00,000 - ₹ 24,00,000

    Job Title:DevOps Engineer with Site Reliability Engineering


  • Chennai, Tamil Nadu, India Zyoin Group Full time

    Job Description Exp : 4- 10 Years Location : Chennai Work Mode: Hybrid (2 days Office) We are looking for a Site Reliability Engineer (SREs) who will lead the Site Reliability Engineering(SRE) side of each of our products. This position is responsible for making technical decisions, collaborating with development teams and platform engineers, and building...


  • Chennai, Tamil Nadu, India Zyoin Group Full time

    Job DescriptionExp : 4- 10 Years Location : Chennai Work Mode: Hybrid (2 days Office)We are looking for a Site Reliability Engineer (SREs) who will lead the Site Reliability Engineering(SRE) side of each of our products. This position is responsible for making technical decisions, collaborating with development teams and platform engineers, and building and...