Lead Site Reliability Engineer

2 weeks ago


Chennai, Tamil Nadu, India Trimble Full time
Job Description

Lead Site Reliability Engineer

Reporting to:Sr Manager, Availability Management

Office Location:Chennai, India

Flexible Working:Hybrid (Part Office/Part Home)

Cloud Site Reliability Engineer Responsibilities

- On-board internal customers to our 24x7 Applications Support and Enterprise Status Page services
- Be involved with creating an SRE culture globally by defining monitoring strategies and best practices at the organization.
- Monitor application performance and have the ability to provide recommendations on increasing the observability of applications and platforms.
- Play an important role in the Continual Service Improvement process, identifying and driving improvement
- Be instrumental to developing standards, guides to assist the business in maximizing their use of common tools .
- Participate in code peer reviews and enforce quality gates to ensure best practices are followed.
- Apply automation to tasks which would benefit from this. Automating repetitive tasks and deploying monitors via code are core examples.
- Document knowledge gained from engagements in the forms of runbooks and other information critical to incident response.
- Exploring and applying Artificial Intelligence to enhance operational processes/procedures

Should-Haves - Skills & Experience

- Strong skills with modern monitoring tools and demonstrable knowledge of APM, RUM and/or synthetic testing.
- Experience working with observability tools such as Datadog, NewRelic, Splunk, CloudWatch, AzureMonitor
- Experience with the OpenTelemetry (OTEL) Standard
- Working knowledge of at least one programming language, such as Python, JavaScript (NodeJS, etc), Golang or others.
- Strong experience with IaC tools, such as Terraform and Cloudformation.
- Experience with cloud environments, especially AWS and/or Azure.
- Good customer interaction skills and able to understand their needs and expectations.
- Strength in conviction, able to encourage adoption to a wide audience but comfortable with mandating where necessary
- Experience with code quality tools, such as SonarQube.
- Knowledge on code linters tools of various programming languages.
- Experience with CI/CD tools. Such as Bamboo, Jenkins, Azure DevOps, Github actions.
- ITIL experience with basic understanding on incident management, problem management and change management.

Nice-to-Haves - Skills & Experience

- Any cloud certification
- ITIL certifications
- Experience with ITSM tools
- Experience using On-Call Management Tooling

  • Chennai, Tamil Nadu, India Zyoin Group Full time

    Position: Site Reliability Engineer (SRE)Experience: 4 – 10 YearsLocation: Chennai (Hybrid – 2 days in office)Role Overview:We are seeking a Site Reliability Engineer (SRE) responsible for leading reliability practices, ensuring scalable systems, and collaborating with development teams to maintain highly available services.Key Responsibilities- Design,...


  • Chennai, Tamil Nadu, India Zyoin Group Full time

    Position: Site Reliability Engineer (SRE) Experience: 4 – 10 Years Location: Chennai (Hybrid – 2 days in office) Role Overview: We are seeking a Site Reliability Engineer (SRE) responsible for leading reliability practices, ensuring scalable systems, and collaborating with development teams to maintain highly available services. Key Responsibilities ...


  • Chennai, Tamil Nadu, India Horizon56 Full time US$ 90,000 - US$ 1,20,000 per year

    We are seeking an experienced and dynamic Site Reliability Engineering Lead to oversee the reliability, scalability, and performance of our critical systems. In this role, you will lead a team of Technical Support Engineers, managing both day-to-day operations and a 24/7 shift schedule. You will collaborate with cross-functional teams to ensure system...


  • Chennai, Tamil Nadu, India beBeeReliability Full time ₹ 15,00,000 - ₹ 25,00,000

    Job Description:About the Role:We are seeking a highly skilled Site Reliability Engineer (SRE) to join our team. This position is responsible for leading the SRE side of our products, making technical decisions, and collaborating with development teams and platform engineers.This role involves quantitatively measuring and managing system reliability,...


  • Chennai, Tamil Nadu, India Zyoin Group Full time

    Work Mode: Hybrid (2 days Office)We are looking for a Site Reliability Engineer (SREs) who will lead the Site Reliability Engineering(SRE) side of each of our products. This position is responsible for making technical decisions, collaborating with development teams and platform engineers, and building and operating highly reliable and scalable products....


  • Chennai, Tamil Nadu, India Zyoin Group Full time

    Job Description Exp : 4- 10 Years Location : Chennai Work Mode: Hybrid (2 days Office) We are looking for a Site Reliability Engineer (SREs) who will lead the Site Reliability Engineering(SRE) side of each of our products. This position is responsible for making technical decisions, collaborating with development teams and platform engineers, and building...


  • Chennai, Tamil Nadu, India Zyoin Group Full time

    Job DescriptionExp : 4- 10 Years Location : Chennai Work Mode: Hybrid (2 days Office)We are looking for a Site Reliability Engineer (SREs) who will lead the Site Reliability Engineering(SRE) side of each of our products. This position is responsible for making technical decisions, collaborating with development teams and platform engineers, and building and...


  • Chennai, Tamil Nadu, India Zyoin Group Full time

    Job DescriptionExp : 4- 10 Years Location : Chennai Work Mode: Hybrid (2 days Office)We are looking for a Site Reliability Engineer (SREs) who will lead the Site Reliability Engineering(SRE) side of each of our products. This position is responsible for making technical decisions, collaborating with development teams and platform engineers, and building and...


  • Chennai, Tamil Nadu, India Concord Full time

    SRE Sr. Engineers (Individual Contributors)Key Attributes:Strong SRE (Site Reliability Engineering) experienceDevOps skills – CI/CD, monitoring, automation, infrastructure as code, etc.Excellent troubleshooting and debugging skills (infrastructure + application level)Perseverance – must push through complex/challenging issues without giving upAble to...


  • Chennai, Tamil Nadu, India Intellect Design Arena Full time ₹ 5,00,000 - ₹ 8,00,000 per year

    Job Title: Site Reliability EngineerCompany: Intellect Design Arena LtdLocation: Chennai, IndiaExperience Required: 6+ yearsJob Type: Full-timeDepartment: SRE / DevOps / Engineering EnablementAbout Intellect Design Arena LtdIntellect Design Arena Ltd is a global leader in digital financial technology, offering cutting-edge solutions for banking, insurance,...