Site Reliability Engineer

5 days ago


Chennai, Tamil Nadu, India NatWest Markets Full time
Job Description

Join us as a Site Reliability Engineer

- You'll be managing the provision of stable, resilient, reliable applications with the end goal of minimising disruption to Customer & Colleague Journeys (CCJ)
- We'll look to you to identify and automate manual tasks and implement observability solutions, ensuring a thorough understanding of CCJ across applications
- This is a great chance to work in a supportive environment with opportunities to advance your personal and career development
- We're offering this role at associate vice president level

What you'll do

As a Site Reliability Engineer, you'll collaborate with feature teams to understand application changes, participate in delivery activities, and address production issues to assist in the delivery of change that does not negatively affect the customer experience. You'll also help to monitor and manage cloud costs, recommending optimisations and cost-saving measures.

You'll be responding to, managing, and resolving incidents in a timely manner, performing root cause analysis and driving improvements to prevent recurrence. As well as this, you'll automate routine operational tasks and cloud infrastructure provisioning using IaC tools.

You'll also be:

- Conducting capacity planning exercises to make sure cloud resources can handle anticipated traffic spikes and growth
- Implementing and maintaining monitoring, logging, and alerting systems to provide insights into the health and performance of cloud infrastructure and applications
- Delivering automation solutions to minimise and eliminate manual tasks associated with maintaining and supporting the applications
- Ensuring an in-depth understanding of the full tech stack on which the application resides and depends
- Identifying alerting and monitoring requirements for an application, based on a sound understanding of customer journeys
- Evaluating the resilience of the end-to-end tech stack on which the applications depend, and addressing weaknesses
- Seeking to reduce the frequency of hand-offs in the end-to-end resolution of customer-impacting incidents

The skills you'll need

To succeed in this role, you'll need experience of supporting live production services serving customer journeys, with a demonstrable knowledge of ITIL processes and IT Security principles, along with tools and techniques to prevent compliance breaches.

On top of this, you'll bring hands-on experience with Azure Cloud and full-stack observability using tools such as Log Analytics, Application Insights, Grafana, CloudWatch, Prometheus and Splunk.

You'll also need:

- Strong verbal and written communication skills
- Strong hands-on experience with cloud platforms including AWS and GCP, and their services such as S3, Lambda and Kubernetes
- Experience of managing production systems and incidents with a focus on minimising downtime and improving system resilience
- Strong troubleshooting skills for cloud infrastructure and application performance issues
- Experience of networking in the cloud and familiarity with Chaos Engineering principles and tools

Hours

45

Job Posting Closing Date:

- 27/05/2025

  • Chennai, Tamil Nadu, India Ford Global Career Site Full time ₹ 1,04,000 - ₹ 1,30,878 per year

    Be at the Forefront of Mobility's Future: Join Ford as a Site Reliability Engineer. Enterprise Technology is the engine driving the future of transportation, and we're looking for a talented Site Reliability Engineer (SRE) to help us redefine mobility. In this role, you'll leverage cutting-edge technology to enhance customer experiences, improve lives, and...


  • Chennai, Tamil Nadu, India Zyoin Group Full time

    Position: Site Reliability Engineer (SRE) Experience: 4 – 10 Years Location: Chennai (Hybrid – 2 days in office) Role Overview: We are seeking a Site Reliability Engineer (SRE) responsible for leading reliability practices, ensuring scalable systems, and collaborating with development teams to maintain highly available services. Key Responsibilities - Design,...


  • Chennai, Tamil Nadu, India FIS Full time US$ 1,00,000 - US$ 1,50,000 per year

    Position Type: Full time Type Of Hire: Experienced (relevant combo of work and education) Education Desired: Bachelor of Computer Engineering Travel Percentage: 0% Site Reliability Engineer (SRE) with Mainframe Technologies. Are you curious, motivated, and forward-thinking? At FIS you'll have the opportunity to work on some of the most challenging and relevant...


  • Chennai, Tamil Nadu, India beBeeReliability Full time ₹ 2,00,00,000 - ₹ 2,50,00,000

    Job Overview: We are seeking an experienced Site Reliability Engineering Lead to oversee the reliability, scalability, and performance of our systems. As a Site Reliability Engineering Lead, you will establish and implement SRE practices, lead a team of engineers, and drive automation, monitoring, and incident response strategies. This position combines...


  • Chennai, Tamil Nadu, India Zyoin Group Full time

    Position: Site Reliability Engineer (SRE) Experience: 4 – 10 Years Location: Chennai (Hybrid – 2 days in office) Role Overview: We are seeking a Site Reliability Engineer (SRE) responsible for leading reliability practices, ensuring scalable systems, and collaborating with development teams to maintain highly available services. Key Responsibilities ...


  • Chennai, Tamil Nadu, India ti Steps Full time US$ 60,000 - US$ 1,20,000 per year

    Site Reliability Engineering (SRE) Intern Job Description: Support the SRE team in ensuring the reliability, scalability, and performance of production systems. Learn incident response, monitoring, and automation techniques. Key Responsibilities: Monitor system health and respond to alerts. Help improve system reliability through automation. Participate in root...


  • Chennai, Tamil Nadu, India Zyoin Group Full time

    Work Mode: Hybrid (2 days Office) We are looking for a Site Reliability Engineer (SRE) who will lead the Site Reliability Engineering (SRE) side of each of our products. This position is responsible for making technical decisions, collaborating with development teams and platform engineers, and building and operating highly reliable and scalable products....


  • Chennai, Tamil Nadu, India Grootan Technologies Full time US$ 90,000 - US$ 1,20,000 per year

    About the Role: We are seeking a skilled Site Reliability Engineer (SRE) with 4 to 5 years of hands-on experience to join our engineering team. In this role, you will be responsible for building and maintaining reliable, scalable, and secure infrastructure to support our applications. You will leverage your expertise in automation, cloud platforms, and...


  • Chennai, Tamil Nadu, India beBeeDevops Full time ₹ 12,00,000 - ₹ 24,00,000

    Job Title: DevOps Engineer with Site Reliability Engineering


  • Chennai, Tamil Nadu, India Zyoin Group Full time

    Job Description Exp: 4 – 10 Years Location: Chennai Work Mode: Hybrid (2 days Office) We are looking for a Site Reliability Engineer (SRE) who will lead the Site Reliability Engineering (SRE) side of each of our products. This position is responsible for making technical decisions, collaborating with development teams and platform engineers, and building...