Site Reliability Engineer
4 hours ago
About the Role
We are seeking a skilled Site Reliability Engineer (SRE) with 4–5 years of hands-on experience to join our engineering team. In this role, you will be responsible for building and maintaining reliable, scalable, and secure infrastructure to support our applications. You will leverage your expertise in automation, cloud platforms, and monitoring to ensure system availability, performance, and resilience.
Key Responsibilities
- Design, implement, and maintain Infrastructure as Code (IaC) using Terraform.
- Manage and optimize workloads deployed on Kubernetes (K8s) and containerized environments (Docker, Helm, etc.).
- Configure, administer, and troubleshoot Linux-based systems; write automation scripts using Bash/Shell scripting.
- Deploy, manage, and secure workloads in Azure Cloud environments, leveraging PaaS, IaaS, and managed services.
- Build and optimize CI/CD pipelines using GitHub Actions for automated deployments and testing.
- Implement, configure, and maintain robust monitoring and alerting systems using Grafana and Azure-native monitoring tools.
- Collaborate with developers and architects to improve application reliability, scalability, and performance.
- Proactively identify and resolve reliability and performance issues across distributed systems.
- Participate in on-call rotations to support production systems and respond to incidents.
Required Skills & Qualifications
- 4–5 years of experience in Site Reliability Engineering, DevOps, or Cloud Infrastructure roles.
- Strong expertise in Terraform and Infrastructure as Code principles.
- Hands-on experience with Kubernetes, containerization, and orchestration tools.
- Proficiency in Linux system administration and Bash/Shell scripting.
- Solid knowledge of Azure Cloud services (networking, compute, storage, monitoring, security).
- Experience designing and maintaining CI/CD pipelines with GitHub Actions.
- Strong understanding of monitoring, alerting, and observability tools (Grafana, Prometheus, Azure Monitor, etc.).
- Familiarity with incident response, troubleshooting, and root cause analysis in distributed systems.
- Excellent problem-solving, collaboration, and communication skills.
-
Site Reliability Engineer
4 hours ago
Chennai, Tamil Nadu, India
Tata Consultancy Services
Full time
Role: Site Reliability Engineer
Location: Chennai/Bangalore/Hyderabad
Experience: 5-11 years
1. Exposure to any APM tool such as Dynatrace, AppDynamics, Splunk, etc.
2. DBA or infra admin
3. Gremlin, Chaos Monkey, Simian Army, or Litmus expertise
4. Exposure to ITSM tools such as ServiceNow, etc.
5. Understanding of automation and chaos engineering
6. Exposure to DevOps tools and...
-
Site Reliability Engineer
4 hours ago
Chennai, Tamil Nadu, India
Datum Technologies Group
Full time
Job Title: Site Reliability Engineer (SRE) – AWS
Experience: 8+ years
Location: Chennai / Mumbai
Work Mode: Hybrid
Key Skills: AWS, Terraform, Kubernetes, Docker, Grafana, Prometheus, Datadog
Job Summary: We are looking for a skilled Site Reliability Engineer (SRE) with strong AWS experience and a solid background in DevOps, automation, observability, and...
-
AWS Site Reliability Engineer
4 hours ago
Chennai, Tamil Nadu, India
HTC Global Services
Full time
HTC – A brief profile
Established in 1990, HTC Inc., a company with headquarters in Troy, Michigan, is a leading global Information Technology solution and BPO provider. HTC assists clients across multiple industry verticals, offering turnkey project lifecycle in e-business, data warehousing, embedded systems, ECM, SCM, CRM, and ERP solutions. HTC Inc....
-
Sr. Site Reliability Engineer
4 hours ago
Chennai, Tamil Nadu, India
Datum Technologies Group
Full time
Job Details:
Job Title: Sr. Site Reliability Engineer (SRE)
Duration: Contract to Hire (On the Payroll of Datum Technology Group)
Location: Chennai || Mumbai || Gurugram
Interview Process: Virtual (2 Rounds) + 1 Technical screening.
Job Description: We are seeking a highly skilled Senior Site Reliability Engineer (SRE) to enhance reliability, scalability, and...
-
Site Reliability Engineer
1 week ago
Chennai, Tamil Nadu, India
GSR Business Services
Full time
₹ 6,00,000 - ₹ 12,00,000 per year
Dear Aspirants,
Urgent Hiring
Site Reliability Engineer
3-5 Years
Chennai
Role Summary: Supports the reliability and performance of systems and infrastructure. Assists in monitoring, troubleshooting, and automating tasks to maintain high-availability environments.
Key Responsibilities:
- Assist in managing VMware and Linux servers.
- Monitor system health and respond to...
-
Lead Site Reliability Engineer
3 days ago
Chennai, Tamil Nadu, India
Datum Technologies Group
Full time
₹ 18,00,000 - ₹ 22,00,000 per year
Job Details:
Job Title: Lead Site Reliability Engineer (SRE)
Duration: Contract to Hire (On the Payroll of Datum Technology Group)
Location: Chennai || Mumbai || Gurugram
Interview Process: Virtual (2 Rounds) + 1 Technical screening.
Job Description: We are seeking a highly skilled and experienced Lead Site Reliability Engineer (SRE) to drive reliability,...
-
Sr. Site Reliability Engineer
3 days ago
Chennai, Tamil Nadu, India
Datum Technologies Group
Full time
₹ 3,00,000 - ₹ 4,50,000 per year
Job Details:
Job Title: Sr. Site Reliability Engineer (SRE)
Duration: Contract to Hire (On the Payroll of Datum Technology Group)
Location: Chennai || Mumbai || Gurugram
Interview Process: Virtual (2 Rounds) + 1 Technical screening.
Job Description: We are seeking a highly skilled Senior Site Reliability Engineer (SRE) to enhance reliability, scalability, and...
-
Site Reliability Engineer
1 week ago
Chennai, Tamil Nadu, India
Flex
Full time
₹ 8,00,000 - ₹ 24,00,000 per year
Experience: 3.5 to 7 years
Location: Chennai
Work mode: Hybrid
Role Overview: As a Site Reliability Engineer (SRE) on the Factory Applications team, you will help maintain and scale "Brix" - a cloud-native, containerized, microservices-based platform used to build global shop floor systems. Your focus will be on automation, reliability, and performance.
Key...
-
Site Reliability Engineer, AVP
6 days ago
Chennai, Tamil Nadu, India
NatWest Group
Full time
₹ 12,00,000 - ₹ 36,00,000 per year
Join us as a Site Reliability Engineer
- You'll manage the provision of stable, resilient, reliable applications with the end goal of minimising disruption to Customer & Colleague Journeys (CCJ)
- We'll look to you to identify and automate manual tasks and implement observability solutions, ensuring a thorough understanding of CCJ across applications
- This is a...
-
AWS Site Reliability Engineer
6 days ago
Chennai, Tamil Nadu, India
HTC Global Services
Full time
₹ 12,00,000 - ₹ 36,00,000 per year
HTC – A brief profile
Established in 1990, HTC Inc., a company with headquarters in Troy, Michigan, is a leading global Information Technology solution and BPO provider. HTC assists clients across multiple industry verticals, offering turnkey project lifecycle in e-business, data warehousing, embedded systems, ECM, SCM, CRM, and ERP solutions. HTC Inc....