Site Reliability Engineer
2 weeks ago
About the Role
We are seeking a skilled Site Reliability Engineer (SRE) with 4–5 years of hands-on experience to join our engineering team. In this role, you will be responsible for building and maintaining reliable, scalable, and secure infrastructure to support our applications. You will leverage your expertise in automation, cloud platforms, and monitoring to ensure system availability, performance, and resilience.
Key Responsibilities
- Design, implement, and maintain Infrastructure as Code (IaC) using Terraform.
- Manage and optimize workloads deployed on Kubernetes (K8s) and containerized environments (Docker, Helm, etc.).
- Configure, administer, and troubleshoot Linux-based systems; write automation scripts using Bash/Shell scripting.
- Deploy, manage, and secure workloads in Azure Cloud environments, leveraging PaaS, IaaS, and managed services.
- Build and optimize CI/CD pipelines using GitHub Actions for automated deployments and testing.
- Implement, configure, and maintain robust monitoring and alerting systems using Grafana and Azure-native monitoring tools.
- Collaborate with developers and architects to improve application reliability, scalability, and performance.
- Proactively identify and resolve reliability and performance issues across distributed systems.
- Participate in on-call rotations to support production systems and respond to incidents.
Required Skills & Qualifications
- 4–5 years of experience in Site Reliability Engineering, DevOps, or Cloud Infrastructure roles.
- Strong expertise in Terraform and Infrastructure as Code principles.
- Hands-on experience with Kubernetes, containerization, and orchestration tools.
- Proficiency in Linux system administration and Bash/Shell scripting.
- Solid knowledge of Azure Cloud services (networking, compute, storage, monitoring, security).
- Experience designing and maintaining CI/CD pipelines with GitHub Actions.
- Strong understanding of monitoring, alerting, and observability tools (Grafana, Prometheus, Azure Monitor, etc.).
- Familiarity with incident response, troubleshooting, and root cause analysis in distributed systems.
- Excellent problem-solving, collaboration, and communication skills.
-
Site Reliability Engineer
1 week ago
Chennai, Tamil Nadu, India. Cortex Consultants. Full time. Job Title: Site Reliability Engineer (SRE). Experience: 6 to 9 years. Location: Chennai. Job Overview: We are seeking a skilled and proactive Site Reliability Engineer (SRE) to join our growing team. As an SRE, you will be responsible for maintaining the reliability, availability, and performance of our systems. We're looking for someone with solid experience...
-
Site Reliability Engineer
5 days ago
Chennai, Tamil Nadu, India. GSR Business Services. Full time. Dear Aspirants, Urgent Hiring: Site Reliability Engineer, 3-5 years, Chennai. Role Summary: Supports the reliability and performance of systems and infrastructure. Assists in monitoring, troubleshooting, and automating tasks to maintain high-availability environments. Key Responsibilities: Assist in managing VMware and Linux servers. Monitor system health and respond to...
-
Site Reliability Engineer
2 days ago
Chennai, Tamil Nadu, India. TECEZE. Full time. Job Title: Site Reliability Engineer (SRE) – Core IT Infrastructure. Location: Chennai / Pune / Bangalore. Company: Teceze. About Teceze: Teceze is a global IT services and consulting organization delivering innovative, scalable, and secure technology solutions. We specialize in infrastructure services, cloud transformation, DevOps, and managed services, helping...
-
Lead Site Reliability Engineer
1 day ago
Chennai, Tamil Nadu, India. Datum Technologies Group. Full time. Job Details: Job Title: Lead Site Reliability Engineer (SRE). Duration: Contract to Hire (on the payroll of Datum Technology Group). Location: Chennai || Mumbai || Gurugram. Interview Process: Virtual (2 rounds) + 1 technical screening. Job Description: We are seeking a highly skilled and experienced Lead Site Reliability Engineer (SRE) to drive reliability,...
-
Site Reliability Engineering
5 days ago
Chennai, Tamil Nadu, India. Chasra Solutions. Full time. Position ID: 34689. Site Reliability Engineering - Chennai (Onsite). Position Description: Employees in this SRE job function are responsible for ensuring availability, reliability and performance of cloud and network systems and services by automating routine manual tasks. (Hands-on software engineers only.) Key Responsibilities: Collaborate with Infrastructure teams...
-
Site Reliability Engineering
3 hours ago
Chennai, Tamil Nadu, India. Umanist Staffing. Full time. Position: Site Reliability Engineering (SRE) Engineer 2, 34689. Location: Chennai – Onsite. Employment Type: Full Time. CTC: 16 LPA. Role Overview: This is a hands-on SRE role focused on ensuring the availability, reliability, and performance of cloud and network systems by automating routine manual tasks. The role requires strong software engineering experience,...
-
Site Reliability Engineer
2 weeks ago
Chennai, Tamil Nadu, India. Ford Motor Company. Full time. ₹ 8,00,000 - ₹ 24,00,000 per year. Job Description: Ford is seeking an experienced Site Reliability Engineer (SRE) to join our team and lead the development, enhancement, and extension of our global monitoring and observability platform. Enterprise Technology plays a critical part in shaping the future of mobility. If you're looking for the chance to leverage advanced technology...
-
Site Reliability Engineer
2 weeks ago
Chennai, Tamil Nadu, India. Proglite. Full time. We have the following requirements for the Site Reliability Engineer role. Skill Set: AWS: EC2, Networking, Storage, autoscaling, CloudWatch, SSM, management (patching/upgrades/security) of OS (Windows/Linux) in EC2. GCP: GKE/Compute, Networking, storage, Cloud Monitoring, management (patching/upgrades/security) of OS (Windows/Linux) in Compute. SRE Practices:...
-
Site Reliability Engineer
7 days ago
Chennai, Tamil Nadu, India. Flex. Full time. Experience: 3.5 to 7 years. Location: Chennai. Work mode: Hybrid. Role Overview: As a Site Reliability Engineer (SRE) on the Factory Applications team, you will help maintain and scale "Brix", a cloud-native, containerized, microservices-based platform used to build global shop floor systems. Your focus will be on automation, reliability, and performance. Key...
-
Site Reliability Engineer, AVP
3 days ago
Chennai, Tamil Nadu, India. NatWest Group. Full time. Join us as a Site Reliability Engineer. You'll manage the provision of stable, resilient, reliable applications with the end goal of minimising disruption to Customer & Colleague Journeys (CCJ). We'll look to you to identify and automate manual tasks and implement observability solutions, ensuring a thorough understanding of CCJ across applications. This is a...