Lead Site Reliability Engineer
3 weeks ago
Job Title: Site Reliability Engineering (SRE) Lead
Location: Hinjewadi Phase-1 (WFO)
Experience: 7+ years
Shift Time: 11:00 AM to 8:00 PM
Working Days: Monday to Friday

About the Role
We are seeking a highly skilled and experienced SRE Lead to drive the reliability, scalability, and performance of our multi-cloud infrastructure spanning AWS and Azure. You will lead a team responsible for building and maintaining automated deployment pipelines, infrastructure as code, and observability systems using GitHub Actions, Terraform, and Datadog.

As the SRE Lead, you will collaborate closely with development, operations, and security teams to ensure our services are highly available, secure, and performant, while fostering a culture of automation, monitoring, and continuous improvement.

Key Responsibilities
- Lead and mentor a team of SRE engineers to design, build, and maintain reliable, scalable, and secure cloud infrastructure across AWS and Azure.
- Architect and implement Infrastructure as Code (IaC) solutions, primarily using Terraform, to manage multi-cloud environments efficiently.
- Develop, maintain, and optimize CI/CD pipelines using GitHub Actions to enable fast and reliable software delivery.
- Establish and drive best practices in site reliability, monitoring, alerting, and incident response using Datadog and other observability tools.
- Collaborate with software engineering teams to improve system reliability through automation, load testing, and performance tuning.
- Define and track SLOs, SLIs, and error budgets; lead incident retrospectives and continuous improvement initiatives.
- Manage cloud resource costs and optimize usage across multiple cloud providers.
- Promote a DevOps culture emphasizing automation, continuous deployment, and proactive incident management.
- Stay current with the latest industry trends and technologies in cloud, automation, and SRE practices.

Required Skills
- 7+ years of experience in Site Reliability Engineering, DevOps, or cloud infrastructure roles.
- Experience implementing dashboards to monitor and track SLOs, SLIs, and error budgets, and leading incident retrospectives and continuous improvement initiatives.
- Proven experience leading and mentoring engineering teams.
- Strong hands-on experience with AWS and Azure cloud platforms.
- Expertise in Infrastructure as Code using Terraform with multi-cloud deployments.
- Proficiency in building and managing CI/CD pipelines using GitHub Actions.
- Deep knowledge of monitoring and observability tools, especially Datadog.
- Solid understanding of networking, security, container orchestration (Kubernetes is a plus), and cloud-native architectures.
- Strong scripting and automation skills (Python, Bash, or similar).
- Experience with incident management, root cause analysis, and capacity planning.
- Excellent communication, leadership, and collaboration skills.

Technical Skills
- IaC: Terraform
- CI/CD: GitHub Actions, Git workflow, and ArgoCD
- Observability: Datadog, Prometheus, and Fluent Bit
- Pod orchestration: EKS and EKS Fargate
- Cloud: AWS and Azure

Preferred
- Certifications such as AWS Certified DevOps Engineer, Azure DevOps Engineer, or HashiCorp Terraform Associate.
- Experience with Kubernetes and service mesh technologies.
- Familiarity with chaos engineering and resilience testing.
- Knowledge of security best practices in cloud environments.
-
Lead Site Reliability Engineer
2 hours ago
New Delhi, India Datum Technologies Group Full time
Job Details:
Job Title: Lead Site Reliability Engineer (SRE)
Duration: Contract to Hire (On the Payroll of Datum Technology Group)
Location: Chennai / Mumbai / Gurugram
Interview Process: Virtual (2 Rounds) + 1 Technical screening
Job Description: We are seeking a highly skilled and experienced Lead Site Reliability Engineer (SRE) to drive reliability,...
-
Lead Site Reliability Engineer
4 weeks ago
Delhi, India Futurism Technologies, INC. Full time
Job Title: Site Reliability Engineering (SRE) Lead
Location: Hinjewadi Phase-1 (WFO)
Experience: 7+ years
Shift Time: 11:00 AM to 8:00 PM
Working Days: Monday to Friday
About the Role
We are seeking a highly skilled and experienced SRE Lead to drive the reliability, scalability, and performance of our multi-cloud infrastructure spanning AWS and...
-
Site Reliability Engineer
1 week ago
New Delhi, India Datum Technologies Group Full time
Job Title: Site Reliability Engineer (SRE) – AWS
Experience: 8+ years
Location: Chennai / Mumbai
Work Mode: Hybrid
Key Skills: AWS, Terraform, Kubernetes, Docker, Grafana, Prometheus, Datadog
Job Summary: We are looking for a skilled Site Reliability Engineer (SRE) with strong AWS experience and a solid background in DevOps, automation, observability, and...
-
Site Reliability Engineer
3 days ago
New Delhi, India Glocomms Full time
We are currently looking for an SRE Lead to join our customer, an IT consultancy with urgent projects on board. This will initially be a 6-month contract with an option to extend.
Must have 10+ years of experience.
Responsibilities:
- Assess application architecture and implement patterns for reliability and performance.
- Automate workflows and reduce manual...
-
Site Reliability Engineer
5 days ago
New Delhi, India Glocomms Full time
We are currently looking for an SRE Lead to join our customer, an IT consultancy with urgent projects on board. This will initially be a 6-month contract with an option to extend.
Must have 10+ years of experience.
Responsibilities:
- Assess application architecture and implement patterns for reliability and performance.
- Automate workflows and reduce manual...
-
Site Reliability Engineer
3 days ago
New Delhi, India Synechron Full time
We have an immediate opportunity for an SRE (Senior Site Reliability Engineer) with 5 to 9 years of experience. Synechron – Pune
Job Role: SRE (Senior Site Reliability Engineer)
Job Location: Pune
About Synechron
We began life in 2001 as a small, self-funded team of technology specialists. Since then, we’ve grown our organization to 14,500+ people, across 58 offices, in 21...
-
Site Reliability Engineer
3 weeks ago
New Delhi, India Tata Consultancy Services Full time
Role: Site Reliability Engineer
Experience: 4 to 7 Years
Locations: Chennai/Pune/Kolkata
-
Site Reliability Engineer
2 weeks ago
New Delhi, India Datum Technologies Group Full time
Job Title: Site Reliability Engineer (SRE) – Azure & AI
Experience: 7+ years
Work Mode: Hybrid
Work Location: Chennai/Mumbai/Gurgaon
Job Summary: We are looking for an experienced Site Reliability Engineer (SRE) with strong expertise in Microsoft Azure, AI infrastructure, and automation. The ideal candidate will have a solid background in managing cloud...
-
Site Reliability Engineer
2 weeks ago
New Delhi, India Grootan Technologies Full time
About the Role
We are seeking a skilled Site Reliability Engineer (SRE) with 4–5 years of hands-on experience to join our engineering team. In this role, you will be responsible for building and maintaining reliable, scalable, and secure infrastructure to support our applications. You will leverage your expertise in automation, cloud platforms, and...