Lead Site Reliability Engineer
20 hours ago
Job Title:
Site Reliability Engineering (SRE) Lead
Location:
Hinjewadi Phase-1 (WFO)
Experience :
7+ years of experience
Shift Time
: 11:00 AM to 8:00 PM
Working
Days
: Monday to Friday
Notice Period:
Immediate to 15 Days preferred
About the Role
We are seeking a highly skilled and experienced
SRE Lead
to drive the reliability, scalability, and performance of our multi-cloud infrastructure spanning AWS and Azure. You will lead a team responsible for building and maintaining automated deployment pipelines, infrastructure as code, and observability systems using GitHub Actions, Terraform, and Datadog.
As the SRE Leader, you will collaborate closely with development, operations, and security teams to ensure our services are highly available, secure, and performant, while fostering a culture of automation, monitoring, and continuous improvement.
Key Responsibilities
- Lead and mentor a team of SRE engineers to design, build, and maintain reliable, scalable, and secure cloud infrastructure across AWS and Azure.
- Architect and implement Infrastructure as Code (IaC) solutions primarily using Terraform to manage multi-cloud environments efficiently.
- Develop, maintain, and optimize CI/CD pipelines leveraging GitHub Actions to enable fast and reliable software delivery.
- Establish and drive best practices in site reliability, monitoring, alerting, and incident response using Datadog and other observability tools.
- Collaborate with software engineering teams to improve system reliability through automation, load testing, and performance tuning.
- Define and track SLOs, SLIs, and error budgets; lead incident retrospectives and continuous improvement initiatives.
- Manage cloud resource costs and optimize usage across multiple cloud providers.
- Promote a DevOps culture emphasizing automation, continuous deployment, and proactive incident management.
- Stay current with the latest industry trends and technologies in cloud, automation, and SRE practices.
Required Skills
- 7+ years of experience in Site Reliability Engineering, DevOps, or cloud infrastructure roles.
- Implement dashboards to monitor and track SLOs, SLIs, and error budgets; lead incident retrospectives and continuous improvement initiatives.
- Proven experience leading and mentoring engineering teams.
- Strong hands-on experience with AWS and Azure cloud platforms.
- Expert in Infrastructure as Code using Terraform with multi-cloud deployments.
- Proficient in building and managing CI/CD pipelines using GitHub Actions.
- Deep knowledge of monitoring and observability tools, especially Datadog.
- Solid understanding of networking, security, container orchestration (Kubernetes is a plus), and cloud-native architectures.
- Strong scripting and automation skills (Python, Bash, or similar).
- Experience with incident management, root cause analysis, and capacity planning.
- Excellent communication, leadership, and collaboration skills.
Technical Skills
- IAC:
Terraform - CICD :
Git Action, Git workflow and ArgoCD - Observability:
Datadog, Prometheus and Fluent bit - POD Orchestration: EKS and EKS Faregate
- Cloud :
AWS and Azzure
Preferred
- Certifications such as AWS Certified DevOps Engineer, Azure DevOps Engineer, or HashiCorp Terraform Associate.
- Experience with Kubernetes and service mesh technologies.
- Familiarity with chaos engineering and resilience testing.
- Knowledge of security best practices in cloud environments.
-
Site Reliability Engineer
2 weeks ago
Pune, Maharashtra, India Ather Energy Full time ₹ 6,00,000 - ₹ 18,00,000 per yearYou'll be our: Site Reliability EngineerYou'll be based at: Pune Zonal OfficeYou'll be aligned with: Cloud and Data Platform Lead / Cloud ArchitectYou'll be a member of: Cloud and Data Platform TeamAther's fleet of smart scooters is growing rapidly, and so is the volume of data they generate. Our Vehicle Data Platform (VDP) is the core of this ecosystem, and...
-
Site Reliability Engineer
6 days ago
Pune, Maharashtra, India NielsenIQ Full time ₹ 12,00,000 - ₹ 24,00,000 per yearSite Reliability Engineer - Cloud Computing Engineering - T6 Job Description Senior Site Reliability Engineer, Pune At NielsenIQ Digital Shelf, we help the world's leading brands measure and improve their online performance. Formerly known as Data Impact, we've recently joined NielsenIQ. Today, we operate at the intersection of scale and agility — a...
-
Site Reliability Engineer
1 week ago
Pune, Maharashtra, India Fiserv Full time ₹ 8,00,000 - ₹ 24,00,000 per yearSite Reliability EngineerExp. Range-8 to14 YearsWhat does a successful Site Reliability Engineer (SRE) Expert do at Fiserv?The Site reliability engineer blends the principles of software engineering with the discipline of operations to create high-performing and reliable software systems. They are tasked with designing and implementing tools, processes, and...
-
Site Reliability Engineer
5 days ago
Pune, Maharashtra, India ENGEL Full time ₹ 6,00,000 - ₹ 18,00,000 per yearCompany DescriptionENGEL is a global leader in the production of injection moulding machines and their automation. The company produces systems that manufacture plastic parts used in various industries such as automotive, packaging, and consumer goods. With nine production plants worldwide and subsidiaries and representatives in over 85 countries, ENGEL...
-
SRE (Site Reliability Engineer)
1 week ago
Pune, Maharashtra, India Apex One Full time ₹ 6,00,000 - ₹ 18,00,000 per yearJob Overview We are looking for a detail-oriented and experienced Site Reliability Engineer to join our team. The Site Reliability Engineer will be responsible for creating and implementing scalable software solutions in order to meet system and application performance goals. You will also be responsible for troubleshooting system errors and resolving any...
-
Site Reliability Engineering
6 days ago
Pune, Maharashtra, India Amadeus Full time ₹ 12,00,000 - ₹ 36,00,000 per yearJob TitleSite Reliability Engineering (SRE) Manager – iHotelierRole OverviewAs an SRE Manager for iHotelier, you will lead a team responsible for ensuring the availability, scalability, and performance of mission-critical hospitality services. This role combines technical leadership, operational excellence, and strategic planning to deliver a seamless...
-
Site Reliability Engineer
3 days ago
Pune, Maharashtra, India FICO Full time ₹ 4,00,000 - ₹ 12,00,000 per yearSite Reliability Engineering-Engineer II FICO (NYSE: FICO) is a leading global analytics software company, helping businesses in 100+ countries make better decisions. Join our world-class team today and fulfill your career potential The Opportunity "The Site Reliability Engineering group is a global team responsible for providing 24x7 operational...
-
Site Reliability Engineer
5 days ago
Pune, Maharashtra, India Amdocs Full time ₹ 9,00,000 - ₹ 12,00,000 per yearJob ID: Required Travel :Minimal Managerial - No Location: :India- Pune (Amdocs Site) In one sentence As the SRE Lead, you will be responsible for the reliability & operational excellence of the amAIz (Telco Agentic Suite). You will lead a cross-functional team NFT, QA & DevOps Engineers, driving best practices in observability, automation, performance...
-
Site Reliability Engineer
5 days ago
Pune, Maharashtra, India Amdocs Full time ₹ 12,00,000 - ₹ 36,00,000 per yearJob ID: 205406Required Travel : MinimalManagerial - NoLocation: :India- Pune (Amdocs Site)In one sentenceAs the SRE Lead, you will be responsible for the reliability & operational excellence of the amAIz (Telco Agentic Suite). You will lead a cross-functional team NFT, QA & DevOps Engineers, driving best practices in observability, automation, performance...
-
Site Reliability Engineer
3 days ago
Pune, Maharashtra, India UBS Full time ₹ 10,00,000 - ₹ 25,00,000 per yearIndiaInformation Technology (IT)Group FunctionsJob Reference #319274BRCityPuneJob TypeFull TimeYour roleAre you an analytic thinker?Do you enjoy Site Reliability Engineering initiatives and proactive problem management across on-premises & Cloud Database ensuring high availability & stability of Database infrastructure services?Do you want to play a key role...