Lead Site Reliability Engineer
5 days ago
Job Title: Site Reliability Engineering (SRE) Lead
Location: Hinjewadi Phase-1 (WFO)
Experience: 7+ years
Shift Time: 11:00 AM to 8:00 PM
Working Days: Monday to Friday

About the Role
We are seeking a highly skilled and experienced SRE Lead to drive the reliability, scalability, and performance of our multi-cloud infrastructure spanning AWS and Azure. You will lead a team responsible for building and maintaining automated deployment pipelines, infrastructure as code, and observability systems using GitHub Actions, Terraform, and Datadog.
As the SRE Lead, you will collaborate closely with development, operations, and security teams to ensure our services are highly available, secure, and performant, while fostering a culture of automation, monitoring, and continuous improvement.

Key Responsibilities
Lead and mentor a team of SRE engineers to design, build, and maintain reliable, scalable, and secure cloud infrastructure across AWS and Azure.
Architect and implement Infrastructure as Code (IaC) solutions, primarily using Terraform, to manage multi-cloud environments efficiently.
Develop, maintain, and optimize CI/CD pipelines leveraging GitHub Actions to enable fast and reliable software delivery.
Establish and drive best practices in site reliability, monitoring, alerting, and incident response using Datadog and other observability tools.
Collaborate with software engineering teams to improve system reliability through automation, load testing, and performance tuning.
Define and track SLOs, SLIs, and error budgets; lead incident retrospectives and continuous improvement initiatives.
Manage cloud resource costs and optimize usage across multiple cloud providers.
Promote a DevOps culture emphasizing automation, continuous deployment, and proactive incident management.
Stay current with the latest industry trends and technologies in cloud, automation, and SRE practices.

Required Skills
7+ years of experience in Site Reliability Engineering, DevOps, or cloud infrastructure roles.
Experience implementing dashboards to monitor and track SLOs, SLIs, and error budgets, and leading incident retrospectives and continuous improvement initiatives.
Proven experience leading and mentoring engineering teams.
Strong hands-on experience with AWS and Azure cloud platforms.
Expert in Infrastructure as Code using Terraform with multi-cloud deployments.
Proficient in building and managing CI/CD pipelines using GitHub Actions.
Deep knowledge of monitoring and observability tools, especially Datadog.
Solid understanding of networking, security, container orchestration (Kubernetes is a plus), and cloud-native architectures.
Strong scripting and automation skills (Python, Bash, or similar).
Experience with incident management, root cause analysis, and capacity planning.
Excellent communication, leadership, and collaboration skills.

Technical Skills
IaC: Terraform
CI/CD: GitHub Actions, Git workflow, and ArgoCD
Observability: Datadog, Prometheus, and Fluent Bit
Pod Orchestration: EKS and EKS Fargate
Cloud: AWS and Azure

Preferred
Certifications such as AWS Certified DevOps Engineer, Azure DevOps Engineer, or HashiCorp Terraform Associate.
Experience with Kubernetes and service mesh technologies.
Familiarity with chaos engineering and resilience testing.
Knowledge of security best practices in cloud environments.
-
Lead Site Reliability Engineer
5 days ago
Delhi, India Futurism Technologies, INC. Full time
Job Title: Site Reliability Engineering (SRE) Lead. Location: Hinjewadi Phase-1 (WFO). Experience: 7+ years. Shift Time: 11:00 AM to 8:00 PM. Working Days: Monday to Friday. About the Role: We are seeking a highly skilled and experienced SRE Lead to drive the reliability, scalability, and performance of our multi-cloud infrastructure spanning AWS...
-
Lead Site Reliability Engineer
3 days ago
New Delhi, India Futurism Technologies, INC. Full time
Job Title: Site Reliability Engineering (SRE) Lead. Location: Hinjewadi Phase-1 (WFO). Experience: 7+ years. Shift Time: 11:00 AM to 8:00 PM. Working Days: Monday to Friday. About the Role: We are seeking a highly skilled and experienced SRE Lead to drive the reliability, scalability, and performance of our multi-cloud infrastructure spanning AWS and...
-
Lead Site Reliability Engineer
1 day ago
New Delhi, India Futurism Technologies, INC. Full time
Job Title: Site Reliability Engineering (SRE) Lead. Location: Hinjewadi Phase-1 (WFO). Experience: 7+ years. Shift Time: 11:00 AM to 8:00 PM. Working Days: Monday to Friday. About the Role: We are seeking a highly skilled and experienced SRE Lead to drive the reliability, scalability, and performance of our multi-cloud infrastructure spanning AWS and...
-
Site Reliability Engineer
4 weeks ago
New Delhi, India HDFC Limited Full time
Hiring for Lead / Sr Site Reliability Engineer for Mumbai & Bangalore Location. Experience: 8 - 14 Years. Job Purpose: Analysing, troubleshooting, and designing vital services, platforms, and infrastructure on GCP while always thinking about reliability, scalability, resilience, security, and performance. Job Responsibilities: Help build a Site Reliability...
-
Site Reliability Engineer
4 weeks ago
Delhi, India Sonata Software Full time
We're Hiring: Senior Site Reliability Engineer. Location: Onsite (Office: Hyderabad – Mandatory from Day 1). Employment Type: Full-time. Notice Period: Immediate to 15 Days Only. Experience: 8+ Years. About the Role: We’re looking for a Senior Site Reliability Engineer (SRE) to lead reliability initiatives across our production systems. This is a high-impact role where...
-
Site Reliability Engineer
4 weeks ago
Delhi, India Sonata Software Full time
We're Hiring: Senior Site Reliability Engineer. Location: Onsite (Office: Hyderabad – Mandatory from Day 1). Employment Type: Full-time. Notice Period: Immediate to 15 Days Only. Experience: 8+ Years. About the Role: We’re looking for a Senior Site Reliability Engineer (SRE) to lead reliability initiatives across our production systems. This is a high-impact role...
-
Site Reliability Engineer
4 weeks ago
New Delhi, India Endpoint Clinical Full time
About Us: Endpoint is an interactive response technology (IRT®) systems and solutions provider that supports the life sciences industry. Since 2009, we have been working with a single vision in mind, to help sponsors and pharmaceutical companies achieve clinical trial success. Our solutions, realized through the proprietary PULSE® platform, have proven to...
-
Site Reliability Engineer
3 weeks ago
New Delhi, India Endpoint Clinical Full time
About Us: Endpoint is an interactive response technology (IRT®) systems and solutions provider that supports the life sciences industry. Since 2009, we have been working with a single vision in mind, to help sponsors and pharmaceutical companies achieve clinical trial success. Our solutions, realized through the proprietary PULSE® platform, have proven to...
-
Site Reliability Engineering Manager
4 weeks ago
New Delhi, India Tata Consultancy Services Full time
Role: Manager, Site Reliability Engineering. Required Technical Skill Set: Manager, Site Reliability Engineering. Desired Experience Range: 12 - 18 yrs. Notice Period: Immediate to 90 Days only. Location of Requirement: Bangalore. We are currently planning to do a Virtual Interview. Job Description: Describe what the person will do in the role - how he/she will...
-
Site Reliability Engineering Manager
3 weeks ago
New Delhi, India Tata Consultancy Services Full time
Role: Manager, Site Reliability Engineering. Required Technical Skill Set: Manager, Site Reliability Engineering. Desired Experience Range: 12 - 18 yrs. Notice Period: Immediate to 90 Days only. Location of Requirement: Bangalore. We are currently planning to do a Virtual Interview. Job Description: Describe what the person will do in the role - how he/she will impact...