Lead Site Reliability Engineer

3 weeks ago


Bangalore, India Futurism Technologies, INC. Full time

Job Title: Site Reliability Engineering (SRE) Lead
Location: Hinjewadi Phase-1 (WFO)
Experience: 7+ years of experience
Shift Time: 11:00 AM to 8:00 PM
Working Days: Monday to Friday

About the Role

We are seeking a highly skilled and experienced SRE Lead to drive the reliability, scalability, and performance of our multi-cloud infrastructure spanning AWS and Azure. You will lead a team responsible for building and maintaining automated deployment pipelines, infrastructure as code, and observability systems using GitHub Actions, Terraform, and Datadog.

As the SRE Leader, you will collaborate closely with development, operations, and security teams to ensure our services are highly available, secure, and performant, while fostering a culture of automation, monitoring, and continuous improvement.

Key Responsibilities

  • Lead and mentor a team of SRE engineers to design, build, and maintain reliable, scalable, and secure cloud infrastructure across AWS and Azure.
  • Architect and implement Infrastructure as Code (IaC) solutions primarily using Terraform to manage multi-cloud environments efficiently.
  • Develop, maintain, and optimize CI/CD pipelines leveraging GitHub Actions to enable fast and reliable software delivery.
  • Establish and drive best practices in site reliability, monitoring, alerting, and incident response using Datadog and other observability tools.
  • Collaborate with software engineering teams to improve system reliability through automation, load testing, and performance tuning.
  • Define and track SLOs, SLIs, and error budgets; lead incident retrospectives and continuous improvement initiatives.
  • Manage cloud resource costs and optimize usage across multiple cloud providers.
  • Promote a DevOps culture emphasizing automation, continuous deployment, and proactive incident management.
  • Stay current with the latest industry trends and technologies in cloud, automation, and SRE practices.

Required Skills

  • 7+ years of experience in Site Reliability Engineering, DevOps, or cloud infrastructure roles.
  • Experience implementing dashboards to monitor and track SLOs, SLIs, and error budgets, and leading incident retrospectives and continuous improvement initiatives.
  • Proven experience leading and mentoring engineering teams.
  • Strong hands-on experience with AWS and Azure cloud platforms.
  • Expert in Infrastructure as Code using Terraform with multi-cloud deployments.
  • Proficient in building and managing CI/CD pipelines using GitHub Actions.
  • Deep knowledge of monitoring and observability tools, especially Datadog.
  • Solid understanding of networking, security, container orchestration (Kubernetes is a plus), and cloud-native architectures.
  • Strong scripting and automation skills (Python, Bash, or similar).
  • Experience with incident management, root cause analysis, and capacity planning.
  • Excellent communication, leadership, and collaboration skills.

Technical Skills

  • IaC: Terraform
  • CI/CD: GitHub Actions, Git workflow, and ArgoCD
  • Observability: Datadog, Prometheus, and Fluent Bit
  • Pod Orchestration: EKS and EKS Fargate
  • Cloud: AWS and Azure

Preferred

  • Certifications such as AWS Certified DevOps Engineer, Azure DevOps Engineer, or HashiCorp Terraform Associate.
  • Experience with Kubernetes and service mesh technologies.
  • Familiarity with chaos engineering and resilience testing.
  • Knowledge of security best practices in cloud environments.



  • bangalore, India JPMorganChase Full time

    Description: Assume a critical role in defining the future of a globally recognized firm and have a direct and significant effect in a realm tailored for top achievers in site reliability. As a Lead Site Reliability Engineer at JPMorgan Chase within the Kinexys Cloud Platform on AWS, you will play a pivotal leadership role in your team, showcasing expertise...


  • bangalore, India super Full time

    Site Reliability Engineer (SRE) Level 3. Overview: A Site Reliability Engineer (SRE) Level 3 is a senior technical leadership role focused on designing, implementing, and maintaining large-scale, complex, and highly reliable systems. This role emphasizes a blend of software and systems engineering to ensure the availability, latency, performance, and capacity...


  • bangalore, India Multiplier Technologies Private Limited Full time

    About us: The global hiring revolution is shaping a future where talent can thrive everywhere, driving innovation and progress on a global scale. Multiplier is at the forefront of this change. By removing barriers and simplifying global hiring, we're creating a level playing field where businesses and individuals (like you) can compete, grow, and...


  • bangalore, India Datum Technologies Group Full time

    Job Title: Site Reliability Engineer (SRE) – AWS Experience: 8+ years Location: Chennai / Mumbai Work Mode: Hybrid Key Skills: AWS, Terraform, Kubernetes, Docker, Grafana, Prometheus, Datadog Job Summary: We are looking for a skilled Site Reliability Engineer (SRE) with strong AWS experience and a solid background in DevOps, automation, observability, and...


  • bangalore, India Glocomms Full time

    We are currently looking for an SRE Lead to join our customer, an IT consultancy with urgent projects on board. This will be a 6-month contract initially, with an option to extend further. Must have 10+ years of experience. Responsibilities: - Assess application architecture and implement patterns for reliability and performance. - Automate workflows and reduce manual...


  • Bangalore, India Aqilea (formerly Soltia) Full time

    We are a consulting company with a bunch of technology-interested and happy people! We love technology, we love design and we love quality. Our diversity makes us unique and creates an inclusive and welcoming workplace where each individual is highly valued. With us, each individual is her/himself and respects others for who they are and we believe that when...


  • bangalore, India London Stock Exchange Group Full time

    Senior Engineer, Site Reliability Engineering. Our Team: We are evolving our Reliability Engineering team to move beyond support and operations. As a Senior Engineer in Site Reliability, you will be part of a diverse and inclusive organization that has full ownership of the availability, performance, and scalability of one of the most critical shared services at...


  • Bangalore, India Flipkart Full time

    Hiring Site Reliability Engineers Exp: 2.5+ years (excluding internship) Location: Bangalore Apply Here : The engineer will work in the Reliability and Productivity Engineering team and is responsible for building industry-standard, large-scale platforms to be utilised across FK that help to significantly improve the reliability of systems and bring...


  • bangalore, India Cyberhaven Full time

    About the role: We're looking for an experienced Site Reliability Engineer to make sure systems are reliable, scalable, and performing well, especially in production environments. Our technology is new and rapidly evolving; as an early member of the team, you'll play a key role in shaping the reliability architecture, building scalable infrastructure, and...


  • bangalore, India Aqilea (formerly Soltia) Full time

    We are a consulting company with a bunch of technology-interested and happy people! We love technology, we love design and we love quality. Our diversity makes us unique and creates an inclusive and welcoming workplace where each individual is highly valued. With us, each individual is her/himself and respects others for who they are and we believe that when a...