
Site Reliability Engineer Lead
2 weeks ago
Job Title: SRE Lead (Support & Operations)
Job Summary:
We are seeking an experienced and proactive Site Reliability Engineering (SRE) Lead with a
strong background in support operations, service management, and debugging complex
systems built on Java and microservices architecture. This role is crucial in ensuring the
reliability, stability, and efficiency of our critical systems while driving process
improvements, incident management, and cross-functional collaboration. As an SRE Lead,
you will oversee system health, manage escalations, track and ensure ticket closures, follow
up on issues, and enhance support processes to deliver a seamless operational experience.
Experience: 7+ years
Key Responsibilities:
Service Reliability & Operational Excellence:
• Ensure high availability and performance of critical services through proactive
monitoring and issue resolution.
• Define and uphold Service Level Indicators (SLIs) and Service Level Objectives
(SLOs) aligned with business needs.
• Identify recurring operational challenges and implement process improvements to
enhance service reliability.
Incident & Problem Management:
• Lead incident response efforts, ensuring quick resolution and minimal business
impact.
• Establish robust on-call processes and ensure smooth incident handling across teams.
• Conduct post-incident reviews, documenting learnings and driving continuous
improvement initiatives.
• Collaborate with engineering teams to ensure long-term fixes for recurring incidents.
• Possess strong debugging skills and the ability to analyze and resolve complex issues.
Support & Escalation Management:
• Act as the primary point of contact for major incidents, working with cross-functional
teams to resolve issues.
• Manage support escalations efficiently, ensuring timely communication and
resolution.
• Track and ensure timely closure of support tickets and incidents.
• Follow up on pending issues to drive resolution and prevent recurring problems.
• Develop and enhance support playbooks and standard operating procedures (SOPs).
• Foster a culture of accountability and knowledge sharing within the team.
Collaboration & Stakeholder Management:
• Work closely with development, infrastructure, and business teams to align
operational goals.
• Ensure seamless communication between engineering teams, customer support, and
leadership.
• Provide regular updates on system health, incidents, and improvements to
stakeholders.
• Advocate for operational needs in engineering and product discussions.
Process Improvement & Automation:
• Streamline support workflows and implement best practices for efficient issue
resolution.
• Drive automation initiatives to reduce manual operational tasks and improve response
times.
• Ensure documentation and knowledge management practices are maintained
effectively.
Leadership & Team Development:
• Mentor and support a team of SREs, fostering a culture of reliability and operational
excellence.
• Promote a customer-first mindset within the team.
• Encourage collaboration, learning, and professional growth among team members.
Skills & Qualifications:
Required Skills:
• Strong experience in IT operations, support, or service reliability roles.
• Proven track record in incident management, troubleshooting, and root cause analysis.
• Strong Java knowledge with an understanding of microservices architecture.
• Experience with monitoring and alerting tools (e.g., Grafana, Prometheus, New Relic,
or similar).
• Familiarity with Kubernetes and cloud-based environments (AWS, Azure, GCP).
• Familiarity with ITIL practices and service management methodologies.
• Strong communication and stakeholder management skills.
• Ability to manage escalations effectively and ensure timely issue resolution.
• Strong skills in tracking support issues, ensuring ticket closures, and following up on
action items.
Preferred Qualifications:
• Prior experience in an SRE, IT operations, or support leadership role.
• Knowledge of ticketing and ITSM tools (e.g., ServiceNow, Jira Service Management,
or similar).
• Understanding of compliance, security, and best practices in support operations.
• Exposure to automation and process improvement initiatives.
-
Site Reliability Engineer
1 week ago
Bengaluru, Karnataka, India NatWest Group Full time ₹ 15,00,000 - ₹ 25,00,000 per yearSite Reliability Engineer, AVP Join us as a Site Reliability EngineerYou'll manage the provision of stable, resilient, reliable applications with the end goal of minimising disruption to Customer & Colleague Journeys (CCJ) We'll look to you to identify and automate manual tasks and implement observability solutions, ensuring a thorough understanding of...
-
Site Reliability Engineer
4 days ago
Bengaluru, Karnataka, India Visa Inc. Full time ₹ 1,20,000 - ₹ 3,00,000 per yearJob Description We are seeking an accomplished Site Reliability Engineer (SRE) Sr Consultant to join our dynamic Observability team. In this senior role, you will provide technical leadership in developing and maintaining reliable, secure, and cost-effective observability solutions that support our global operations. As the Sr. consultant SRE, you will...
-
Site Reliability Engineer
7 days ago
Bengaluru, Karnataka, India Chevron Full time ₹ 20,00,000 - ₹ 25,00,000 per yearTotal Number of Openings2About the position:Come join our Subsurface Digital Platform where we are driving continuous innovations to improve reliability, scalability and sustainability of Chevron business via Chevron's Digital Transformation. We are seeking a T-shaped dynamic Senior Site Reliability Engineer to lead and provide end-to-end solution support...
-
Site Reliability Engineer
6 days ago
Bengaluru, Karnataka, India NatWest Group Full time ₹ 20,00,000 - ₹ 25,00,000 per yearSite Reliability Engineer,VP Join us as a Site Reliability EngineerIn this key role, you'll improve, drive, and embed non-functional and operational characteristics such as availability, performance, efficiency, change management, monitoring, security, incident response, and capacity planning of our products and services You'll enjoy significant...
-
Site Reliability Engineer
2 weeks ago
Bengaluru, Karnataka, India Programming Full time ₹ 10,00,000 - ₹ 25,00,000 per yearRole - Site Reliability Engineering.Location - BengaluruYears of Expereince - 4+ YearsProfessional & Technical Skills:Must To Have Skills: Proficiency in Site Reliability Engineering.Good To Have Skills: Experience with cloud service providers such as AWS, Azure, or Google Cloud.Strong understanding of CI/CD tools and practices.Experience with container...
-
Site Reliability Engineering
2 weeks ago
Bengaluru, Karnataka, India Booking Holdings Full time ₹ 12,00,000 - ₹ 36,00,000 per yearRole Description:Engineering Manager - Site Reliability - Private CloudOur mission at is to create transformative, innovative, and personalized travel experiences for millions of customers all across the world. We want customers to have an amazing experience wherever and whenever they choose: mobile, web, and through partners and 3rd parties.About the team...
-
Site Reliability Engineer
2 weeks ago
Bengaluru, Karnataka, India FOSS United Full time ₹ 12,00,000 - ₹ 36,00,000 per yearAll JobsSite Reliability Engineer at ZEISS IndiaSite Reliability EngineerApplyPosted on September 11, 2025ZEISS IndiaKadubeesanahalli, BengaluruFull TImeJob DescriptionZEISS in IndiaZEISS in India is headquartered in Bengaluru and present in the fields of Industrial Quality Solutions, Research Microscopy Solutions, Medical Technology, Vision Care and Sports...
-
Lead Site Reliability Engineer
2 days ago
Bengaluru, Karnataka, India Jobs via eFinancialCareers Full time ₹ 8,00,000 - ₹ 24,00,000 per yearSenior Engineer, Site Reliability EngineeringOur TeamWe are evolving our Reliability Engineering team to move beyond support and operations. As a Senior Engineer in Site Reliability, you will be part of a diverse and inclusive organization that has full ownership of the availability, performance, and scalability of one of the most critical shared services at...
-
Site Reliability Engineer
1 week ago
Bengaluru, Karnataka, India, Karnataka HDFC Limited Full timeHiring for Lead / Sr Site Reliability Engineer for Mumbai & Bangalore LocationExperience - 8 - 14 Years Job PurposeAnalysing, troubleshooting, and designing vital services, platforms, and infrastructure on GCP while always thinking about reliability, scalability, resilience, security, and performance. Job Responsibilities: Help build a Site Reliability...
-
Site Reliability Engineer
2 weeks ago
Bengaluru, Karnataka, India WOW Softech Full time ₹ 1,04,000 - ₹ 1,30,878 per yearJob Title: SRE Lead (Engineering & Reliability)Job Summary:We are seeking an experienced and dynamic Site Reliability Engineering (SRE) Lead tooversee the reliability, scalability, and performance of our critical systems. As an SRE Lead,you will play a pivotal role in establishing and implementing SRE practices, leading a teamof engineers, and driving...