Director of Site Reliability Engineering

2 weeks ago

Bengaluru, Karnataka, India Five9 Full time US$ 1,50,000 - US$ 2,00,000 per year

Join us in bringing joy to customer experience. Five9 is a leading provider of cloud contact center software, bringing the power of cloud innovation to customers worldwide.

Living our values every day results in our team-first culture and enables us to innovate, grow, and thrive while enjoying the journey together. We celebrate diversity and foster an inclusive environment, empowering our employees to be their authentic selves.

The Director of Site Reliability Engineering is responsible for leading the strategic vision, operational excellence, and organizational capability of our SRE function. This role combines technical leadership with people management to build and scale a world-class SRE organization that enables rapid innovation while maintaining exceptional reliability standards.

As the senior leader of the SRE discipline, you will establish the technical strategy, culture, and practices that ensure our systems can scale reliably to meet business demands. You will build and lead a team of SRE professionals, partner with engineering leadership across the organization, and drive the adoption of SRE principles and practices.

This is a hands-on leadership role requiring deep technical expertise, proven ability to scale engineering organizations, and a track record of building reliable systems at scale. The ideal candidate will balance long-term strategy with tactical execution, driving both immediate operational excellence and long-term architectural improvements where necessary.

Key Responsibilities

Strategic Leadership & Vision

Define and execute the long-term SRE strategy aligned with business objectives and technical roadmap
Establish reliability standards, SLI/SLO frameworks, and error budget policies across services
Drive architectural decisions that improve system reliability, scalability, and operational efficiency
Partner with engineering leadership to influence platform and application design for reliability
Represent SRE perspective in executive technical discussions and strategic planning

Team Leadership & Development

Build, lead, and scale a high-performing SRE organization
Recruit, hire, and onboard top-tier SRE talent across multiple experience levels
Develop career progression frameworks and growth paths for SRE professionals
Foster a culture of continuous learning, blameless post-mortems, and operational excellence
Provide technical mentorship and leadership development for senior SRE staff

Operational Excellence & Incident Management

Manage and oversee enterprise-wide incident response processes and on-call practices
Drive root cause analysis programs and ensure systematic elimination of failure modes
Implement sustainable on-call practices that maintain work-life balance while ensuring coverage
Oversee capacity planning and resource optimization strategies across all services
Establish metrics and reporting frameworks for reliability, performance, and operational health

Cross-Functional Partnership

Collaborate with VP/Director level peers in Engineering, Product, and Infrastructure
Work with Security leadership to integrate reliability and security practices
Partner with Finance on cost optimization initiatives and capacity planning budgets
Engage with Customer Success and Support teams on reliability-impacting issues

Platform & Tooling Strategy

Drive the simplification and reduction of observability, monitoring, and alerting platforms
Establish automation standards and drive toil reduction initiatives
Help improve CI/CD pipeline architecture and deployment practices
Influence infrastructure-as-code and configuration management strategies

Organizational & Process Innovation

Implement SRE best practices including error budgets, toil tracking, and reliability reviews
Establish metrics-driven decision making and continuous improvement processes
Drive adoption of chaos engineering and proactive reliability testing
Create and maintain SRE documentation, runbooks, and knowledge sharing systems
Develop and execute disaster recovery and business continuity plans

Required Skills

Leadership & Management Experience

Bachelor's or Master's degree in Computer Science, Engineering, or equivalent experience
8+ years in engineering leadership roles, with 4+ years managing managers
Proven track record of building and scaling engineering teams
Experience with performance management, career development, and succession planning
Strong executive presence and ability to influence without authority
Experience driving organizational change and cultural transformation

Technical Expertise

Experience with multiple cloud platforms (AWS, GCP, Azure) and hybrid environments
Deep understanding of distributed systems, microservices architecture, and cloud platforms
Hands-on experience with modern observability tools (Prometheus, Grafana, Datadog, etc.)
Strong background in infrastructure automation, CI/CD, and infrastructure-as-code
Expertise in capacity planning, performance optimization, and cost management

SRE & Operations Mastery

Deep understanding of SRE principles, practices, and implementation at scale
Experience establishing SLI/SLO frameworks and error budget management
Proven track record of improving system reliability and reducing operational toil
Experience with incident management, post-mortem processes, and reliability engineering
Background in 24/7 operations and on-call management best practices

Business & Strategic Acumen

Understanding of budget management, resource allocation, and ROI analysis
Ability to communicate technical concepts to non-technical stakeholders and executives
Experience with vendor management and technology partnership decisions
Knowledge of compliance frameworks and regulatory requirements

Desired Skills

Advanced Technical Background

Background in container orchestration (Kubernetes) and service mesh technologies
Knowledge of database administration and data platform reliability
Experience with security engineering and DevSecOps practices

Success Metrics

Reliability & Performance

Achieve and maintain service availability targets (typically 99.9%+ uptime)
Reduce mean time to detection (MTTD) and mean time to recovery (MTTR)
Improve capacity planning accuracy and reduce over-provisioning costs
Increase deployment frequency while maintaining reliability standards
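
For context on the availability target above, an uptime percentage can be translated into an allowed-downtime budget. The following is a minimal sketch, not part of the role description: the function names and the 30-day window are illustrative assumptions, and the 99.9% figure is the example target quoted in the list.

```python
def downtime_budget_minutes(availability_pct: float, window_days: int = 30) -> float:
    """Return the minutes of downtime allowed in the window for a given target.

    Both the function name and the 30-day default window are assumptions
    made for illustration.
    """
    if not 0 <= availability_pct <= 100:
        raise ValueError("availability_pct must be between 0 and 100")
    total_minutes = window_days * 24 * 60
    return total_minutes * (100 - availability_pct) / 100


def budget_remaining(availability_pct: float,
                     downtime_so_far_minutes: float,
                     window_days: int = 30) -> float:
    """Return the unspent downtime budget; a negative value means the target was missed."""
    return downtime_budget_minutes(availability_pct, window_days) - downtime_so_far_minutes


if __name__ == "__main__":
    # A 99.9% target over 30 days allows roughly 43.2 minutes of downtime.
    print(round(downtime_budget_minutes(99.9), 1))
    # After a hypothetical 13.2-minute outage, roughly 30 minutes remain.
    print(round(budget_remaining(99.9, 13.2), 1))
```

Under these assumptions, each additional "nine" shrinks the budget tenfold, which is why the chosen target directly shapes on-call load and deployment pace.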

Team & Organizational Development

Build and retain a high-performing SRE organization with low attrition
Establish clear career progression and achieve high employee satisfaction scores
Develop internal talent and promote from within the SRE organization
Create sustainable on-call practices with reasonable operational load

Operational Excellence

Drive measurable reduction in operational toil and manual interventions
Establish comprehensive observability and proactive alerting across all services
Implement effective incident response with blameless post-mortem culture
Achieve cost optimization targets while maintaining reliability standards

Five9 embraces diversity and is committed to building a team that represents a variety of backgrounds, perspectives, and skills. The more inclusive we are, the better we are. Five9 is an equal opportunity employer.

View our privacy policy, including our privacy notice to California residents here:

Note: Five9 will never request that an applicant send money as a prerequisite for commencing employment with Five9.

