
Site Reliability Leader
1 day ago
Location: Hyderabad
Employment Type: Full-Time
Work Model: 3 Days from office (Hybrid)
About the Role:
The SRE Manager will lead the reliability engineering function, ensuring infrastructure resiliency and optimal operational performance. This hybrid role blends technical leadership with team mentorship and cross-functional coordination.
Key Responsibilities:
- Establish and lead the implementation of organizational reliability strategies, aligning SLAs, SLOs, and Error Budgets with business goals and customer expectations.
- Develop and institutionalize incident response frameworks, including escalation policies, on-call scheduling, service ownership mapping, and RCA process governance.
- Lead technical reviews for infrastructure reliability design, high-availability architectures, and resiliency patterns across distributed cloud services.
- Champion an observability and monitoring culture by standardizing tooling, alert definitions, dashboard templates, and telemetry data schemas across all product teams.
- Drive continuous improvement through operational maturity assessments, toil elimination initiatives, and SRE OKRs aligned with product objectives.
- Collaborate with cloud engineering and platform teams to introduce self-healing systems, capacity-aware autoscaling, and latency-optimized service mesh patterns.
- Act as the principal escalation point for reliability-related concerns, and ensure incident retrospectives lead to measurable improvements in uptime and MTTR.
- Own runbook standardization, capacity planning, failure mode analysis, and production readiness reviews for new feature launches.
- Mentor and develop a high-performing SRE team by fostering a proactive ownership culture, encouraging cross-functional knowledge sharing, and establishing technical career pathways.
- Collaborate with leadership, delivery, and customer stakeholders to define reliability goals, track performance, and demonstrate ROI on SRE investments.
Requirements:
- 10+ years of total experience, with 3+ years in a leadership role in SRE or Cloud Operations.
- Deep understanding of Kubernetes, GKE, Prometheus, Terraform, and advanced GCP cloud administration; CI/CD tooling (Jenkins, Argo CD, GitHub Actions); and full-lifecycle incident management tools such as OpsGenie.
- Nice to Have: Knowledge of service mesh and observability stacks; strong scripting skills (Python, Bash); BigQuery and Dataflow exposure for telemetry.
Benefits:
This is a great opportunity to work with a leading organization in the industry. We offer a competitive salary, comprehensive benefits package, and opportunities for growth and development.
-
Site Reliability Leader
7 days ago
Hyderabad, Telangana, India beBeeReliability Full time ₹ 1,50,00,000 - ₹ 2,50,00,000. Job Title: Lead Site Reliability Engineer. About the Role: This role focuses on ensuring platform and application availability, scalability, and reliability. Key Responsibilities: Build, monitor, and maintain highly scalable deployments. Install new releases and environments for applications. Proactively monitor systems and applications, develop monitoring...
-
Site Reliability Expert
6 days ago
Hyderabad, Telangana, India beBeeResponsibilities Full time ₹ 1,50,00,000 - ₹ 2,00,00,000. Job Title: Achieving System Excellence. About the Role: We are seeking a skilled Site Reliability Engineer to join our team. The ideal candidate will have 5+ years of experience in DevOps and Site Reliability Engineering, with a strong focus on ensuring smooth system operations. Key Responsibilities: Design, implement, and maintain scalable systems using...
-
Site Reliability Engineering
1 week ago
Hyderabad, Telangana, India Acesoft Labs Full time ₹ 1,04,000 - ₹ 1,30,878 per year. Hi, kindly find the JD below. Job Title: Site Reliability Engineering (SRE) Manager. Location: Hyderabad. Employment Type: Full-Time. Work Model: 3 Days from office (Hybrid). Summary: The SRE Manager at TechBlocks India will lead the reliability engineering function, ensuring infrastructure resiliency and optimal operational performance. This hybrid role blends...
-
Site Reliability Engineer
1 week ago
Hyderabad, Telangana, India Talent Worx Full time US$ 1,20,000 - US$ 2,00,000 per year. Talent Worx is seeking a talented SRE (Site Reliability Engineer) to enhance our technology team. In this role, you will be pivotal in ensuring the reliability, performance, and availability of our applications and services. Your work will involve both software engineering and systems operations as you strive to improve customer experiences and operational...
-
Senior Site Reliability Engineer
7 days ago
Hyderabad, Telangana, India CloudHire Full time ₹ 7,00,000 - ₹ 12,00,000 per year. Job Summary: The Technical Manager for Site Reliability Engineering (SRE) will lead a remote team of Site Reliability Engineers, ensuring operational excellence and fostering a high-performing team culture. Reporting to the US-based Director of Systems and Security, this role is responsible for overseeing day-to-day operations, technical mentorship, and...
-
Site Reliability Manager
6 days ago
Hyderabad, Telangana, India TechBlocks Full time ₹ 7,00,000 - ₹ 12,00,000 per year. About TechBlocks: TechBlocks is a global digital product engineering company with 16+ years of experience helping Fortune 500 enterprises and high-growth brands accelerate innovation, modernize technology, and drive digital transformation. From cloud solutions and data engineering to experience design and platform modernization, we help businesses solve...
-
Manager - Site Reliability
1 day ago
Hyderabad, Telangana, India ZORTECH SOLUTIONS PRIVATE LIMITED Full time. Job Title: Site Reliability Engineering (SRE) Manager. Location: Hyderabad. Employment Type: Full-Time. Work Model: 3 Days from office (Hybrid). Summary: The SRE Manager will lead the reliability engineering function, ensuring infrastructure resiliency and optimal operational performance. This hybrid role blends technical leadership with team mentorship and...
-
Site Reliability Engineer
6 days ago
Hyderabad, Telangana, India Jigya Software Services Full time ₹ 1,50,000 - ₹ 28,00,000 per year. Job Title: Senior Site Reliability Engineer (SRE) - AWS/Kubernetes. Location: Hyderabad - Onsite. Job Type: Full-Time. About the Role: We are looking for a highly skilled and motivated Site Reliability Engineer to design, build, and maintain our high-performance, scalable cloud infrastructure. You will play a critical role in ensuring the reliability, performance, and...
-
Senior Site Reliability Engineer
2 weeks ago
Hyderabad, Telangana, India Microsoft Full time ₹ 9,00,000 - ₹ 12,00,000 per year. The Windows Cloud division is looking for a Senior Site Reliability Engineer who will help us take the Windows Cloud platform, as well as the Windows 365 Cloud PC and Azure Virtual Desktop business, to the next level. Windows 365 Cloud PC (W365) and Azure Virtual Desktop (AVD) have recently been recognized as leaders in the Gartner Magic Quadrant for Desktop...
-
Senior Site Reliability Expert
3 days ago
Hyderabad, Telangana, India beBeeSite Full time ₹ 2,24,00,000 - ₹ 3,51,20,000. About Our Senior Site Reliability Expert: The role of a senior site reliability expert is pivotal in ensuring the stability, scalability, and operational excellence of accounting and finance systems. Key Responsibilities: Operational Oversight: As a senior site reliability expert, you will be responsible for overseeing day-to-day operations for accounting and...