SRE Manager
16 hours ago
Company Description
Entain India is the engineering and delivery powerhouse for Entain, one of the world's leading global sports and gaming groups. Established in Hyderabad in 2001, we've grown from a small tech hub into a dynamic force, delivering cutting-edge software solutions and support services that power billions of transactions for millions of users worldwide.
Our focus on quality at scale drives us to create innovative technology that supports Entain's mission to lead the change in the global sports and gaming sector. At Entain India, we make the impossible possible, together.
Job Description
We are seeking a talented and motivated SRE Manager to join our dynamic team. In this role, you will execute a range of site reliability activities, ensuring optimal service performance, reliability, and availability. You will collaborate with cross-functional engineering teams to develop scalable, fault-tolerant, and cost-effective cloud services.
If you are passionate about site reliability engineering and ready to make a significant impact, we would love to hear from you.
Key Responsibilities:
A Site Reliability Engineering (SRE) Team Manager plays a crucial role in ensuring system reliability, scalability, and operational efficiency. Their responsibilities typically include:
- Leading & Mentoring – Guiding a team of SREs, fostering a culture of automation, resilience, and continuous improvement.
- Defining SLOs & SLIs – Establishing service level objectives (SLOs) and indicators (SLIs) to measure and maintain system performance.
- Incident Management – Overseeing incident response, conducting post-mortems, and implementing preventive measures.
- Collaboration with Engineering Teams – Working closely with developers to build scalable and resilient systems.
- Automation & Efficiency – Driving automation initiatives to reduce toil and enhance operational workflows.
- Risk & Compliance Management – Ensuring adherence to security, compliance, and reliability standards.
- Optimizing Observability – Implementing monitoring tools and strategies to proactively detect and resolve issues.
- Implement automation tools, frameworks, and CI/CD pipelines, promoting best practices and code reusability.
- Enhance site reliability through process automation, reducing mean time to detection, resolution, and repair.
- Identify and manage risks through regular assessments and proactive mitigation strategies.
- Develop and troubleshoot large-scale distributed systems in both on-prem and cloud environments.
- Deliver infrastructure as code to improve service availability, scalability, latency, and efficiency.
- Monitor support processing for early detection of issues and share knowledge on emerging site reliability trends.
- Analyze data to identify improvement areas and optimize system performance through scale testing.
Qualifications
The following skills and tools are essential in this Site Reliability Engineering (SRE) role for maintaining system reliability, scalability, and efficiency:
Key SRE Skills
- Infrastructure as Code (IaC) – Automating provisioning with Terraform, Ansible, or Kubernetes.
- Observability & Monitoring – Implementing distributed tracing, logging, and metrics for proactive issue detection.
- Security & Compliance – Ensuring privileged access controls, audit logging, and encryption.
- Incident Management & MTTR Optimization – Reducing downtime with automated recovery mechanisms.
- Performance Engineering – Optimizing API latency, P99 response times, and resource utilization.
- Dependency Management – Ensuring resilience in microservices with circuit breakers and retries.
- CI/CD & Release Engineering – Automating deployments while maintaining rollback strategies.
- Capacity Planning & Scalability – Forecasting traffic patterns and optimizing resource allocation.
- Chaos Engineering – Validating system robustness through fault injection testing.
- Cross-Team Collaboration – Aligning SRE practices with DevOps, security, and compliance teams.
Essential SRE Tools
- Monitoring & Observability: Datadog, Prometheus, Grafana, New Relic.
- Incident Response: PagerDuty, OpsGenie.
- Configuration & Automation: Terraform, Ansible, Puppet.
- CI/CD Pipelines: Jenkins, GitHub Actions, ArgoCD.
- Logging & Tracing: ELK Stack, OpenTelemetry, Jaeger.
- Security & Compliance: Vault, AWS IAM, Snyk.
Additional Information
We know that signing top players requires a great starting package, and plenty of support to inspire peak performance. Join us, and a competitive salary is just the beginning. Working for us in India, you can expect to receive great benefits like:
- Safe home pickup and home drop (Hyderabad Office Only)
- Group Mediclaim policy
- Group Critical Illness policy
- Communication & Relocation allowance
- Annual Health check
And outside of this, you'll have the chance to turn recognition from leaders and colleagues into amazing prizes. Join a winning team of talented people and be a part of an inclusive and supportive community where everyone is celebrated for being themselves.
At Entain India, we do what's right. It's one of our core values, and that's why we're taking the lead when it comes to creating a diverse, equitable and inclusive future for our people and for the wider global sports betting and gaming sector. However you identify, across any protected characteristic, our ambition is to ensure our people across the globe feel valued and respected, with their individuality celebrated.
We comply with all applicable recruitment regulations and employment laws in the jurisdictions where we operate, ensuring ethical and compliant hiring practices globally.
Should you need any adjustments or accommodations to the recruitment process, at either application or interview, please contact us.