
Site Reliability Leader
20 hours ago
Job Description:
We are seeking a highly skilled Senior Site Reliability Engineer to join our team. As a key member of our engineering organization, you will be responsible for driving the reliability and scalability of cloud-based systems, identifying and implementing improvements for operational efficiency, and proactively monitoring our infrastructure.
Key Responsibilities:
- Drive the reliability and scalability of cloud-based systems while identifying and implementing improvements for operational efficiency and proactive monitoring.
- Automation and Tool Development: Continuously seek opportunities to automate workflows, develop self-sustaining tools, and improve operational efficiency.
- Incident Management: Facilitate partner inquiries and production incidents, ensuring compliance with internal SLAs. Responsibilities include responding to, investigating, and mitigating customer impact.
- Partner with Global Partner Integrations (GPI), consumer engineering teams, and the PMO to support product launches and other initiatives.
- Troubleshoot production issues by reviewing source code, logs, operational metrics, stack traces, etc. to pinpoint specific problems and resolve them. Identify root causes and learnings to improve operational processes.
- Be a results-driven creative thinker who drives innovation and produces delightful experiences for our customers.
- Demonstrate data-driven, open-minded decision making and an insatiable curiosity; love to invent and innovate to solve difficult challenges.
- Take ownership of your work and consistently deliver results in a fast-paced environment.
- Actively support hyper-care and watch party events, providing real-time operational metrics and insights.
- Perform health checks on critical applications and services, ensuring uptime and availability.
- Write complex queries and scripts, analyze datasets, and pinpoint issues efficiently.
- Effectively communicate with global partners and stakeholders.
Requirements:
- Monitoring & Alerting: Experience implementing alerting, metrics, and logging using tools like Prometheus, CloudWatch, Elastic, and PagerDuty.
- Direct experience with at least one cloud provider (AWS, GCP, Azure, or other).
- Strong expertise in SQL and hands-on experience working with databases.
- Experience building dashboards using tools like Databricks and Grafana.
- Familiarity with OAuth 2.0 authentication framework.
- Experience with tools such as PagerDuty and ServiceNow is a plus.
- Ability to work flexible shifts to provide global operational coverage and collaborate effectively with remote peers across disparate geographies and time zones.
What We Offer:
Warner Bros. Discovery offers a range of benefits including competitive salaries, comprehensive health insurance, retirement plans, paid time off, and more. We also offer opportunities for career growth and professional development, as well as a dynamic and inclusive work environment that values diversity, equity, and inclusion.
How to Apply:
To apply for this role, please submit your resume and cover letter through our website. We look forward to hearing from you.
-
Site Reliability Engineering Leader
2 days ago
Hyderabad / Secunderabad, Telangana, India beBeeEngineering Full time ₹ 1,04,000 - ₹ 1,30,878
Role Overview: We are seeking a highly skilled Site Reliability Engineering (SRE) Lead to drive the reliability, scalability, and performance of our services. This role requires a strong background in technical expertise, leadership skills, and passion for operational excellence. Key Responsibilities: Lead and mentor a team of SREs to ensure high availability and...
-
Reliability Engineering Leader
4 days ago
Hyderabad / Secunderabad, Telangana, India beBeeSite Full time
Job Description: As a Senior Site Reliability Engineer, you will be responsible for ensuring the smooth operation of our production support services environments. This involves automating systems and applications to ensure 24x7 uptime, as well as implementing scripting languages such as Python. Required Skills and Qualifications: Fluency in scripting languages...
-
Site Reliability Leader
5 days ago
Chennai, Bengaluru / Bangalore, Hyderabad / Secunderabad, Telangana, India beBeeSRE Full time ₹ 1,04,000 - ₹ 1,30,878
Job Title: Senior Site Reliability Engineer. Overview: The Software Engineering team focuses on delivering application enhancements and new products to meet the needs of a changing world. We work at the forefront, designing and developing software for platforms, peripherals, applications, and diagnostics using the latest technologies, tools, and...
-
Site Reliability Engineer
5 days ago
Hyderabad / Secunderabad, Telangana, India beBeeReliability Full time ₹ 1,04,000 - ₹ 1,30,878
Cloud Reliability Engineer. We are seeking a skilled Cloud Reliability Engineer to join our team. In this role, you will be responsible for implementing and driving Site Reliability Engineering (SRE) discipline in the project. Key Responsibilities: Implement and drive SRE discipline in the project. Evaluate emerging SRE tools and stay updated on the...
-
Site Infrastructure Reliability Engineer
1 week ago
Hyderabad / Secunderabad, Telangana, India beBeeInfrastructure Full time US$ 90,000 - US$ 1,20,000
Job Overview: This role focuses on ensuring the reliability and performance of our site infrastructure. Key Responsibilities: Maintaining and enhancing the scalability, availability, and security of our site infrastructure. Developing service level indicators and objectives to ensure high-quality services. Collaborating with cross-functional teams to...
-
Site Reliability Leader
10 hours ago
Hyderabad, Telangana, India beBeeReliability Full time ₹ 65 - ₹ 85
Job Title: Site Reliability Engineering Manager. Location: Hyderabad. Employment Type: Full-Time. Work Model: 3 days from office (Hybrid). About the Role: The SRE Manager will lead the reliability engineering function, ensuring infrastructure resiliency and optimal operational performance. This hybrid role blends technical leadership with team mentorship and...
-
Site Reliability Leader
6 days ago
Hyderabad, Telangana, India beBeeReliability Full time ₹ 1,50,00,000 - ₹ 2,50,00,000
Job Title: Lead Site Reliability Engineer. About the Role: This role focuses on ensuring platform and application availability, scalability, and reliability. Key Responsibilities: Build, monitor, and maintain highly scalable deployments. Install new releases and environments for applications. Proactively monitor systems and applications, develop monitoring...
-
Azure Cloud Site Reliability Engineer
1 week ago
Hyderabad / Secunderabad, Telangana, India beBeeCloud Full time US$ 1,04,000 - US$ 1,30,878
Job Description: We are seeking a highly skilled Azure Cloud Site Reliability Engineer (SRE) to join our organization. The ideal candidate will have a strong background in cloud infrastructure, automation, and operational excellence, with a focus on ensuring the reliability, scalability, and performance of our Azure cloud environments. The successful candidate...
-
Senior Site Reliability Specialist
4 days ago
Hyderabad / Secunderabad, Telangana, Chennai, Bengaluru / Bangalore, India beBeeReliability Full time ₹ 1,04,000 - ₹ 1,30,878
About This Role: We are seeking a highly skilled Senior Site Reliability Engineer to join our team. As a critical member of our reliability team, you will be responsible for developing sophisticated systems and software that meet the needs of our customers.
-
Site Reliability Expert
5 days ago
Hyderabad, Telangana, India beBeeResponsibilities Full time ₹ 1,50,00,000 - ₹ 2,00,00,000
Job Title: Achieving System Excellence. About the Role: We are seeking a skilled Site Reliability Engineer to join our team. The ideal candidate will have 5+ years of experience in DevOps and Site Reliability Engineering, with a strong focus on ensuring smooth system operations. Key Responsibilities: Design, implement, and maintain scalable systems using...