Observability Lead

2 days ago


Hyderabad, Telangana, India beBeeData Full time ₹ 2,00,00,000 - ₹ 3,00,00,000
Job Description

As a key member of our team, you will be responsible for establishing and advancing observability practices across our Enterprise Data & Analytics Platforms landscape.

With a deep understanding of monitoring tools and SRE principles, you will enable proactive issue detection, faster incident resolution, and continuous improvement of platform performance and reliability.

">Key Responsibilities:
  • Design and implement comprehensive observability frameworks to monitor performance, reliability, and availability of EDAA data platforms and services.
  • Define and track key SLIs, SLOs, and KPIs across critical platform components and data pipelines.
  • Lead the integration of monitoring, logging, tracing, and alerting tools to enable real-time insights and root cause analysis.
  • Collaborate with platform engineering, SRE, and product teams to enhance observability coverage and automate incident responses.
  • Drive the adoption of best practices in telemetry collection, dashboards, and visualization for operational excellence.
  • Oversee incident management processes and post-mortem practices to ensure continuous reliability improvements.
  • Provide leadership in tool evaluation and deployment across observability and performance management platforms.
  • Partner with security, compliance, and data governance teams to ensure visibility into data usage, lineage, and policy adherence.
  • Lead operational reviews and reporting to highlight system health, risks, and opportunities for optimization.

Required Skills & Qualifications

You will need 8+ years of experience in platform engineering, site reliability engineering (SRE), DevOps, or observability roles. You should have proficient knowledge of ETL Pipelines and strong expertise with observability tools and platforms. Additionally, you should have a deep understanding of monitoring distributed data platforms, data pipelines, and cloud-native architectures.

Benefits

This is a full-time role working in a hybrid environment. The ideal candidate will have a proven ability to lead cross-functional collaboration and align technical work with business outcomes. A Bachelor's degree in computer science, Engineering, or related field is required; an advanced degree or certifications are preferred.

Others

McDonald's is committed to providing qualified individuals with disabilities with reasonable accommodations to perform the essential functions of their jobs. McDonald's provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to sex, race, color, religion, ancestry or national origin, age, disability status, medical condition, marital status, sexual orientation, gender, gender identity, gender expression, transgender status, protected military or veteran status, citizenship status, genetic information, or any other characteristic protected by federal, state or local laws.



  • Hyderabad, Telangana, India beBeeObservability Full time ₹ 25,00,000 - ₹ 40,00,000

    We are seeking a highly skilled professional to lead the establishment and advancement of observability, monitoring, and reliability practices across the Enterprise Data & Analytics (EDAA) Platforms landscape.This role will ensure end-to-end visibility into platform performance, data pipeline health, system availability, and operational Service Level...


  • Hyderabad, Telangana, India algoleap Full time ₹ 1,04,000 - ₹ 1,30,878 per year

    Role: Observability EngineerJob Description:Senior Platform EngineerWe are seeking a highly experienced and driven Senior Observability Engineer to lead the design, development, and maintenance of observability solutions across our infrastructure, applications, and services. As a Senior Observability Engineer, you will be at the forefront of implementing...


  • Hyderabad, Telangana, India Algoleap Technologies Full time ₹ 5,00,000 - ₹ 8,00,000 per year

    SUMMARY Role: Observability EngineerJob Description:Senior Platform EngineerWe are seeking a highly experienced and driven Senior Observability Engineer to lead the design, development, and maintenance of observability solutions across our infrastructure, applications, and services. As a Senior Observability Engineer, you will be at the forefront of...


  • Hyderabad, Telangana, India beBeeObservability Full time ₹ 1,50,00,000 - ₹ 2,50,00,000

    Job TitleObservability EngineerKey ResponsibilitiesWe are seeking a seasoned Observability professional with at least 8+ years of experience in IT Infrastructure and Observability, Monitoring, or SRE roles.A strong background in Kubernetes and containerized environments is essential for this role.Expertise in monitoring tools such as Prometheus, Grafana,...


  • Hyderabad, Telangana, India beBeeData Full time ₹ 20,00,000 - ₹ 25,00,000

    Job Overview">We are seeking a skilled professional to lead our Data Platform Observability initiative.">This role requires a strong understanding of monitoring tools and site reliability engineering (SRE) principles.">The successful candidate will be responsible for establishing and advancing observability practices across the Enterprise Data & Analytics...


  • Hyderabad, Telangana, India beBeeData Full time ₹ 2,00,00,000 - ₹ 2,50,00,000

    Job Title: Data Platform Observability LeadJob Description:The role of a Data Platform Observability Lead involves establishing and advancing observability practices across the Enterprise Data & Analytics landscape. The successful candidate will play a crucial part in designing, implementing, and monitoring comprehensive observability frameworks to monitor...

  • Observability/AlOps

    1 week ago


    Hyderabad, Telangana, India IntraEdge Full time

    L2- Observability/AIOps (5 to 8 yrs exp). Site Reliability Engineering (SRE) is an engineering discipline that combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures internally critical and externally visible systems have reliability and uptime appropriate to users' needs and a fast...

  • Observability/AlOps

    1 week ago


    Hyderabad, Telangana, India IntraEdge Full time

    L2- Observability/AIOps (5 to 8 yrs exp).Site Reliability Engineering (SRE) is an engineering discipline that combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures internally critical and externally visible systems have reliability and uptime appropriate to users' needs and a fast...

  • Observability/AlOps

    6 days ago


    Hyderabad, Telangana, India IntraEdge Full time

    L2- Observability/AIOps (5 to 8 yrs exp).Site Reliability Engineering (SRE) is an engineering discipline that combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures internally critical and externally visible systems have reliability and uptime appropriate to users' needs and a fast...

  • Observability/AlOps

    2 weeks ago


    Hyderabad, Telangana, India IntraEdge Full time

    L2- Observability/AIOps (5 to 8 yrs exp).Site Reliability Engineering (SRE) is an engineering discipline that combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures internally critical and externally visible systems have reliability and uptime appropriate to users' needs and a fast...