Observability Engineer

3 days ago


Hyderabad, Telangana, India Mindlance Full time ₹ 12,00,000 - ₹ 36,00,000 per year

Observability Engineer

Location:
Hyderabad

Job Summary:

We are seeking a highly skilled and motivated
Grafana Dashboard Specialist
with strong expertise in DevOps automation to join our team. The ideal candidate will be responsible for designing, developing, and maintaining advanced Grafana dashboards that provide actionable insights into system performance, application metrics, and business KPIs. This role also requires deep expertise in automation, including CI/CD pipelines, Infrastructure as Code (IaC), and cloud-native operations for Grafana.

Key Responsibilities:

Grafana & Observability:

  • Design and implement visually compelling and data-rich Grafana dashboards for Observability.
  • Integrate Grafana Cloud with data sources such as Prometheus, Loki, ServiceNow, PagerDuty, Snowflake, and AWS.
  • Integrate telemetry data sources such as Tomcat, Liberty, Ping, Linux, Windows, and databases (Oracle, Postgres) via REST APIs.
  • Create alerting mechanisms for SLA breaches, latency spikes, and transaction anomalies.
  • Develop custom panels and alerts to monitor infrastructure, applications, and business metrics.
  • Collaborate with stakeholders to define KPIs and visualization needs.
  • Optimize dashboard performance and usability across teams.
  • Implement and manage OpenTelemetry instrumentation across services to collect distributed traces, metrics, and logs.
  • Integrate OpenTelemetry data pipelines with Grafana and other observability platforms.
  • Develop and maintain OpenTelemetry collectors and exporters for various environments.
  • Ensure monitoring solutions support high availability and performance.

DevOps & Automation:

  • Architect, design, and maintain CI/CD pipelines using tools such as Jenkins, Bitbucket, and Nexus.
  • Implement Infrastructure as Code (IaC) using Terraform and Ansible.
  • Automate deployment, scaling, and monitoring of both cloud-native and on-premises environments.
  • Ensure system reliability, scalability, and security through automated processes.
  • Collaborate with development and operations teams to streamline workflows and reduce manual intervention.

Subject Matter Expert (SME) Responsibilities:

  • Act as a technical advisor on automation and observability best practices.
  • Lead initiatives to improve system performance, reliability, and developer productivity.
  • Conduct training sessions and create documentation for internal teams.
  • Stay current with industry trends and emerging technologies in DevOps and observability.
  • Guide the adoption of OpenTelemetry standards and practices across engineering teams.
  • Optimize monitoring processes and tools for greater efficiency and effectiveness.

Required Qualifications:

  • Bachelor's degree in Computer Science, Engineering, or related field (or equivalent experience).
  • 5+ years of experience in DevOps, SRE, or infrastructure automation roles.
  • 3+ years of hands-on experience with Grafana and dashboard development.
  • Strong proficiency in scripting languages (Python, Bash, Go).
  • Experience with monitoring tools (Grafana Cloud, Prometheus, Loki, Dynatrace, Splunk, etc.).
  • Strong knowledge of CI/CD and cloud platforms (AWS and Azure).
  • Expertise in Kubernetes, Docker, and container orchestration.
  • Familiarity with security and compliance in automated environments.
  • Hands-on experience with OpenTelemetry instrumentation and data collection.

Preferred Qualifications:

  • Grafana certification or equivalent experience.
  • Experience with custom Grafana plugins or panel development.
  • Knowledge of business intelligence tools and data visualization principles.
  • Contributions to open-source DevOps or observability projects.
  • Strong communication and stakeholder management skills.
  • Experience with OpenTelemetry Collector configuration and integration.
  • Familiarity with distributed tracing concepts.

If you are passionate about
Observability, Grafana, and Automation
, and want to make an impact by building cutting-edge monitoring solutions, we'd love to hear from you

EEO:

"Mindlance is an Equal Opportunity Employer and does not discriminate in employment on the basis of – Minority/Gender/Disability/Religion/LGBTQI/Age/Veterans."



  • Hyderabad, Telangana, India algoleap Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Role: Observability EngineerJob Description:Senior Platform EngineerWe are seeking a highly experienced and driven Senior Observability Engineer to lead the design, development, and maintenance of observability solutions across our infrastructure, applications, and services. As a Senior Observability Engineer, you will be at the forefront of implementing...


  • Hyderabad, Telangana, India Algoleap Technologies Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    SUMMARY Role: Observability EngineerJob Description:Senior Platform EngineerWe are seeking a highly experienced and driven Senior Observability Engineer to lead the design, development, and maintenance of observability solutions across our infrastructure, applications, and services. As a Senior Observability Engineer, you will be at the forefront of...


  • Hyderabad, Telangana, India, Telangana Mindlance Full time

    Observability EngineerLocation: HyderabadJob Summary:We are seeking a highly skilled and motivated Grafana Dashboard Specialist with strong expertise in DevOps automation to join our team. The ideal candidate will be responsible for designing, developing, and maintaining advanced Grafana dashboards that provide actionable insights into system performance,...

  • Observability/AlOps

    4 weeks ago


    Hyderabad, Telangana, India IntraEdge Full time

    L2- Observability/AIOps (5 to 8 yrs exp).Site Reliability Engineering (SRE) is an engineering discipline that combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures internally critical and externally visible systems have reliability and uptime appropriate to users' needs and a fast...

  • Observability/AlOps

    3 days ago


    Hyderabad, Telangana, India IntraEdge Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    L2- Observability/AIOps (5 to 8 yrs exp).Site Reliability Engineering (SRE) is an engineering discipline that combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures internally critical and externally visible systems have reliability and uptime appropriate to users' needs and a fast...


  • Hyderabad, Telangana, India Data Economy Full time ₹ 15,00,000 - ₹ 25,00,000 per year

    Job Summary:We are seeking an experienced Observability Engineer with a strong DevOps background to design, implement, and manage observability solutions across cloud and on-prem environments. The ideal candidate will have expertise in monitoring, logging, tracing, and alerting to ensure high system availability, performance, and reliability.Key...


  • Hyderabad, Telangana, India Resource Informatics Group, Inc Full time ₹ 15,00,000 - ₹ 25,00,000 per year

    Role:Architect / Technical Manager (15+ yrs)Location: IndiaRate:$Market All InclusiveScope of Work:Own technical direction for Dynatrace, LogicMonitor, and ELK.Drive Observability Platform Engineering, enforce standards/taxonomies, and lead automation pipelines.Liaise with client stakeholders and tool vendors.Oversee SOP/Runbook creation and ensure...


  • Hyderabad, Telangana, India ServiceNow Full time ₹ 8,00,000 - ₹ 24,00,000 per year

    Company Description It all started in sunny San Diego, California in 2004 when a visionary engineer, Fred Luddy, saw the potential to transform how we work. Fast forward to today — ServiceNow stands as a global market leader, bringing innovative AI-enhanced technology to over 8,100 customers, including 85% of the Fortune 500. Our intelligent cloud-based...


  • Hyderabad, Telangana, India ServiceNow Full time ₹ 8,00,000 - ₹ 24,00,000 per year

    Company DescriptionIt all started in sunny San Diego, California in 2004 when a visionary engineer, Fred Luddy, saw the potential to transform how we work. Fast forward to today — ServiceNow stands as a global market leader, bringing innovative AI-enhanced technology to over 8,100 customers, including 85% of the Fortune 500. Our intelligent cloud-based...


  • Hyderabad, Telangana, India Kiash Solutions LLP Full time ₹ 15,00,000 - ₹ 25,00,000 per year

    Exp -- 4+ YearsShift PM to 11:30 PM ISTMandatory-- Python with LLM OpsJob Description-- We are looking for a hands-on AI Engineer with strong expertise in LLM integration, platform observability, performance optimization, and API development. The ideal candidate will work on critical platform enhancements, including LLM API integrations, observability...