Observability Engineer
3 days ago
Observability Engineer
Location:
Hyderabad
Job Summary:
We are seeking a highly skilled and motivated
Grafana Dashboard Specialist
with strong expertise in DevOps automation to join our team. The ideal candidate will be responsible for designing, developing, and maintaining advanced Grafana dashboards that provide actionable insights into system performance, application metrics, and business KPIs. This role also requires deep expertise in automation, including CI/CD pipelines, Infrastructure as Code (IaC), and cloud-native operations for Grafana.
Key Responsibilities:
Grafana & Observability:
- Design and implement visually compelling and data-rich Grafana dashboards for Observability.
- Integrate Grafana Cloud with data sources such as Prometheus, Loki, ServiceNow, PagerDuty, Snowflake, and AWS.
- Integrate telemetry data sources such as Tomcat, Liberty, Ping, Linux, Windows, and databases (Oracle, Postgres) via REST APIs.
- Create alerting mechanisms for SLA breaches, latency spikes, and transaction anomalies.
- Develop custom panels and alerts to monitor infrastructure, applications, and business metrics.
- Collaborate with stakeholders to define KPIs and visualization needs.
- Optimize dashboard performance and usability across teams.
- Implement and manage OpenTelemetry instrumentation across services to collect distributed traces, metrics, and logs.
- Integrate OpenTelemetry data pipelines with Grafana and other observability platforms.
- Develop and maintain OpenTelemetry collectors and exporters for various environments.
- Ensure monitoring solutions support high availability and performance.
DevOps & Automation:
- Architect, design, and maintain CI/CD pipelines using tools such as Jenkins, Bitbucket, and Nexus.
- Implement Infrastructure as Code (IaC) using Terraform and Ansible.
- Automate deployment, scaling, and monitoring of both cloud-native and on-premises environments.
- Ensure system reliability, scalability, and security through automated processes.
- Collaborate with development and operations teams to streamline workflows and reduce manual intervention.
Subject Matter Expert (SME) Responsibilities:
- Act as a technical advisor on automation and observability best practices.
- Lead initiatives to improve system performance, reliability, and developer productivity.
- Conduct training sessions and create documentation for internal teams.
- Stay current with industry trends and emerging technologies in DevOps and observability.
- Guide the adoption of OpenTelemetry standards and practices across engineering teams.
- Optimize monitoring processes and tools for greater efficiency and effectiveness.
Required Qualifications:
- Bachelor's degree in Computer Science, Engineering, or related field (or equivalent experience).
- 5+ years of experience in DevOps, SRE, or infrastructure automation roles.
- 3+ years of hands-on experience with Grafana and dashboard development.
- Strong proficiency in scripting languages (Python, Bash, Go).
- Experience with monitoring tools (Grafana Cloud, Prometheus, Loki, Dynatrace, Splunk, etc.).
- Strong knowledge of CI/CD and cloud platforms (AWS and Azure).
- Expertise in Kubernetes, Docker, and container orchestration.
- Familiarity with security and compliance in automated environments.
- Hands-on experience with OpenTelemetry instrumentation and data collection.
Preferred Qualifications:
- Grafana certification or equivalent experience.
- Experience with custom Grafana plugins or panel development.
- Knowledge of business intelligence tools and data visualization principles.
- Contributions to open-source DevOps or observability projects.
- Strong communication and stakeholder management skills.
- Experience with OpenTelemetry Collector configuration and integration.
- Familiarity with distributed tracing concepts.
If you are passionate about
Observability, Grafana, and Automation
, and want to make an impact by building cutting-edge monitoring solutions, we'd love to hear from you
EEO:
"Mindlance is an Equal Opportunity Employer and does not discriminate in employment on the basis of – Minority/Gender/Disability/Religion/LGBTQI/Age/Veterans."
-
Observability Engineer
7 days ago
Hyderabad, Telangana, India algoleap Full time ₹ 12,00,000 - ₹ 36,00,000 per yearRole: Observability EngineerJob Description:Senior Platform EngineerWe are seeking a highly experienced and driven Senior Observability Engineer to lead the design, development, and maintenance of observability solutions across our infrastructure, applications, and services. As a Senior Observability Engineer, you will be at the forefront of implementing...
-
Observability Engineer
3 days ago
Hyderabad, Telangana, India Algoleap Technologies Full time ₹ 12,00,000 - ₹ 36,00,000 per yearSUMMARY Role: Observability EngineerJob Description:Senior Platform EngineerWe are seeking a highly experienced and driven Senior Observability Engineer to lead the design, development, and maintenance of observability solutions across our infrastructure, applications, and services. As a Senior Observability Engineer, you will be at the forefront of...
-
Observability Engineer
2 hours ago
Hyderabad, Telangana, India, Telangana Mindlance Full timeObservability EngineerLocation: HyderabadJob Summary:We are seeking a highly skilled and motivated Grafana Dashboard Specialist with strong expertise in DevOps automation to join our team. The ideal candidate will be responsible for designing, developing, and maintaining advanced Grafana dashboards that provide actionable insights into system performance,...
-
Observability/AlOps
4 weeks ago
Hyderabad, Telangana, India IntraEdge Full timeL2- Observability/AIOps (5 to 8 yrs exp).Site Reliability Engineering (SRE) is an engineering discipline that combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures internally critical and externally visible systems have reliability and uptime appropriate to users' needs and a fast...
-
Observability/AlOps
3 days ago
Hyderabad, Telangana, India IntraEdge Full time ₹ 12,00,000 - ₹ 36,00,000 per yearL2- Observability/AIOps (5 to 8 yrs exp).Site Reliability Engineering (SRE) is an engineering discipline that combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures internally critical and externally visible systems have reliability and uptime appropriate to users' needs and a fast...
-
Observability Engineer/AWS Devops Engineer
7 days ago
Hyderabad, Telangana, India Data Economy Full time ₹ 15,00,000 - ₹ 25,00,000 per yearJob Summary:We are seeking an experienced Observability Engineer with a strong DevOps background to design, implement, and manage observability solutions across cloud and on-prem environments. The ideal candidate will have expertise in monitoring, logging, tracing, and alerting to ensure high system availability, performance, and reliability.Key...
-
Hyderabad, Telangana, India Resource Informatics Group, Inc Full time ₹ 15,00,000 - ₹ 25,00,000 per yearRole:Architect / Technical Manager (15+ yrs)Location: IndiaRate:$Market All InclusiveScope of Work:Own technical direction for Dynatrace, LogicMonitor, and ELK.Drive Observability Platform Engineering, enforce standards/taxonomies, and lead automation pipelines.Liaise with client stakeholders and tool vendors.Oversee SOP/Runbook creation and ensure...
-
Hyderabad, Telangana, India ServiceNow Full time ₹ 8,00,000 - ₹ 24,00,000 per yearCompany Description It all started in sunny San Diego, California in 2004 when a visionary engineer, Fred Luddy, saw the potential to transform how we work. Fast forward to today — ServiceNow stands as a global market leader, bringing innovative AI-enhanced technology to over 8,100 customers, including 85% of the Fortune 500. Our intelligent cloud-based...
-
Hyderabad, Telangana, India ServiceNow Full time ₹ 8,00,000 - ₹ 24,00,000 per yearCompany DescriptionIt all started in sunny San Diego, California in 2004 when a visionary engineer, Fred Luddy, saw the potential to transform how we work. Fast forward to today — ServiceNow stands as a global market leader, bringing innovative AI-enhanced technology to over 8,100 customers, including 85% of the Fortune 500. Our intelligent cloud-based...
-
Hyderabad, Telangana, India Kiash Solutions LLP Full time ₹ 15,00,000 - ₹ 25,00,000 per yearExp -- 4+ YearsShift PM to 11:30 PM ISTMandatory-- Python with LLM OpsJob Description-- We are looking for a hands-on AI Engineer with strong expertise in LLM integration, platform observability, performance optimization, and API development. The ideal candidate will work on critical platform enhancements, including LLM API integrations, observability...