Senior Observability Engineer
3 days ago
Role Overview:
We are seeking a Senior Observability Engineer with strong expertise in designing, implementing, and optimizing observability solutions. In this role, you will be key to shaping the future of observability at Cognite: assessing existing observability frameworks, identifying gaps, and building robust capabilities encompassing log aggregation, event correlation, noise reduction, and comprehensive telemetry analysis to enable proactive operational excellence and reliability for our services.

Key Responsibilities:
Conduct assessments of existing observability architectures to identify gaps and improvement opportunities.
Design and implement scalable log aggregation pipelines for centralized and efficient data collection.
Apply noise-reduction techniques to filter irrelevant or false-positive alerts, enhancing focus on actionable issues.
Develop and maintain monitoring dashboards that deliver actionable insights across applications and infrastructure.
Lead the migration from Lightstep to Honeycomb, ensuring seamless data pipeline transitions, OpenTelemetry alignment, and stakeholder adoption.
Collaborate with infrastructure and product teams to integrate observability tooling into CI/CD workflows and cloud environments.
Analyze telemetry data (metrics, logs, traces) to troubleshoot complex system behaviors and recommend improvements.
Participate in production debugging and incident troubleshooting using telemetry data.
Mentor junior engineers on log management, event correlation, distributed tracing, and alert management.
Stay current on observability innovations and recommend adoption strategies aligned with organizational goals.
Support post-incident reviews and continuous improvement through data-driven root cause analysis.
Drive continuous improvement in reliability and operational excellence through proactive observability initiatives.

Key Skills:
8+ years of experience in software or systems engineering, with at least 3 years focused on observability or SRE practices.
Hands-on experience with observability tools such as Honeycomb, VictoriaMetrics, Lightstep, Prometheus, Grafana, OpenTelemetry, Splunk, Datadog, or New Relic.
Strong knowledge of OpenTelemetry instrumentation (metrics, traces, logs) and SLIs/SLOs for reliability tracking.
Experience with distributed tracing, event correlation, and noise reduction frameworks.
Proficiency in one or more programming/scripting languages such as Python, Java, Kotlin, Go, or Shell.
Working knowledge of Infrastructure as Code (Terraform) and CI/CD (Jenkins, GitHub Actions, ...) pipelines.
Familiarity with cloud platforms (AWS, Azure, GCP) and container orchestration (Kubernetes).
Strong analytical, troubleshooting, and communication skills with the ability to work effectively across teams.
Experience conducting observability gap assessments and defining improvement plans.
Experience working in complex or multi-cloud environments is preferred.
-
Senior Observability Engineer
5 days ago
New Delhi, India Cognite Full time
Role Overview: We are seeking a Senior Observability Engineer with strong expertise in designing, implementing, and optimizing observability solutions. In this role, you will be key to shaping the future of observability at Cognite, assessing existing observability frameworks, identifying gaps, and building robust capabilities encompassing log aggregation,...
-
SRE Observability Engineer
2 weeks ago
New Delhi, India TerraGiG Full time
We are looking for an SRE Observability Engineer.
About the Role:
Duration: Permanent
Location: Hyderabad
Timings: Full Time (as per company timings)
Notice Period: Immediate joiners only
Experience: 6-10 Years
JD:
Position: SRE Observability Engineer
Exp: 5+ to 10 Years
Location: Hyderabad
Mandatory Skills: Observability, Grafana and Writing queries using...
-
SRE Observability Engineer
3 weeks ago
New Delhi, India TerraGiG Full time
We are looking for an SRE Observability Engineer.
About the Role:
Duration: Permanent
Location: Hyderabad
Timings: Full Time (as per company timings)
Notice Period: Immediate joiners only
Experience: 6-10 Years
JD:
Position: SRE Observability Engineer
Exp: 5+ to 10 Years
Location: Hyderabad
Mandatory Skills: Observability, Grafana and Writing queries using Prometheus...
-
Senior Consultant – Observability
1 day ago
New Delhi, India World Wide Technology Full time
World Wide Technology (WWT) is a global technology integrator and IT solutions provider. Established in 1990 in St. Louis, Missouri, WWT collaborates with OEMs like Cisco and Dell EMC to offer infrastructure security and custom app development services to Fortune 500 companies in various sectors. With over 10,000 employees globally, we...
-
Senior Consultant – Observability
5 days ago
New Delhi, India World Wide Technology Full time
World Wide Technology (WWT) is a global technology integrator and IT solutions provider. Established in 1990 in St. Louis, Missouri, WWT collaborates with OEMs like Cisco and Dell EMC to offer infrastructure security and custom app development services to Fortune 500 companies in various sectors. With over 10,000 employees globally, we...
-
Senior Site Reliability Engineer – Grafana
4 weeks ago
New Delhi, India Aptimized Full time
Job Description – Senior Site Reliability Engineer (SRE) – Grafana & Observability
Position: Senior Site Reliability Engineer – Grafana & Observability
Location: [Hyderabad / Hybrid]
Experience: 10–20+ years
Operating globally, Aptimized is a premium ERP, HCM, and Technology Optimization Consulting agency. Our team at Aptimized focuses on helping our...
-
Cloud Engineer-Observability
4 weeks ago
New Delhi, India Smarsh Full time
About the team: The Observability team builds and manages the single telemetry and observability service used by all product teams on the Smarsh platform. It provides "as a service" telemetry, monitoring, and visualization capabilities that enable our product teams to operate, support, and triage the applications and services under their product portfolio....