Observability Engineer
3 weeks ago
Job Description As an Observability Engineer, you will utilize your extensive Information Technology knowledge and experience to support/streamline AHEAD's Managed Services platforms and services. You will work with a collaborative team ensuring development efforts are well documented and delivered with quality along with maintaining the tooling architecture at the platform level. You will work with customer service owners, process owners and various service delivery groups and participate in meetings in a professional and courteous manner. The Observability Engineer is highly skilled on various platforms with strong experience supporting and maintaining integrations with external third-party tools. The Observability Engineer is a key role in the Managed Services team. The ideal candidate will have the knowledge and experience to work with a variety of technologies in diverse environments. This position will be focused on automating operational tasks, as well as maintaining and expanding our existing operational tool set, with a goal of driving efficiencies across the Managed Services team. Roles And Responsibilities - Should have experience with Datadog, Logic Monitor or Elastic. - Configure and tune monitoring tools to allow Managed Services to proactively manage customer environments - Document processes and standard operating procedures across the managed services team - Support P1 Platform outages, as needed - Provide third-level support and troubleshooting assistance - Automate processes and standard operating procedures across the managed services team. These processes could involve working with a variety of technology stacks. - Engage effectively with customers, vendors, and other team members - Obtain and/or maintain technical skills required to meet the obligations of our customers - Document operational processes / procedures to optimize support and management of systems - Be proactive in spotting and fixing potential problems - Provide emergency after-hours support as part of a scheduled on-call rotation - Provide periodic after-hours support for scheduled maintenance activities Expectations - Recognized subject matter expert in professional discipline - Contribute to development of innovative and high impact solutions for complex challenges - Provide measurable input into new products, processes, standards, and / or plans - Demonstrate deep expertise across multiple automation/tooling technologies - Able to support the deployment of moderately complex solutions - Communicate with internal customers and relevant stakeholders - Provide measurable input into new products, processes, standards, and / or plans Required Skills & Expertise - BS/BA Degree in Computer Science or equivalent industry experience - Recognized subject matter expert in professional discipline - 3+ years administrating an enterprise environment with 24x7x365 uptime requirements - Demonstrated experience with monitoring and event management technologies - Scripting and automation skills with PowerShell Perl or Python - Experience interacting with SOAP and Rest APIs - Excellent oral and written communication skills - Experience with LogicMonitor platform - Experience with Datadog platform - Experience with API development and integrating infrastructure technologies. - Experience with Elastic Observability Platform Desired Skills & Experience - Experience with ServiceNow - Industry technical certifications such as MCSA, MCSE, ITIL, CCNA, NPP etc. - Experience working in a Managed Services organization - Experience working for a SaaS provider or MSP - Multiple certifications in LogicMonitor: LMCA, LMCP, LMCI, & LMCD - Elastic certified Engineer, Observability Engineer, or Analyst
-
Observability Engineer
6 days ago
Bengaluru, Gurugram, Pune, India Xebia It Architects Full time US$ 90,000 - US$ 1,20,000 per yearSoftware Engineer with 3+ years of experience, specializing in developing and managing Python-based projects and data pipelines.Should have Proven track record in enhancing pipeline performance, ensuring SLA compliance, and automating workflows to streamline.Strong Experience in SRE Best Practices and Observabilityoperations and reduce manual effort....
-
Observability Engineer
3 weeks ago
India GreatHR Solutions Full timePosition: Observability Engineer Relevant Years of Experience: 6+ yearsJob Type: ContractWork Location: RemoteSalary Range: 12L - 15L PARole Overview: The Observability Engineer will lead the instrumentation, telemetry routing, backend stack evaluation, and validation process to migrate RegEd's observability stack from New Relic to OpenTelemetry with Azure...
-
Observability Engineer
3 weeks ago
India GreatHR Solutions Full timePosition: Observability Engineer Relevant Years of Experience: 6+ years Job Type: Contract Work Location: Remote Salary Range: 12L - 15L PA Role Overview: The Observability Engineer will lead the instrumentation, telemetry routing, backend stack evaluation, and validation process to migrate RegEd's observability stack from New Relic to OpenTelemetry with...
-
Observability Engineer
4 days ago
Bengaluru, Chennai, Gurugram, India Krazy Mantra HR Solutions Pvt. Ltd Full time ₹ 15,00,000 - ₹ 25,00,000 per yearWe are looking for a skilled Observability Engineer with 4 to 7 years of experience. The ideal candidate will have expertise in Data Integration & Standardization, OTEL, Java, Python, or frontend languages, and Grafana. This position is available in Bangalore, Mumbai, Chennai, Hyderabad, Noida, and Gurgaon.Roles and ResponsibilityDesign and implement data...
-
Gurugram, India Dunnhumby Full timeWe are seeking a talented Service Experience Manager with service management strategy for Media Were looking for : - 8+ years in service management, including systems monitoring, site reliability engineering, or infrastructure operations.- 2+ years in a team lead or managerial role- Coaches team members to deepen both technical monitoring skills and business...
-
Observability Engineer
2 days ago
India Sophos Full time ₹ 6,00,000 - ₹ 18,00,000 per yearAbout Us Role Summary We are looking for a skilled Observability Engineer to join our IT Operations team. The ideal candidate will have hands-on experience with infrastructure and application monitoring tools, incident management platforms, and cloud monitoring. This role will focus on managing and optimizing our observability platforms, ensuring proactive...
-
Observability Engineer
4 days ago
India Sophos Technology GmbH Full time ₹ 40,00,000 - ₹ 1,20,00,000 per yearAbout UsSophos is a global leader and innovator of advanced security solutions for defeating cyberattacks. The company acquired Secureworks in February 2025, bringing together two pioneers that have redefined the cybersecurity industry with their innovative, native AI-optimized services, technologies and products. Sophos is now the largest pure-play Managed...
-
Observability Architect
2 days ago
India SigNoz Full time ₹ 12,00,000 - ₹ 36,00,000 per yearAbout the RoleWe're seeking an Observability Architect to help engineering teams successfully adopt and scale observability practices using SigNoz and OpenTelemetry. This is a deeply technical role where you'll work directly with customers to design, implement, and optimize observability architectures for complex distributed systems.You won't be writing...
-
Lead Observability Engineer
3 weeks ago
Bengaluru, India InvestCloud, Inc. Full timeJob Description Key Responsibilities - Own the design, deployment, and lifecycle management of the Splunk Enterprise platform, including indexer and search head clustering, forwarders, and knowledge objects. - Define and implement best practices for data onboarding, parsing, enrichment, and storage to support observability use cases. - Collaborate with...
-
Platform Engineer/Distributed Systems Engineer
2 weeks ago
Gurugram, India whitetable.ai Full timeDescription :Job Title : Platform Engineer / Distributed Systems EngineerLocation : Full Time, In Office (Gurugram / Bengaluru)About Us :We are disrupting the Observability domain by leveraging AI Agents and Large Language Models (LLMs) to revolutionize monitoring, troubleshooting, and automation for applications, cloud, and on-prem infrastructure. Our...