Observability Lead

3 weeks ago


India Aptimized Full time

Job Summary : Aptimized is seeking a highly skilled Observability Lead to spearhead our monitoring, logging and analytics initiatives. The ideal candidate will have expertise in Grafana, Vector, Power BI and Fabric Resource, ensuring comprehensive system visibility, performance optimization and data-driven insights. This role involves designing and implementing observability solutions to enhance operational efficiency and proactive incident management. Key Responsibilities : - Develop and Implement Observability Strategies : Design and maintain end-to-end observability frameworks leveraging Grafana, Vector, Power BI and Fabric Resource. - Monitoring & Dashboards : Create and optimize dashboards, alerts and visualizations to provide real-time system performance insights. - Log Management & Aggregation : Configure and maintain Vector for efficient log collection, transformation and shipping across distributed environments. - Performance Analytics & Reporting : Utilize Power BI and Fabric Resource to analyze system performance metrics and generate actionable insights for stakeholders. - Incident Detection & Resolution : Implement automated alerts and anomaly detection mechanisms to ensure proactive issue resolution. - Collaboration & Stakeholder Engagement : Work with DevOps, SRE and IT teams to define observability best practices and integrate monitoring solutions into CI/CD pipelines. - Continuous Improvement : Stay updated with industry best practices and emerging observability technologies to enhance system monitoring capabilities. Required Qualifications : - Proficiency in Grafana : Experience in building real-time dashboards, configuring alerts and integrating with various data sources (i.e., Prometheus, Loki, InfluxDB). - Vector Expertise : Strong understanding of log collection, processing, and routing using Vector in cloud or on-prem environments. - Power BI & Fabric Resource Knowledge : Ability to transform system telemetry data into meaningful insights using Microsoft's Power BI and Fabric Resource. - Scripting & Automation : Hands-on experience with scripting (Python, Bash, or PowerShell) for automating monitoring tasks. - Cloud & Infrastructure Monitoring : Experience in observability solutions for AWS, Azure or Google Cloud environments. - Strong Analytical Skills : Ability to interpret performance data, identify trends and recommend optimizations. - Excellent Communication Skills : Ability to present insights and recommendations to technical and non-technical stakeholders. Preferred Qualifications : - Experience with additional monitoring tools such as Prometheus, OpenTelemetry, or Datadog. - Familiarity with infrastructure as code (Terraform, Ansible) for deploying monitoring configurations. - Knowledge of distributed systems and microservices architectures (ref:hirist.tech)


  • Observability Lead

    3 weeks ago


    India Aptimized Full time

    Job SummaryAptimized seeks a highly skilled Observability Lead to spearhead monitoring, logging and analytics initiatives. The ideal candidate will have expertise in Grafana, Vector, Power BI and Fabric Resource ensuring comprehensive system visibility, performance optimization and data-driven insights.Main ResponsibilitiesDevelop and Implement Observability...


  • India beBee Careers Full time

    Senior Observability ExpertWe are seeking an experienced Senior Observability Expert to join our team. This role requires a proven track record of expertise in observability, monitoring, and performance management of large-scale, distributed systems.Key Responsibilities:Develop comprehensive solutions for complex issues.Leverage automation concepts and tools...


  • India beBee Careers Full time

    About The RoleWe are seeking a skilled Senior Product Manager to play a crucial role in defining the product vision for Observability. This individual will help define the product strategy, roadmap, and product definition, as well as drive execution and go-to-market support to launch new capabilities and drive adoption.Key responsibilities include:Contribute...


  • India beBee Careers Full time

    About the RoleWe are looking for an experienced Lead OpenTelemetry Developer to join our team. As a key member of our team, you will be responsible for developing and maintaining OpenTelemetry-based solutions.This role involves working on instrumentation, data collection, and observability tools to ensure seamless integration and monitoring of...


  • India beBee Careers Full time

    We are looking for an experienced software engineer to join our team and lead the development of OpenTelemetry-based solutions.This role involves working on instrumentation, data collection, and observability tools to ensure seamless integration and monitoring of applications.You'll have the opportunity to work on cutting-edge technologies and develop...


  • India NEXTHIRE LLP Full time

    Role : Cloud and Observability Engineer Experience : 3-6 Years Location : Gurugram About the Job : Coralogix is a modern, full-stack observability platform transforming how businesses process and understand their data. Our unique architecture powers in-stream analytics without reliance on expensive indexing or hot storage. We specialize in comprehensive...


  • India beBee Careers Full time

    Omnichannel Monitoring EngineerThis role requires an Omnichannel Monitoring Engineer to design and implement observability solutions for our enterprise clients. The ideal candidate will have expertise in monitoring and performance management of large-scale distributed systems.The successful candidate will have experience working with both legacy systems and...


  • India beBee Careers Full time

    As a skilled OpenTelemetry Developer, you will have the opportunity to work on challenging projects that involve developing and maintaining innovative solutions.This role involves writing code that impacts thousands of users every month. You'll implement your critical thinking and technical skills to develop cutting-edge software, and you'll have the chance...


  • India MINDTEL GLOBAL PRIVATE LIMITED Full time

    Job Description : We are seeking an experienced Senior Technical Architect to lead the design and implementation of DevOps, Observability, and Site Reliability Engineering (SRE) solutions. The ideal candidate will have a deep understanding of system architecture, automation, and best practices to enhance system reliability and performance across the...


  • Bengaluru, Karnataka, India, Karnataka HDFC Bank Full time

    Key Responsibilities:Design, implement, and maintain observability practices and toolsWork closely with Lead Observability Engineers and Architects/engineers of Other departments to gather requirements and provide solutionsImplement monitoring, logging, and alerting strategiesDevelop and implement dashboards, alerts, and metrics to track system health and...