Observability Engineer

2 weeks ago


Navi Mumbai, Maharashtra, India South Star Full time ₹ 12,00,000 - ₹ 24,00,000 per year

At South Star, we are committed to empowering businesses in the telecom sector by providing comprehensive support and management solutions for their IT infrastructure. Based in Navi Mumbai, we specialize in offering a wide range of services, including application environment support, IT infrastructure environment management, network monitoring, end-user support, and service design & monitoring. Our focus on the telecom sector allows us to deliver exceptional results that are tailored to our unique clients.

We are looking for an Observability Engineer to design, implement, and manage our enterprise-level monitoring and observability infrastructure. The successful candidate will be responsible for architecting robust observability solutions utilizing industry-leading platforms such as Grafana, Prometheus, AppDynamics, and Splunk Observability. This position supports engineering teams by providing advanced dashboards, effective alerting mechanisms, and comprehensive data correlation that deliver critical insights into system performance, reliability, and behavior.

Key Responsibilities

Architecture & Design

  • Design and implement scalable observability architectures that support monitoring across cloud, on-premises, and hybrid environments
  • Establish observability standards, patterns, and best practices across the organization
  • Evaluate and integrate new monitoring technologies and tools to enhance visibility capabilities
  • Design data retention, aggregation, and storage strategies for metrics, logs, and traces

Platform Management

  • Deploy, configure, and maintain enterprise monitoring platforms including Grafana, Prometheus, AppDynamics, and Splunk Observability
  • Ensure high availability, performance, and scalability of observability infrastructure
  • Manage platform upgrades, patches, and capacity planning
  • Integrate observability tools with existing CI/CD pipelines and infrastructure automation

Dashboard & Visualization Development

  • Create and maintain comprehensive dashboards that provide actionable insights for application and infrastructure teams
  • Build executive-level reporting dashboards for system health and performance metrics
  • Develop custom visualizations tailored to specific business and technical requirements
  • Implement role-based access and dashboard governance

Alerting & Incident Response

  • Design intelligent alerting strategies that minimize noise and prioritize critical issues
  • Configure multi-channel alert routing and escalation policies
  • Establish SLI/SLO/SLA frameworks and implement corresponding monitoring
  • Collaborate with incident response teams to improve detection and diagnosis capabilities
  • Conduct post-incident reviews to enhance monitoring coverage and alert accuracy

Collaboration & Enablement

  • Partner with development, operations, and security teams to instrument applications and infrastructure
  • Provide guidance on observability best practices, including logging standards, metrics collection, and distributed tracing
  • Conduct training sessions and create documentation for observability tools and practices
  • Act as subject matter expert for monitoring-related questions and troubleshooting

Required Qualifications

  • 3-5+ years of experience with enterprise monitoring and observability platforms
  • Hands-on expertise with Grafana, Prometheus, AppDynamics, and Splunk Observability (or similar tools)
  • Strong understanding of monitoring fundamentals: metrics, logs, traces, and events
  • Experience with containerized environments (Kubernetes, Docker)
  • Proficiency in scripting languages (Python, Bash, PowerShell) for automation
  • Knowledge of application performance monitoring (APM) concepts and practices
  • Experience with configuration management tools (Ansible, Terraform) for infrastructure as code
  • Understanding of networking, system administration, and distributed systems architecture

Preferred Qualifications

  • Experience with OpenTelemetry and distributed tracing implementations
  • Familiarity with PromQL, SPL (Splunk Processing Language), and other query languages
  • Knowledge of time-series databases (InfluxDB, TimescaleDB, Prometheus TSDB)
  • Experience implementing SRE practices and establishing SLI/SLO frameworks
  • Background in software development or DevOps engineering
  • Certifications in relevant monitoring platforms or cloud technologies
  • Experience in regulated industries with compliance monitoring requirements

Technical Skills

  • Monitoring Platforms: Grafana, Prometheus, AppDynamics, Splunk Observability
  • Scripting/Programming: Python, Bash, Go, PowerShell
  • Container Orchestration: Kubernetes, Docker, container monitoring best practices
  • Configuration Management: Ansible, GitOps workflows
  • Data Formats: JSON, YAML, Prometheus exposition format
  • Version Control: Git, GitLab/GitHub

Personal Attributes

  • Strong analytical and problem-solving abilities
  • Excellent communication skills with ability to explain complex technical concepts
  • Self-motivated with ability to work independently and prioritize effectively
  • Detail-oriented with commitment to documentation and knowledge sharing
  • Collaborative mindset with focus on enabling team success

Shift Schedule

  • The standard in-office schedule is Monday to Friday, from 11:00 am to 8:00 pm IST. Remote work is permitted during maintenance windows.


  • Mumbai, Maharashtra, India Integra Software Services Full time

    Detail Specification No. of Positions 1 Work Location Mumbai Job DescriptionWe are seeking a highly experienced and visionary Observability Lead to spearhead our monitoring and infrastructure management initiatives. This senior role requires deep expertise in the Elastic Stack and a comprehensive understanding of modern distributed system telemetry.The...


  • Mumbai, Maharashtra, India Ashnik Pte Ltd Full time US$ 12,00,000 - US$ 30,00,000 per year

    Location: MumbaiPlease send us your resume at Build platforms that process billions of events per day and power mission-critical decisions.About the RoleAshnik is looking for a smart and hands-on Solution Architect with deep expertise in observability and modern AI-driven architectures.This is a senior, customer-facing role where you will design, architect...


  • Mumbai, Maharashtra, India VuNet Systems Full time

    Join Our Journey at VuNetVuNet is a pioneer in Business Journey Observability , leveraging Big Data and Machine Learning to transform digital experiences across the financial services. Our deep-tech platform provides end-to-end visibility into customer journeys — empowering proactive issue resolution, operational resilience, and superior user...

  • Sr. Engineer-QA

    1 week ago


    Navi Mumbai, Maharashtra, India Way2Go Full time

    ROLE: - Sr. Engineer – QA / QC - MEP (Electrical / Mechanical ) SALARY: - UPTO 10 LPA LOCATION: - Navi -Mumbai. QUALIFICATION: - BE / B Tech in Electrical or Mechanical EXPERIENCE: - 5 to 8 years of relevant experience ROLE- Sr. Engineer – QA/QC is responsible to ensure the quality process is followed without any deviation and delay. KEY...

  • DevOps Engineer

    2 weeks ago


    Navi Mumbai, Maharashtra, India Allerin Full time ₹ 20,00,000 - ₹ 25,00,000 per year

    Company DescriptionAllerin is a software solutions provider known for delivering innovative and agile solutions. We emphasize efficiency from design to development to delivery, ensuring our solutions not only meet current requirements but also scale for future needs. Our focus on increasing client productivity has made us proficient, reliable, and stable...


  • Mumbai, Maharashtra, India DHI Solutions Full time

    DescriptionAn SRE spends just as much of their time working on systems as they do writing code. Youll be tasked with all manner of work from building operational tooling, automating operational workflows, performing architecture and design reviews, investigating system failures and complex outages, improving our monitoring infrastructure, defining service...


  • Navi Mumbai, Maharashtra, India Weekday AI Full time

    This role is for one of the Weekday's clientsSalary range: Rs Rs ie INR 4-6 LPA)Min Experience: 14 yearsLocation: IndiaJobType: full-time As a Lead Engineering Manager – Machine Learning, you will play a pivotal leadership role in building and scaling high-impact, production-grade machine learning systems that power data-driven products and platforms. This...


  • Navi Mumbai, Maharashtra, India K20s - Kinetic Technologies Private Limited Full time

    **Cloud Operations EngineerLocation:Navi Mumbai (Airoli)Experience 4 + YearsExperience**5+ years overall in Cloud Operations, including:Minimum 5 years of hands-on experience with Google Cloud Platform (GCP)Minimum 3 years of experience in Kubernetes administrationCertificationsGCP Certified Professional – MandatoryWork Hours24x7 support coverageRotational...

  • on Engineer

    1 week ago


    Mumbai, Maharashtra, India VuNet Systems Full time

    Join Our Journey at VuNetVuNet is a pioneer in Business Journey Observability , leveraging Big Data and Machine Learning to transform digital experiences across the financial services. Our deep-tech platform provides end-to-end visibility into customer journeys — empowering proactive issue resolution, operational resilience, and superior user...


  • Navi Mumbai, Maharashtra, India Pert Telecom Solutions Full time

    Role: Presales Solutions Engineer/ManagerExperience: Minimum 4 Years as a relevant experienceLocation: Gurgaon/MumbaiReporting to: Director/Lead Solution ArchitectJob descriptionWe are looking for a suitable candidate with great communication skills and an understanding of the Telecom industry. As a Senior Solutions Expert you will work closely across...