Engineer/Senior Engineer – Observability

3 hours ago


Chennai, Tamil Nadu, India TOCUMULUS Full time ₹ 1,00,00,000 - ₹ 2,00,00,000 per year

About Us:
ToCumulus Technology Solutions is a leading IT solutions provider specializing in cloud transformation, application modernization, and digital innovation. We are committed to helping enterprises like ADNOC accelerate their digital journey and modernize their legacy applications.

Job description:

The Engineer/Senior Engineer – Observability Engineering is key member of Service Reliability Engineering. He/she will be ultimately responsible for system Observability, reliability Monitoring and reducing time to detect by continuously finetuning the monitoring infrastructure of the services our SRE team supports.

As a Reliability engineering team member- With proactive and predictive monitoring our Production & development team can continue to innovate by spotting small bugs and big disasters before they actually happen. That's your main mission as an Monitoring & Observability Engineer. Next to our Elastic community, you'll be part of our multidisciplinary Innovative Tech team, where DevOps, Agile, Cloud & Software Engineering experts all work together to create remarkable solutions based on cutting-edge technology.

Location
:
Chennai (Preferred) /Mumbai

What will you be doing?

·
Implement, maintain, and consult on the observability and monitoring framework that supports the needs of multiple internal stakeholders.

· Manage Opera/Prometheus/Grafana/Splunk to support custom metric delivery dashboards.

· Design and build an observability infrastructure for all engineering teams to consume

· Design and develop tools for metric collection, analysis, and reporting

· Educate and lead efforts to improve observability among all engineering teams

· Responsible for the availability, performance, scaling, monitoring and incident response of FSS technology platform and services.

· Ensure the site and services are up 24*7 with no unplanned downtimes. Participate in a rotating on-call schedule to troubleshoot and resolve production escalations from our 24x7x365 NOC & Customer Success teams

· Debugging of the code issues based on web service and API responses, errors, events, logs, etc. Monitor and optimize application performance within the deployment architecture.

· Identify and collect the appropriate measurements, and synthesize the correct queries, to show intuitive and insightful visualizations which characterize the behavior of complex systems

· Continue evolving monitoring tooling toward a standards-based self-service automated platform and come up with creative solutions to solve problems

· Ensure proper reviews are built to minimize the Mean Time to Recover (MTTR) and Mean Time to Failure (MTTF).

· Implementation of ITIL processes like Incident management, problem management and change management.

· You will add, tune and maintain alert configurations and documentation as needed.

What you will bring along

· BS/MS/MCA Degree in Computer Science, Electrical & Computer Engineering or Mathematics or equivalent experience.

· 3-8 years of relevant reliability engineering work experience in any of the Online technology companies.

· Ability to understand the business services and map it to the reliability engineering design and review

· Excellent analytical, problem-solving and communication skills

· Driven and self-motivated, work creatively to solve challenging problems.

· Experience with design and implementation of Continuous Delivery and/or DevOps solutions or architecture patterns.

· Experience with code repository management, code merge and quality checks, continuous integration, and automated deployment & management using tools like Jenkins, Git, Ansible, Artifactory, Jira, Sonar

· Abreast of industry standards and trends related to telemetry and software pipelines

· Experience rationalizing and implementing monitoring and observability toolchain at enterprise scale

· Previous experience of public clouds (AWS and Terraform)

· Knowledge and experience of containers and Kubernetes cluster

· Hands on experience consolidating application and system logs at enterprise scale

· Experience with automation tools (Chef, Ansible)

· Experience with metrics exporters and integrations

· Experience with metrics collection and storage (Prometheus, Influx DB)

· Experience with log collection and storage (ELK, Splunk, Logstash)

· Experience with metric and log query languages (PromQL, LogQL, Sumo Logic)

· Experience with alert and notification management (Alertmanager, PagerDuty, Teams integrations)

· Experience with building dashboards (Grafana, Loki, Sumologic, Tenable)

· Proven development background with Go, Python, Shell or Java

· Security awareness, with an emphasis on designing for security best practices

Why Join Us?

  • Be part of a transformative project with leading enterprise clients.
  • Opportunity to work with cutting-edge cloud technologies.
  • Collaborative and innovative work environment.
  • Competitive salary and benefits.


  • Chennai, Tamil Nadu, India HariNex Solutions Full time ₹ 9,60,000 - ₹ 15,60,000 per year

    Job description: Engineer/Senior Engineer – ObservabilityLocation: Chennai (Preferred) /MumbaiRole Type- ContractGrafana Developer Expertise ( Grafana, Prometheus , Splunk) With 2~3 years of ExperienceThe Engineer/Senior Engineer – Observability Engineering is key member of Service Reliability Engineering. He/she will be ultimately responsible for system...

  • Proposal Engineer

    2 weeks ago


    Chennai, Tamil Nadu, India Tetakisu Engineer Full time ₹ 40,00,000 - ₹ 1,20,00,000 per year

    Responsibilities:* Collaborate with sales team on proposal development* Ensure compliance with company standards and policies* Prepare detailed proposals within deadlines* Manage budget and resources effectively

  • Campus Ambassador

    2 days ago


    Chennai, Tamil Nadu, India Upcoming Engineer Full time ₹ 60,000 - ₹ 1,20,000 per year

    Upcoming Engineer is on a mission to make India's engineering students career-ready through workshops, expert sessions, and real-world exposure.We're looking for dynamic Campus Ambassadors in Chennai who are passionate about personal growth, networking, and helping fellow students prepare for the professional world.What You'll Do:Represent Upcoming Engineer...


  • Chennai, Tamil Nadu, India VCS Staffing Geek Full time ₹ 15,00,000 - ₹ 25,00,000 per year

    Urgent Hiring-Role-Observability EngineerOffice Location: Chennai, IndiaWork Mode - HybridResponsibilitiesBuilding data pipelines: Design, build, and maintain observability data pipelines to ingest metrics, logs, and traces, ideally using the OpenTelemetry (OTEL) standard.Scripting & Automation: Develop and maintain automation scripts and tools to streamline...


  • Chennai, Tamil Nadu, India Standard Chartered Bank Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Job ID: 40310Location: Chennai, INArea of interest: TechnologyJob type: Regular EmployeeWork style: Office WorkingOpening date: 17 Sept 2025Job SummaryAs the Technical Squad Lead, Central Platform Development, you will play a critical role in making the internal state of the bank's application and infrastructure services visible to stakeholders for...


  • Chennai, Tamil Nadu, India Bahwan CyberTek Full time ₹ 20,00,000 - ₹ 25,00,000 per year

    Required Skills & Qualifications:8+ years of experience in IT infrastructure, observability, or monitoring engineering.Strong hands-on experience with modern observability platforms and tools.Proficiency in automation and scripting (Python, PowerShell) and infrastructure-as-code tools (Terraform, Ansible).Experience with AIOps platforms and their integration...


  • Chennai, Tamil Nadu, India VCS Staffing Geek Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Job type: 1 Years Extendable ContractRole Observability Migration EngineerLocation: ChennaiMode- HybridResponsibilitiesBuilding data pipelines: Design, build, and maintain observability data pipelines to ingest metrics, logs, and traces, ideally using the OpenTelemetry (OTEL) standard.Scripting & Automation: Develop and maintain automation scripts and tools...


  • Chennai, Tamil Nadu, India Ford Motor Full time ₹ 1,00,00,000 - ₹ 2,00,00,000 per year

    DescriptionSeeking a highly skilled Technical Project Manager who has hands on experience with Hybrid Cloud, Kubernetes, Device Edge and Observability. In this role, you will contribute to building Edge Observability by collaborating with a diverse cross functional and cross geographical (US and India) as well as act as liason with Manufacturing on...


  • Chennai, Tamil Nadu, India Ford Motor Company Full time ₹ 15,00,000 - ₹ 30,00,000 per year

    Seeking a highly skilled Technical Project Manager who has hands on experience with Hybrid Cloud, Kubernetes, Device Edge and Observability. In this role, you will contribute to building Edge Observability by collaborating with a diverse cross functional and cross geographical (US and India) as well as act as liason with Manufacturing on Infrastructure...


  • Chennai, Tamil Nadu, India NeoSOFT Full time ₹ 6,00,000 - ₹ 18,00,000 per year

    Role & responsibilities3-8 years of relevant reliability engineering work experience in any of the Online technology companies.Ability to understand the business services and map it to the reliability engineering design and reviewExcellent analytical, problem-solving and communication skillsDriven and self-motivated, work creatively to solve challenging...