DevOps - Observability and Monitoring
4 weeks ago
Job Role: Sr DevOps – Observability and Monitoring
Experience: 10+ Years
Location: Mumbai (Onsite)

About the Role: We are seeking an experienced Senior DevOps Observability and Monitoring Lead to design, implement, and manage comprehensive monitoring and observability solutions across our cloud and on-premise infrastructure. The role focuses on ensuring system reliability, performance, and proactive incident management through advanced monitoring, alerting, and observability strategies.

Key Responsibilities:
Lead the design, deployment, and maintenance of observability frameworks across applications and infrastructure.
Implement and manage monitoring, logging, tracing, and alerting solutions using tools such as Prometheus, Grafana, ELK Stack, Datadog, Splunk, or equivalent.
Collaborate with development, QA, and operations teams to ensure performance, availability, and reliability of critical systems.
Define and enforce best practices for monitoring, incident management, and observability across the organization.
Develop dashboards, metrics, and reports to provide actionable insights to stakeholders.
Implement automated alerting, anomaly detection, and root cause analysis processes.
Optimize monitoring solutions for scalability, performance, and cost-efficiency.
Mentor junior engineers and promote a culture of proactive system health and observability.
Evaluate and recommend new tools and technologies to enhance observability and monitoring capabilities.

Key Skills and Qualifications:
10+ years of experience in DevOps, cloud infrastructure, and observability/monitoring roles.
Strong hands-on experience with monitoring and observability tools (Prometheus, Grafana, ELK Stack, Datadog, Splunk, New Relic).
Solid understanding of cloud platforms (AWS, Azure, GCP) and hybrid infrastructure.
Experience with logging, tracing, and metrics collection for large-scale distributed systems.
Strong scripting and automation skills (Python, Bash, PowerShell) for monitoring and alerting workflows.
Knowledge of CI/CD pipelines, containerization (Docker), and orchestration (Kubernetes) is a plus.
Excellent problem-solving, leadership, and stakeholder management skills.
Proven experience in defining observability strategies and leading monitoring initiatives in enterprise environments.
-
DevOps Specialist
2 weeks ago
Delhi, India | Zoos Global | Full time
Company Description: Zoos Global is a trusted partner for over 100 Indian companies, assisting them in scaling their DevOps practices with the right SaaS tools. Zoos is the preferred partner for many Indian enterprises and startups, committed to collaboration with more tech leaders.
Role Description: This is a full-time on-site role located in Gurgaon for a...
-
DevOps Specialist
1 week ago
Delhi, India | Zoos Global | Full time
Company Description: Zoos Global is a trusted partner for over 100 Indian companies, assisting them in scaling their DevOps practices with the right SaaS tools. Zoos is the preferred partner for many Indian enterprises and startups, committed to collaboration with more tech leaders.
Role Description: This is a full-time on-site role located in Gurgaon for a...
-
Observability Architect
2 weeks ago
New Delhi, India | Tata Consultancy Services | Full time
Job Role: Observability Architect
Location: Hyderabad, Chennai, Bangalore, Pune, Mumbai, Delhi, Kolkata, Noida, Indore
Role Overview: We are seeking an experienced observability architect with deep expertise in AppDynamics and end-to-end observability design across enterprise applications, infrastructure, and cloud platforms. The architect will be responsible for defining...
-
Cloud Engineer - Observability
19 hours ago
New Delhi, India | Smarsh | Full time
About the team: The Observability team builds and manages the single telemetry and observability service used by all product teams on the Smarsh platform. It provides "as a service" telemetry, monitoring, and visualization capabilities that enable our product teams to operate, support, and triage the applications and services under their product portfolio....
-
Senior Site Reliability Engineer – Grafana
5 days ago
New Delhi, India | Aptimized | Full time
Job Description – Senior Site Reliability Engineer (SRE) – Grafana & Observability
Position: Senior Site Reliability Engineer – Grafana & Observability
Location: [Hyderabad / Hybrid]
Experience: 10–20+ years
Operating globally, Aptimized is a premium ERP, HCM, and Technology Optimization Consulting agency. Our team at Aptimized focuses on helping our...
-
Splunk Observability
1 week ago
New Delhi, India | Tata Consultancy Services | Full time
Role: Splunk Observability
Experience range: 4-6 years
Location: Bangalore
NOTE: Relevant experience in Splunk Observability is a must
Job description:
- Splunk Observability APM (Application Performance Monitoring)
- SignalFx query language
- OpenTelemetry
- Cloud Native Technologies: Familiarity with cloud native technologies like Kubernetes, Docker and...
-
New Delhi, India | Tata Consultancy Services | Full time
Greetings from TCS! TCS Hiring for Observability (Prometheus, Grafana, ELK Stack)
Job Location: Chennai
Experience Range: 6-10 Years
Job Description: Strong hands-on experience with observability tools: Prometheus, Grafana, ELK Stack (Elasticsearch, Logstash, Kibana), OpenTelemetry. Expertise in distributed tracing and metrics collection. Familiarity with...
-
Cloud Engineer - Observability
7 days ago
New Delhi, India | Smarsh | Full time
Who are we? Smarsh empowers its customers to manage risk and unleash intelligence in their digital communications. Our growing community of over 6500 organizations in regulated industries counts on Smarsh every day to help them spot compliance, legal or reputational risks in 80+ communication channels before those risks become regulatory fines or headlines....
-
Cloud Engineer - Observability
18 hours ago
New Delhi, India | Smarsh | Full time
Who are we? Smarsh empowers its customers to manage risk and unleash intelligence in their digital communications. Our growing community of over 6500 organizations in regulated industries counts on Smarsh every day to help them spot compliance, legal or reputational risks in 80+ communication channels before those risks become regulatory fines or headlines....