Command Center/Site Reliability Manager
3 weeks ago
We are seeking a strategic and operationally strong Command Center / Site Reliability Manager to lead our global incident response and network operations functions. This leadership role is responsible for driving operational excellence, leading a high-performing team, and ensuring the resilience and reliability of our production systems and services. You will lead the team responsible for 24x7 incident detection, escalation, communication, and resolution of critical service outages while overseeing real-time monitoring and triage of infrastructure and application health.Responsibilities : - Lead end-to-end management of Critical Service Outages (P0/P1 incidents), driving timely resolution through coordinated incident response, effective communication with stakeholders, and robust post-incident reviews with actionable remediation.- Oversee a 24x7 Network Operations Center (NOC), implementing scalable observability, alerting, and monitoring strategies to ensure infrastructure, application, and network reliability. - Continuously optimize alert triage, diagnostics, and noise reduction to boost efficiency.- Build and develop a high-performing team of incident managers, NOC engineers, and shift leads. - Foster operational maturity through training, performance management, and close collaboration with Engineering, SRE, DevOps, and Product teams.- Define and uphold standards for incident SLAs, escalation processes, runbooks, and playbooks, while ensuring continuous shift coverage, smooth handoffs, and comprehensive KPI reporting on system health and incident trends.Requirements :- 6+ years of experience in Technical Operations, Site Reliability, NOC, or Incident Management roles.- 2+ years in a people management or team leadership role.- Deep knowledge of major incident management, escalation practices, and real-time service recovery strategies.- Strong technical understanding of cloud-native architectures (AWS, Azure, GCP), infrastructure monitoring, and DevOps practices.- Proven experience working with observability tools (e. g., Datadog, Splunk, Grafana, Prometheus), incident tools (PagerDuty), and ITSM platforms (e. g., ServiceNow, Jira).- Prior experience supporting high-availability SaaS or telecommunications systems is a strong plus.- Experience with customer-facing incident communication practices. (ref:hirist.tech)
-
Manager, Site Reliability Engineering
1 week ago
Gurugram, India Cvent Full timeCvent is looking for a Manager, Site Reliability Engineering to help us scale our systems and ensure stability, reliability and performance and rapid deployments of our platform. We build teams that are inclusive, collaborative, and have a strong sense of ownership for the things they build. If you have a passion and track record for solving problems;...
-
Manager, Site Reliability Engineering
1 week ago
Gurugram, India Cvent Full timeCvent is looking for a Manager, Site Reliability Engineering to help us scale our systems and ensure stability, reliability and performance and rapid deployments of our platform. We build teams that are inclusive, collaborative, and have a strong sense of ownership for the things they build. If you have a passion and track record for solving problems;...
-
Manager, Site Reliability Engineering
1 week ago
Gurugram, India Cvent Full timeCvent is looking for a Manager, Site Reliability Engineering to help us scale our systems and ensure stability, reliability and performance and rapid deployments of our platform. We build teams that are inclusive, collaborative, and have a strong sense of ownership for the things they build. If you have a passion and track record for solving problems;...
-
Manager, Site Reliability Engineering
1 week ago
Gurugram, India Cvent Full timeCvent is looking for a Manager, Site Reliability Engineering to help us scale our systems and ensure stability, reliability and performance and rapid deployments of our platform. We build teams that are inclusive, collaborative, and have a strong sense of ownership for the things they build. If you have a passion and track record for solving problems;...
-
Manager, Site Reliability Engineering
1 week ago
gurugram, India Cvent Full timeCvent is looking for a Manager, Site Reliability Engineering to help us scale our systems and ensure stability, reliability and performance and rapid deployments of our platform. We build teams that are inclusive, collaborative, and have a strong sense of ownership for the things they build. If you have a passion and track record for solving problems;...
-
Manager, Site Reliability Engineering
1 week ago
Gurugram, India Cvent Full timeCvent is looking for a Manager, Site Reliability Engineering to help us scale our systems and ensure stability, reliability and performance and rapid deployments of our platform. We build teams that are inclusive, collaborative, and have a strong sense of ownership for the things they build. If you have a passion and track record for solving problems;...
-
Manager, Site Reliability Engineering
1 week ago
gurugram, India Cvent Full timeCvent is looking for a Manager, Site Reliability Engineering to help us scale our systems and ensure stability, reliability and performance and rapid deployments of our platform. We build teams that are inclusive, collaborative, and have a strong sense of ownership for the things they build. If you have a passion and track record for solving problems;...
-
Manager, Site Reliability Engineering
1 week ago
Gurugram, India Cvent Full timeCvent is looking for a Manager, Site Reliability Engineering to help us scale our systems and ensure stability, reliability and performance and rapid deployments of our platform. We build teams that are inclusive, collaborative, and have a strong sense of ownership for the things they build. If you have a passion and track record for solving problems;...
-
Manager, site reliability engineering
7 days ago
Gurugram, India Cvent Full timeCvent is looking for a Manager, Site Reliability Engineering to help us scale our systems and ensure stability, reliability and performance and rapid deployments of our platform. We build teams that are inclusive, collaborative, and have a strong sense of ownership for the things they build. If you have a passion and track record for solving problems;...
-
Site Reliability Engineer
1 week ago
Gurugram, India S&P Global Full timeThis job is with S&P Global, an inclusive employer and a member of myGwork – the largest global platform for the LGBTQ+ business community. Please do not contact the recruiter directly.About the Role:OSTTRA India The Role: Site Reliability Engineer The Team:SRE is a global team that provides technical support across the suite of OSTTRA products. The SRE...