Current jobs related to Command Center/Site Reliability Manager - Gurgaon, Haryana - Zyoin Group

  • Site Reliability

    4 days ago


    Gurgaon, Haryana, India Weekday Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    This role is for one of our clients. Company Name: Neemtree. Industry: Technology, Information and Media. Seniority level: Mid-Senior level. Min Experience: 4 years. Location: Gurugram, Delhi, NCR. Job Type: full-time. We're looking for a Site Reliability & Automation Engineer who thrives at the intersection of infrastructure, automation, and reliability. In this role,...


  • Gurgaon, Haryana, India American Express Full time ₹ 15,00,000 - ₹ 25,00,000 per year

    At American Express, our culture is built on a 175-year history of innovation, shared values and Leadership Behaviors, and an unwavering commitment to back our customers, communities, and colleagues. As part of Team Amex, you'll experience this powerful backing with comprehensive support for your holistic well-being and many opportunities to learn new...


  • Gurgaon, Haryana, India American Express Full time ₹ 1,50,000 - ₹ 28,00,000 per year

    At American Express, our culture is built on a 175-year history of innovation, shared values and Leadership Behaviors, and an unwavering commitment to back our customers, communities, and colleagues. As part of Team Amex, you'll experience this powerful backing with comprehensive support for your holistic well-being and many opportunities to learn new...


  • Gurgaon, Haryana, India Gemini Solutions Pvt Ltd Full time ₹ 20,00,000 - ₹ 25,00,000 per year

    Position Summary: In this role, you will play a crucial part in shaping the firm's infrastructure reliability and efficiency by implementing robust Site Reliability Engineering practices. Your contribution will be pivotal in ensuring the availability, scalability, and performance of our systems and applications. Leveraging your strong technical skills and...


  • Gurgaon, Haryana, India Cvent Full time US$ 1,50,000 - US$ 2,00,000 per year

    Cvent is looking for a Manager, Site Reliability Engineering to help us scale our systems and ensure the stability, reliability, performance, and rapid deployment of our platform. We build teams that are inclusive, collaborative, and have a strong sense of ownership for the things they build. If you have a passion and track record for solving problems;...


  • Gurgaon, Haryana, India Aerial Telecom Solutions (ATS) Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Position Overview: The SRE Lead will be responsible for managing a team of engineers focused on software deployments and site reliability engineering practices. The role will involve overseeing the deployment process of software applications and services, implementing automation, monitoring, and alerting tools, and ensuring the reliability, availability, and...


  • Gurgaon, Haryana, India ElevenX Capital Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    About the Role: We are looking for a skilled Site Reliability Engineer (SRE) to join our team and help us ensure the reliability, scalability, and performance of our critical systems. As an SRE, you will work closely with development and operations teams to build and maintain highly available services, automate operational tasks, and monitor system health. Key...


  • Gurgaon, Haryana, India NatWest Group Full time ₹ 9,00,000 - ₹ 12,00,000 per year

    Site Reliability Engineer: Join us as a Site Reliability Engineer. In this key role, you'll support the improvement of non-functional and operational characteristics such as availability, performance, efficiency, change management, monitoring, security, incident response, and capacity planning of our products and services. You'll enjoy significant...


  • Gurgaon, Haryana, India RBS Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Join us as a Site Reliability Engineer. In this key role, you'll support the improvement of non-functional and operational characteristics such as availability, performance, efficiency, change management, monitoring, security, incident response, and capacity planning of our products and services. You'll enjoy significant stakeholder interaction, working in...


  • Gurgaon, Haryana, India Bravura Solutions Full time ₹ 10,00,000 - ₹ 25,00,000 per year

    Bravura's Commitment and Mission: At Bravura Solutions, collaboration, diversity and excellence matter. We value your ideas, giving you room to be curious and innovate in an exciting, fast-paced, and flexible environment. We look for many different skills and abilities, as well as how you can add value to Bravura and our culture. As a Global FinTech market...

Command Center/Site Reliability Manager

4 weeks ago


Gurgaon, Haryana, India Zyoin Group Full time

We are seeking a strategic and operationally strong Command Center / Site Reliability Manager to lead our global incident response and network operations functions.

This leadership role is responsible for driving operational excellence, leading a high-performing team, and ensuring the resilience and reliability of our production systems and services.

You will lead the team responsible for 24x7 incident detection, escalation, communication, and resolution of critical service outages while overseeing real-time monitoring and triage of infrastructure and application health.

Responsibilities:

- Lead end-to-end management of Critical Service Outages (P0/P1 incidents), driving timely resolution through coordinated incident response, effective communication with stakeholders, and robust post-incident reviews with actionable remediation.

- Oversee a 24x7 Network Operations Center (NOC), implementing scalable observability, alerting, and monitoring strategies to ensure infrastructure, application, and network reliability.

- Continuously optimize alert triage, diagnostics, and noise reduction to boost efficiency.

- Build and develop a high-performing team of incident managers, NOC engineers, and shift leads.

- Foster operational maturity through training, performance management, and close collaboration with Engineering, SRE, DevOps, and Product teams.

- Define and uphold standards for incident SLAs, escalation processes, runbooks, and playbooks, while ensuring continuous shift coverage, smooth handoffs, and comprehensive KPI reporting on system health and incident trends.

Requirements:

- 6+ years of experience in Technical Operations, Site Reliability, NOC, or Incident Management roles.

- 2+ years in a people management or team leadership role.

- Deep knowledge of major incident management, escalation practices, and real-time service recovery strategies.

- Strong technical understanding of cloud-native architectures (AWS, Azure, GCP), infrastructure monitoring, and DevOps practices.

- Proven experience working with observability tools (e.g., Datadog, Splunk, Grafana, Prometheus), incident tools (e.g., PagerDuty), and ITSM platforms (e.g., ServiceNow, Jira).

- Prior experience supporting high-availability SaaS or telecommunications systems is a strong plus.

- Experience with customer-facing incident communication practices.

(ref:hirist.tech)