Manager, Site Reliability Engineering

4 weeks ago


New Delhi, India Cvent Full time

Cvent is looking for a Manager, Site Reliability Engineering to help us scale our systems and ensure stability, reliability and performance and rapid deployments of our platform. We build teams that are inclusive, collaborative, and have a strong sense of ownership for the things they build. If you have a passion and track record for solving problems; moreover, have strong leadership skills, this is a great fit for you.As Manager, SRE you will demonstrate both emerging and current technologies, methods, and processes contributing to the evolution of software deployment processes, enhancing security, reducing risk, and improving the overall end-user experience. As part of the Technology R&D Team, you will play an integral part in advancing DevOps maturity and be a part of a new culture of quality and site reliability. You will continually improve our CI/CD tools, processes, and procedures. You will also be responsible for regular reporting to Senior Technology Leaders and providing updates on organizational risk exposure and risk related issues.What You Will Be Doing:- Set the direction and strategy for your team, and help shape the overall SRE program for the company - Support the growth by ensuring a robust, scalable, cloud-first infrastructure - Own site stability, performance and capacity planning - Participate early in the SDLC to ensure reliability is built in from the beginning, and creating plans for successful implementations/launches - Foster a learning and ownership culture within the team and the larger Cvent organization - Ensure best engineering practices through automation, infrastructure as code, robust system monitoring, alerting, auto scaling, self-healing, etc... - Manage complex technical projects and a team of SREs - Recruit and develop staff; build a culture of excellence in site reliability and automation - Lead by example – roll up your sleeves by debugging and coding; participate in on-call rotation & occasional travel - Represent the technology perspective and priorities to leadership and other stakeholders by continuously communicating timeline, scope, risks, and technical road mapWhat You Need for this Position:- 12+ years of hands-on technical leadership and people management experience - 3+ years of demonstrable experience leading site reliability and performance in large-scale, high-traffic environments - Strong leadership, communication and interpersonal skills geared to getting things done - Developing themselves and the talent within their charge – fostering and creating opportunity for the team - Architect-level understanding of one or more of the major public cloud services (AWS, GCP or Azure), using them to effectively design secure and scalable services - Strong understanding of SRE concepts and the DevOps culture, with a focus on leveraging software engineering tools, methodologies and concepts - In-depth understanding of automation and CI/CD processes to go along with excellent reasoning and problem-solving skills - Experience with Unix/Linux environments with a deep grasp on system internals - Worked on large-scale distributed systems including multi-tiered architecture - Strong knowledge of modern platforms like Fargate, Docker, Kubernetes etc. - Experience working with monitoring tools (Datadog, NewRelic, ELK stack, etc) and Database technologies (SQL Server, Postgres and Couchbase preferred) - Validated breadth of understanding and development of solutions based on multiple technologies, including networking, cloud, database, and scripting languages. - Experience in prompt engineering, building AI Agents, or MCP is a plus



  • New Delhi, India People Hire Consulting Full time

    Looking for a Manager, Site Reliability Engineering to help us scale our systems and ensure stability, reliability and performance and rapid deployments of our platform. We build teams that are inclusive, collaborative, and have a strong sense of ownership for the things they build. If you have a passion and track record for solving problems; moreover, have...


  • New Delhi, India Datum Technologies Group Full time

    Job Title: Site Reliability Engineer (SRE) – Azure & AIExperience: 7+ yearsWork Mode: HybridWork Location: Chennai/Mumbai/GurgaonJob Summary:We are looking for an experienced Site Reliability Engineer (SRE) with strong expertise in Microsoft Azure, AI infrastructure, and automation. The ideal candidate will have a solid background in managing cloud...


  • New Delhi, India Grootan Technologies Full time

    About the RoleWe are seeking a skilled Site Reliability Engineer (SRE) with 4–5 years of hands-on experience to join our engineering team. In this role, you will be responsible for building and maintaining reliable, scalable, and secure infrastructure to support our applications. You will leverage your expertise in automation, cloud platforms, and...


  • New Delhi, India Datum Technologies Group Full time

    Job Title: Site Reliability Engineer (SRE) – AWSExperience: 8+ years Location: Chennai / Mumbai Work Mode: HybridKey Skills:AWS, Terraform, Kubernetes, Docker, Grafana, Prometheus, DatadogJob Summary:We are looking for a skilled Site Reliability Engineer (SRE) with strong AWS experience and a solid background in DevOps, automation, observability, and...


  • New Delhi, India Elios Talent Full time

    Site Reliability EngineerKey Highlights️ Build, automate, and support cloud-native infrastructure powering high-availability platforms⚡ Contribute to automation-first engineering across AWS, Terraform, CI/CD, and observability toolingImprove reliability, uptime, system health, and performance across production environmentsStrengthen DevSecOps...


  • New Delhi, India Andor Tech Full time

    Hiring!! About AndorTech AndorTech is aglobal IT services and consulting firmfounded in 2009, headquartered in Bangalore. The company specializes insoftware engineering, AI-enabled IT services, application support, analytics, and test automation. With a presence across India, the USA, Europe, and the UAE, AndorTech partners withGlobal Capability Centers...


  • New Delhi, India JRD Systems Full time

    Site Reliability Engineer (Windows / Cloud / Automation) Job Summary: We are seeking an experienced Site Reliability Engineer with a strong background in managing Windows infrastructure and cloud environments. The ideal candidate will be responsible for designing, implementing, automating, and maintaining scalable infrastructure solutions across AWS, Azure,...


  • New Delhi, India Andor Tech Full time

    Hiring!!About AndorTech AndorTech is aglobal IT services and consulting firmfounded in 2009, headquartered in Bangalore. The company specializes insoftware engineering, AI-enabled IT services, application support, analytics, and test automation . With a presence across India, the USA, Europe, and the UAE, AndorTech partners withGlobal Capability Centers...


  • New Delhi, India JRD Systems Full time

    Site Reliability Engineer (Windows / Cloud / Automation)Job Summary:We are seeking an experienced Site Reliability Engineer with a strong background in managing Windows infrastructure and cloud environments. The ideal candidate will be responsible for designing, implementing, automating, and maintaining scalable infrastructure solutions across AWS, Azure,...


  • New Delhi, India JRD Systems Full time

    Site Reliability Engineer (Windows / Cloud / Automation)Job Summary:We are seeking an experienced Site Reliability Engineer with a strong background in managing Windows infrastructure and cloud environments. The ideal candidate will be responsible for designing, implementing, automating, and maintaining scalable infrastructure solutions across AWS, Azure,...