Senior Site Reliability Engineer

2 weeks ago


Gurgaon, Haryana, India Cvent Full time ₹ 12,00,000 - ₹ 36,00,000 per year

Site Reliability is about combining development and operations knowledge and skills to help make the organization better. If you have an SRE or development background and experience improving the reliability of your services or products by adding observability to them, Cvent SRE can benefit from your skill set. Ultimately, we are looking for passionate people who love learning, love technology and always want to make things better.

As a Senior SRE on the SRE Observability team, you will be responsible for helping Cvent achieve its reliability goals. We are looking for someone with the drive, ownership and ability to take on challenging problems, both technical and process-related, in a dynamic, collaborative, highly distributed and multi-disciplinary team environment. You will use your background as a generalist to work closely with product development teams, Cloud Infrastructure and other SRE teams to ensure effective observability and improve the reliability of our products, SDLC and infrastructure. You must be able to see the big picture and work collaboratively with teams to solve hard multi-disciplinary problems. Technical expertise in topics such as cloud operations, the software development lifecycle, and observability tools will be of great help to you. We use SRE principles such as blameless postmortems and a focus on automation to ensure we're constantly improving our knowledge and maintaining a good quality of life. Overall, we're passionate about continuous improvement, learning and participating in dynamic day-to-day work where success is rewarded with recognition and upward mobility.

What You Will Be Doing


•Enlighten, Enable and Empower a fast-growing set of multi-disciplinary teams, across multiple applications and locations.


•Tackle complex development, automation and business process problems. Champion Cvent standards and best practices.


•Ensure the scalability, performance, and resilience of Cvent products and processes.


•Work with product development teams, Cloud Automation and other SRE teams to build a holistic understanding of observability gaps and to identify and resolve them effectively and efficiently.


•Identify recurring problems and anti-patterns in development, operational and security processes and help the respective teams build observability for them.


•Develop build, test and deployment automation that seamlessly targets multiple on-premises environments and AWS regions.


•Give back by working on and contributing to open-source projects.

What You Need for this Position

Must have skills:


•Excellent communication skills and a track record of working in distributed teams.


•A passion for and track record in making things better for your peers.


•Experience managing AWS services / operational knowledge of managing applications in AWS – ideally via automation.


•Fluency in at least one scripting language such as TypeScript, JavaScript, Python, Ruby or Bash.


•Experience with SDLC methodologies (preferably Agile).


•Experience with observability (logging, metrics, tracing) and SLIs/SLOs.


•Experience working with APM, monitoring, and logging tools (Datadog, New Relic, Splunk).


•Good understanding of containerization concepts and tools: Docker, ECS, EKS, Kubernetes.


•Self-motivation and the ability to work under minimal supervision.


•Experience troubleshooting and responding to incidents, and setting a standard for others to prevent the same issues in the future.

Good to have skills:


•Experience with Infrastructure as Code (IaC) tools such as CloudFormation, CDK (preferred) and Terraform.


•Experience managing three-tier application stacks.


•Understanding of basic networking concepts.


•Experience with server configuration through Chef, Puppet, Ansible or equivalent.


•Working experience with NoSQL and relational databases such as MongoDB, Couchbase and Postgres.


•Ability to use APM data to troubleshoot and find performance bottlenecks.



  • Gurgaon, Haryana, India Aerial Telecom Solutions (ATS) Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Position Overview: The SRE Lead will be responsible for managing a team of engineers focused on software deployments and site reliability engineering practices. The role will involve overseeing the deployment process of software applications and services, implementing automation, monitoring, and alerting tools, and ensuring the reliability, availability, and...


  • Gurgaon, Haryana, India NatWest Group Full time ₹ 9,00,000 - ₹ 12,00,000 per year

    Site Reliability Engineer: Join us as a Site Reliability Engineer. In this key role, you'll support the improvement of non-functional and operational characteristics such as availability, performance, efficiency, change management, monitoring, security, incident response, and capacity planning of our products and services. You'll enjoy significant...


  • Gurgaon, Haryana, India RBS Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Join us as a Site Reliability Engineer. In this key role, you'll support the improvement of non-functional and operational characteristics such as availability, performance, efficiency, change management, monitoring, security, incident response, and capacity planning of our products and services. You'll enjoy significant stakeholder interaction, working in...

  • Site Reliability

    5 days ago


    Gurgaon, Haryana, India Weekday Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    This role is for one of our clients. Company Name: Neemtree. Industry: Technology, Information and Media. Seniority level: Mid-Senior level. Min Experience: 4 years. Location: Gurugram, Delhi, NCR. Job Type: full-time. We're looking for a Site Reliability & Automation Engineer who thrives at the intersection of infrastructure, automation, and reliability. In this role,...


  • Gurgaon, Haryana, India Bravura Solutions Full time ₹ 10,00,000 - ₹ 25,00,000 per year

    Bravura's Commitment and Mission: At Bravura Solutions, collaboration, diversity and excellence matter. We value your ideas, giving you room to be curious and innovate in an exciting, fast-paced, and flexible environment. We look for many different skills and abilities, as well as how you can add value to Bravura and our culture. As a Global FinTech market...


  • Gurgaon, Haryana, India Leapwork Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    At Leapwork, our vision is to break down the barriers between humans and computers through the world's most accessible automation platform. We are the leading global AI-powered visual test automation solution, enabling some of the world's largest enterprises to adopt, scale, and maintain automation – in under 30 days. In today's environment, where...


  • Gurgaon, Haryana, India Leapwork Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    At Leapwork, our vision is to break down the barriers between humans and computers through the world's most accessible automation platform. We are the leading global AI-powered visual test automation solution, enabling some of the world's largest enterprises to adopt, scale, and maintain automation – in under 30 days. In today's environment,...


  • Gurgaon, Haryana, India Gemini Solutions Pvt Ltd Full time ₹ 20,00,000 - ₹ 25,00,000 per year

    Position Summary: In this role, you will play a crucial part in shaping the firm's infrastructure reliability and efficiency by implementing robust Site Reliability Engineering practices. Your contribution will be pivotal in ensuring the availability, scalability, and performance of our systems and applications. Leveraging your strong technical skills and...


  • Gurgaon, Haryana, India RBS Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Join us as a Site Reliability Engineer. In this key role, you'll improve, drive, and embed non-functional and operational characteristics such as availability, performance, efficiency, change management, monitoring, security, incident response, and capacity planning of our products and services. You'll enjoy significant stakeholder interaction, working in...


  • Gurgaon, Haryana, India EDGE Executive Search Full time ₹ 1,04,000 - ₹ 1,30,878 per year

    The Job: The SRE is a global team that provides technical support across the suite of products. The team works closely with a highly competent Technical Operation Centre (TOC), Development and Infrastructure teams to deliver proactive tasks to improve the supportability of our platforms. Our work helps to ensure that the company provides a high-quality...