Site Reliability Engineer

4 hours ago


Noida, Uttar Pradesh, India Race Consulting Full time ₹ 1,35,000 - ₹ 40,50,000 per year

We are looking for an accomplished Site Reliability Engineer (SRE) for one of our client, to lead the observability and monitoring strategy for our AI-integrated ASOC platform and its associated products. This role requires a strong foundation in SDLC, agile practices, automated testing, and deep expertise in building reliable, scalable, and data-intensive systems.

What You'll Be Doing

  • Design and implement observability and monitoring systems for data analytics, SIEM, and AI platforms using Prometheus, Grafana, and related tools.
  • Automate operational tasks with Kubernetes, Python, Terraform, and ArgoCD Workflows to enhance deployment speed and reliability.
  • Define and prioritize SLOs, SLIs, and SLAs in collaboration with cross-functional teams.
  • Champion proactive incident management with automated alerts, reducing MTTD and MTTR for Sev1/Sev2 incidents.
  • Conduct postmortems and root cause analyses (RCA) to drive continuous improvement.
  • Ensure secure and gradual deployment practices with strong testing and fail-fast validation.
  • Perform capacity planning and performance tuning to support scalable infrastructure.
  • Foster collaboration across Engineering, Observability, MonOps, and CloudOps teams.
  • Maintain transparent communication about service status, incidents, and resolutions.
  • Advocate for and implement cutting-edge automation, observability, and monitoring technologies.

What We Need To See

  • 9+ years of experience in Site Reliability Engineering / DevOps with large-scale, data-intensive systems.
  • Bachelor's degree in Computer Science, Engineering, or equivalent experience.
  • Expertise in observability & monitoring tools (Prometheus, Grafana, Kubernetes).
  • Strong knowledge of cloud platforms (AWS preferred).
  • Proven experience in Docker, Kubernetes, Python, Terraform, Ansible, CI/CD pipelines (GitLab CI, ArgoCD).
  • Solid grasp of SDLC and Agile methodologies with experience defining SLOs, SLIs, and SLAs.
  • Strong automation and scripting skills for incident resolution.
  • Exceptional analytical and problem-solving skills with hands-on debugging of performance bottlenecks and dependency issues in production.
  • Experience with capacity planning and performance monitoring tools (e.g., Locust, Prometheus).
  • Familiarity with secure deployment practices and automated testing frameworks.
  • Self-starter with a "get things done" attitude, able to work independently.
  • (Nice to Have) Background in cybersecurity or data lake platforms.

Job Type: Full-time

Pay: ₹1,350, ₹4,050,000.00 per year

Benefits:

  • Food provided
  • Health insurance
  • Life insurance
  • Provident Fund

Work Location: In person



  • Noida, Uttar Pradesh, India CorroHealth Full time ₹ 1,04,000 - ₹ 1,30,878 per year

    We are seeking a highly skilled Site Reliability Engineer (SRE) to join our team. The ideal candidate will have a deep understanding of both software engineering and systems administration, with a focus on creating scalable and reliable systems. You will work closely with development and operations teams to ensure the reliability, availability, and...


  • Noida, Uttar Pradesh, India HCLTech Full time

    Job Title: Site Reliability Engineer (SRE) - LEADDepartment: COEJob Summary:We are seeking a highly skilled and motivated Site Reliability Engineer (SRE) to join our engineering team. As an SRE, you will be responsible for ensuring the reliability, availability, and performance of our systems and services. You will work closely with development and...


  • Noida, Uttar Pradesh, India Times Internet Full time ₹ 1,04,000 - ₹ 1,30,878 per year

    Role:Site Reliability EngineerExperience:8-14 yearsLocation:Sector 16, NoidaNotice Period:Immediate / Serving onlyAbout Times InternetAt Times Internet, we create premium digital products that simplify and enhance the lives ofmillions. As India's largest digital products company, we have a significant presence across awide range of categories, including...


  • Noida, Uttar Pradesh, India ALIQAN Technologies Full time ₹ 9,00,000 - ₹ 12,00,000 per year

    Greetings from ALIQAN TechnologiesWe are hiring Site Reliability & DevOps Engineer for one of our client MNCs.Job Title:Devops EngineerExp: 4-6 YrsLocation:Remote Key ResponsibilitiesInfrastructure & Platform Engineering Design, implement, and maintain scalable cloud infrastructure using Infrastructure as Code (IaC) principles Architect and manage...


  • Noida, Uttar Pradesh, India Times Internet Full time

    Role: Site Reliability Engineer Experience: 8-14 years Location: Sector 16, Noida Notice Period: Immediate / Serving only About Times Internet At Times Internet, we create premium digital products that simplify and enhance the lives of millions. As India's largest digital products company, we have a significant presence across a wide...


  • Noida, Uttar Pradesh, India CorroHealth Full time

    Hiring AlertWe are looking for highly skilled Lead Site Reliability Engineer (SRE) for our Product Development team based out at Noida LocationOnly Immediate Joiners preferredJob DescriptionWe are seeking a highly skilled Site Reliability Engineer (SRE) to join our team. The ideal candidate will have a deep understanding of both software engineering and...


  • Noida, Uttar Pradesh, India CorroHealth Full time

    Hiring AlertWe are looking for highly skilled Lead Site Reliability Engineer (SRE) for our Product Development team based out at Noida LocationOnly Immediate Joiners preferredJob DescriptionWe are seeking a highly skilled Site Reliability Engineer (SRE) to join our team. The ideal candidate will have a deep understanding of both software engineering and...


  • Noida, Uttar Pradesh, India CorroHealth Full time

    Hiring AlertWe are looking for highly skilled Lead Site Reliability Engineer (SRE) for our Product Development team based out at Noida LocationOnly Immediate Joiners preferredJob DescriptionWe are seeking a highly skilled Site Reliability Engineer (SRE) to join our team. The ideal candidate will have a deep understanding of both software engineering and...


  • Noida, Uttar Pradesh, India CorroHealth Full time

    Hiring AlertWe are looking for highly skilled Lead Site Reliability Engineer (SRE) for our Product Development team based out at Noida LocationOnly Immediate Joiners preferredJob DescriptionWe are seeking a highly skilled Site Reliability Engineer (SRE) to join our team. The ideal candidate will have a deep understanding of both software engineering and...


  • Noida, Uttar Pradesh, India CorroHealth Full time

    Hiring AlertWe are looking for highly skilled Lead Site Reliability Engineer (SRE) for our Product Development team based out at Noida LocationOnly Immediate Joiners preferredJob DescriptionWe are seeking a highly skilled Site Reliability Engineer (SRE) to join our team. The ideal candidate will have a deep understanding of both software engineering and...