Site Reliability Engineer I

2 weeks ago


Noida, Uttar Pradesh, India Innovaccer Full time ₹ 12,00,000 - ₹ 36,00,000 per year

About the Role

We at Innovaccer are looking for a Site Reliability Engineer-I to build the most amazing product experience. You'll get to work with other engineers to build delightful feature experiences to understand and solve our customers' pain points.

A Day in the Life

  • Take ownership of SRE pillars: Deployment, Reliability, Scalability, Service Availability (SLA/SLO/SLI), Performance, and Cost.
  • Lead production rollouts of new releases and emergency patches using CI/CD pipelines while continuously improving deployment processes.
  • Establish robust production promotion and change management processes with quality gates across Dev/QA teams.
  • Roll out a complete observability stack across systems to proactively detect and resolve outages or degradations.
  • Analyze production system metrics, optimize system utilization, and drive cost efficiency.
  • Manage autoscaling of the platform during peak usage scenarios.
  • Perform triage and RCA by leveraging observability toolchains across the platform architecture.
  • Reduce escalations to higher-level teams through proactive reliability improvements.
  • Participate in the 24x7 OnCall Production Support team.
  • Lead monthly operational reviews with executives covering KPIs such as uptime, RCA, CAP (Corrective Action Plan), PAP (Preventive Action Plan), and security/audit reports.
  • Operate and manage production and staging cloud platforms, ensuring uptime and SLA adherence.
  • Collaborate with Dev, QA, DevOps, and Customer Success teams to drive RCA and product improvements.
  • Implement security guidelines (e.g., DDoS protection, vulnerability management, patch management, security agents).
  • Manage least-privilege RBAC for production services and toolchains.
  • Build and execute Disaster Recovery plans and actively participate in Incident Response.
  • Work with a cool head under pressure and avoid shortcuts during production issues.
  • Collaborate effectively across teams with excellent verbal and written communication skills.
  • Build strong relationships and drive results without direct reporting lines.
  • Take ownership, be highly organized, self-motivated, and accountable for high-quality delivery.

What You Need

  • Experience: 13 years in production engineering, site reliability, or related roles.
  • Solid hands-on experience with at least one cloud provider (AWS, Azure, GCP) with automation focus (certifications preferred).
  • Strong expertise in Kubernetes and Linux.
  • Proficiency in scripting/programming (Python required).
  • Strong understanding of observability toolchains (Logs, Metrics, Tracing).
  • Knowledge of CI/CD pipelines and toolchains (Jenkins, ArgoCD, GitOps).
  • Familiarity with persistence stores (Postgres, MongoDB), data warehousing (Snowflake, Databricks), and messaging (Kafka).
  • Exposure to monitoring/observability tools such as ElasticSearch, Prometheus, Jaeger, NewRelic, etc.
  • Proven experience in production reliability, scalability, and performance systems.
  • Experience in 24x7 production environments with process focus.
  • Familiarity with ticketing and incident management systems.
  • Security-first mindset with knowledge of vulnerability management and compliance.
  • Advantageous: hands-on experience with Kafka, Postgres, and Snowflake.
  • Excellent judgment, analytical thinking, and problem-solving skills.
  • Ability to quickly identify and drive optimal solutions within constraints.


  • Noida, Uttar Pradesh, India Cloud Angles Digital Transformation Full time ₹ 15,00,000 - ₹ 25,00,000 per year

    Job Summary: Site Reliability Engineers (SREs) cover the intersection of Software Engineer and Systems Administrator. In other words, they can both create code and manage the infrastructure on which the code runs. This is a very wide skillset, but the end goal of an SRE is always the same: to ensure that all SLAs are met, but not exceeded, so as to balance...


  • Noida, Uttar Pradesh, India Cloud Angles Digital Transformation Full time ₹ 15,00,000 - ₹ 25,00,000 per year

    About the Role: We are seeking a skilled and proactive Site Reliability Engineer I & II (SRE II) to join our growing infrastructure team. As an SRE II, you will play a critical role in ensuring the reliability, scalability, and performance of our systems. You'll work independently and collaboratively to design, implement, and maintain robust infrastructure...


  • Noida, Uttar Pradesh, India Innovaccer Analytics Full time ₹ 80,00,000 - ₹ 1,50,00,000 per year

    Engineering at Innovaccer: With every line of code, we accelerate our customers' success, turning complex challenges into innovative solutions. Collaboratively, we transform each data point we gather into valuable insights for our customers. Join us and be part of a team that's turning dreams of better healthcare into reality, one line of code at a time....


  • Greater Noida, Uttar Pradesh, India TRH Consultancy Services Full time ₹ 4,00,000 - ₹ 12,00,000 per year

    Description: We are seeking a Site Reliability Engineer with expertise in OpenTelemetry to join our team in India. The ideal candidate will be responsible for ensuring the reliability, availability, and performance of our systems while implementing best practices for observability and monitoring. Responsibilities: Design, implement, and maintain...


  • Noida, Uttar Pradesh, India FarEye Full time ₹ 15,00,000 - ₹ 25,00,000 per year

    Join Us as a Site Reliability Engineer at FarEye. At FarEye, we believe logistics isn't just about moving goods; it's about creating delightful delivery experiences for millions worldwide. Our low-code Intelligent Delivery Management Platform empowers 150+ global enterprises across 30+ countries, making deliveries smarter, faster, and greener. This is your...


  • Noida, Uttar Pradesh, India Biz2X Full time ₹ 15,00,000 - ₹ 25,00,000 per year

    About the Role: We are seeking an experienced and passionate Senior Site Reliability Engineer (SRE) to join our team. In this role, you will work on improving the availability, reliability, and scalability of our services, systems, and infrastructure. You will collaborate closely with development, operations, and security teams to ensure that our systems are...


  • Noida, Uttar Pradesh, India TekPillar® Full time ₹ 24,00,000 - ₹ 36,00,000 per year

    Job Role: Site Reliability Engineer. Experience: 5 to 10 Years. Location: Noida, Gurgaon, Bhubaneswar, Pune, Pollachi, Chennai (5 days work from office). Notice Period: Immediate Joiner. CTC: Up to 24 LPA. Required Skills & Qualifications: 5–10 years of relevant experience in Site Reliability Engineering / DevOps. Strong scripting knowledge: Bash, Python. Hands-on...


  • Noida, Uttar Pradesh, India Innovaccer Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    About the Role: We at Innovaccer are looking for a Site Reliability Engineer-II to build the most amazing product experience. You'll get to work with other engineers to build delightful feature experiences to understand and solve our customers' pain points. A Day in the Life: Take ownership of SRE pillars: Deployment, Reliability, Scalability, Service Availability...


  • Noida, Uttar Pradesh, India Uplers Full time ₹ 8,00,000 - ₹ 18,00,000 per year

    Site Reliability Engineer. Experience: 3-5 Years. Salary: Competitive. Preferred Notice Period: Within 15 Days. Shift: 10:00 AM to 6:00 PM IST. Opportunity Type: Onsite (Noida). Placement Type: Full-time. (*Note: This is a requirement for one of Uplers' Clients.) Must-have skills: AWS, Terraform, and Linux. Uzio (one of Uplers' Clients) is looking for: Site Reliability...


  • Noida, Uttar Pradesh, India MyOperator Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    About Us: MyOperator is a Business AI Operator, a category leader that unifies WhatsApp, Calls, and AI-powered chat & voice bots into one intelligent business communication platform. Unlike fragmented communication tools, MyOperator combines automation, intelligence, and workflow integration to help businesses run WhatsApp campaigns, manage calls, deploy AI...