Site Reliability Engineer I
2 days ago
About the Role
We at Innovaccer are looking for a Site Reliability Engineer-I to build the most amazing product experience. Youll get to work with other engineers to build delightful feature experiences to understand and solve our customers pain points
A Day in the Life
- Take ownership of SRE pillars: Deployment, Reliability, Scalability, Service Availability (SLA/SLO/SLI), Performance, and Cost.
- Lead production rollouts of new releases and emergency patches using CI/CD pipelines while continuously improving deployment processes.
- Establish robust production promotion and change management processes with quality gates across Dev/QA teams.
- Roll out a complete observability stack across systems to proactively detect and resolve outages or degradations.
- Analyze production system metrics, optimize system utilization, and drive cost efficiency.
- Manage autoscaling of the platform during peak usage scenarios.
- Perform triage and RCA by leveraging observability toolchains across the platform architecture.
- Reduce escalations to higher-level teams through proactive reliability improvements.
- Participate in the 24x7 OnCall Production Support team.
- Lead monthly operational reviews with executives covering KPIs such as uptime, RCA, CAP (Corrective Action Plan), PAP (Preventive Action Plan), and security/audit reports.
- Operate and manage production and staging cloud platforms, ensuring uptime and SLA adherence.
- Collaborate with Dev, QA, DevOps, and Customer Success teams to drive RCA and product improvements.
- Implement security guidelines (e.g., DDoS protection, vulnerability management, patch management, security agents).
- Manage least-privilege RBAC for production services and toolchains.
- Build and execute Disaster Recovery plans and actively participate in Incident Response.
- Work with a cool head under pressure and avoid shortcuts during production issues.
- Collaborate effectively across teams with excellent verbal and written communication skills.
- Build strong relationships and drive results without direct reporting lines.
- Take ownership, be highly organized, self-motivated, and accountable for high-quality delivery.
What You Need
- Experience : 1-3 years in production engineering, site reliability, or related roles.
- Solid hands-on experience with at least one cloud provider (AWS, Azure, GCP) with automation focus (certifications preferred).
- Strong expertise in Kubernetes and Linux.
- Proficiency in scripting/programming (Python required).
- Strong understanding of observability toolchains (Logs, Metrics, Tracing).
- Knowledge of CI/CD pipelines and toolchains (Jenkins, ArgoCD, GitOps).
- Familiarity with persistence stores (Postgres, MongoDB), data warehousing (Snowflake, Databricks), and messaging (Kafka).
- Exposure to monitoring/observability tools such as ElasticSearch, Prometheus, Jaeger, NewRelic, etc.
- Proven experience in production reliability, scalability, and performance systems.
- Experience in 24x7 production environments with process focus.
- Familiarity with ticketing and incident management systems.
- Security-first mindset with knowledge of vulnerability management and compliance.
- Advantageous: hands-on experience with Kafka, Postgres, and Snowflake.
- Excellent judgment, analytical thinking, and problem-solving skills.
- Ability to quickly identify and drive optimal solutions within constraints.
-
Site Reliability Engineer
2 weeks ago
Noida, Uttar Pradesh, India Cloud Angles Digital Transformation Full time ₹ 15,00,000 - ₹ 25,00,000 per yearAbout the Role:We are seeking a skilled and proactive Site Reliability Engineer I & II (SRE II) to join our growing infrastructure team. As an SRE II, you will play a critical role in ensuring the reliability, scalability, and performance of our systems. Youll work independently and collaboratively to design, implement, and maintain robust infrastructure...
-
3545-Site Reliability Engineer I
6 days ago
Noida, Uttar Pradesh, India Innovaccer Analytics Full time ₹ 80,00,000 - ₹ 1,50,00,000 per yearEngineering at InnovaccerWith every line of code, we accelerate our customers' success, turning complex challenges into innovative solutions. Collaboratively, we transform each data point we gather into valuable insights for our customers. Join us and be part of a team that's turning dreams of better healthcare into reality, one line of code at a time....
-
Site Reliability Engineer II Noida Location
4 days ago
Noida, Uttar Pradesh, India Cloud Angles Digital Transformation Full time ₹ 15,00,000 - ₹ 25,00,000 per yearJob Summary:Site Reliability Engineers (SRE's) cover the intersection of Software Engineer and Systems Administrator. In other words, they can both create code and manage the infrastructure on which the code runs. This is a very wide skillset, but the end goal of an SRE is always the same: to ensure that all SLAs are met, but not exceeded, so as to balance...
-
Site Reliability Engineer
3 days ago
Noida, Uttar Pradesh, India Vimerse Infotech Full time ₹ 6,00,000 - ₹ 18,00,000 per yearSite Reliability Engineer Exp -5-10 Yrs Location :(Mumbai / Bangalore / Hyderabad / Pune / Noida) Job description: Must have Skills : Docker, Kubernetes, Terraform, AWS, Linux, Grafana /Prometheus, APM(New Relic), RUM, Python is must
-
Senior Site Reliability Engineer
2 weeks ago
Noida, Uttar Pradesh, India Biz2X Full time ₹ 15,00,000 - ₹ 25,00,000 per yearAbout the Role:We are seeking an experienced and passionate Senior Site Reliability Engineer (SRE) to join our team. In this role, you will work on improving the availability, reliability, and scalability of our services, systems, and infrastructure. You will collaborate closely with development, operations, and security teams to ensure that our systems are...
-
Senior Site Reliability Engineer
6 days ago
Noida, Uttar Pradesh, India TekPillar® Full time ₹ 24,00,000 - ₹ 36,00,000 per yearJob Role:Site Reliability EngineerExperience:5 to 10 YearsLocation:Noida, Gurgaon, Bhubaneswar, Pune, Pollachi, Chennai (5 days work from office)Notice Period:Immediate JoinerCTC:Up to 24 LPARequired Skills & Qualifications5–10 years of relevant experience in Site Reliability Engineering / DevOps .Strong scripting knowledge: Bash, Python .Hands-on...
-
Site Reliability Engineer
7 days ago
Noida, Uttar Pradesh, India MyOperator Full time ₹ 12,00,000 - ₹ 36,00,000 per yearAbout Us:MyOperator is a Business AI Operator, a category leader that unifies WhatsApp, Calls, and AI-powered chat & voice bots into one intelligent business communication platform. Unlike fragmented communication tools, MyOperator combines automation, intelligence, and workflow integration to help businesses run WhatsApp campaigns, manage calls, deploy AI...
-
DevOps / Site Reliability Engineer (SRE)
29 minutes ago
Noida, Uttar Pradesh, India TrackMyShuttle Full time ₹ 12,00,000 - ₹ 36,00,000 per yearAbout UsTrackMyShuttle is a next-generation mobility platform enabling shuttle operators to instantly transform their fleets into intelligent mobility solutions. Our mission is to make transportation accessible, affordable, sustainable, customisable, and safer for everyone worldwide.Were building systems that scale globally, process real-time data from...
-
Site Reliability Engineering
4 days ago
Noida, Uttar Pradesh, India The Techgalore Full time ₹ 20,00,000 - ₹ 25,00,000 per yearSRE (Devops)Exp: 6 yearsWork Location: RemoteContractSkills:4+ Years of experience in system administration, application development, infrastructuredevelopment4+ years of demonstrated expertise in building and managing highly scaled productioninfrastructure in the cloud (Azure or Google Cloud)3+ years of experience in APM tools like dynatrace, Prometheus, &...
-
Site Reliability Engineer-II
2 days ago
Noida, Uttar Pradesh, India Innovaccer Full time ₹ 12,00,000 - ₹ 36,00,000 per yearAbout the Role We at Innovaccer are looking for a Site Reliability Engineer-II to build the most amazing product experience. Youll get to work with other engineers to build delightful feature experiences to understand and solve our customers pain pointsA Day in the LifeTake ownership of SRE pillars: Deployment, Reliability, Scalability, Service Availability...