Site Reliability Engineer
3 days ago
About the Role
We at Innovaccer are looking for a Site Reliability Engineer-II to build the most amazing product experience. Youll get to work with other engineers to build delightful feature experiences to understand and solve our customers pain points
A Day in the Life
- Take ownership of SRE pillars: Deployment, Reliability, Scalability, Service Availability (SLA/SLO/SLI), Performance, and Cost.
- Lead production rollouts of new releases and emergency patches using CI/CD pipelines while continuously improving deployment processes.
- Establish robust production promotion and change management processes with quality gates across Dev/QA teams.
- Roll out a complete observability stack across systems to proactively detect and resolve outages or degradations.
- Analyze production system metrics, optimize system utilization, and drive cost efficiency.
- Manage autoscaling of the platform during peak usage scenarios.
- Perform triage and RCA by leveraging observability toolchains across the platform architecture.
- Reduce escalations to higher-level teams through proactive reliability improvements.
- Participate in the 24x7 OnCall Production Support team.
- Lead monthly operational reviews with executives covering KPIs such as uptime, RCA, CAP (Corrective Action Plan), PAP (Preventive Action Plan), and security/audit reports.
- Operate and manage production and staging cloud platforms, ensuring uptime and SLA adherence.
- Collaborate with Dev, QA, DevOps, and Customer Success teams to drive RCA and product improvements.
- Implement security guidelines (e.g., DDoS protection, vulnerability management, patch management, security agents).
- Manage least-privilege RBAC for production services and toolchains.
- Build and execute Disaster Recovery plans and actively participate in Incident Response.
- Work with a cool head under pressure and avoid shortcuts during production issues.
- Collaborate effectively across teams with excellent verbal and written communication skills.
- Build strong relationships and drive results without direct reporting lines.
- Take ownership, be highly organized, self-motivated, and accountable for high-quality delivery.
What You Need
- Experience: 4-7 years in production engineering, site reliability, or related roles.
- Solid hands-on experience with at least one cloud provider (AWS, Azure, GCP) with automation focus (certifications preferred).
- Strong expertise in Kubernetes and Linux.
- Proficiency in scripting/programming (Python required).
- Observability is very critical for the scale of our systems and ability to find insights/behavior, detect problem/failures. Looking for leads to drive this charter spanning across logs, metrics, mesh, tracing etc.
- Knowledge of CI/CD pipelines and toolchains (Jenkins, ArgoCD, GitOps).
- Familiarity with persistence stores (Postgres, MongoDB), data warehousing (Snowflake, Databricks), and messaging (Kafka).
- Exposure to monitoring/observability tools such as ElasticSearch, Prometheus, Jaeger, NewRelic, etc.
- Proven experience in production reliability, scalability, and performance systems.
- Experience in 24x7 production environments with process focus.
- Familiarity with ticketing and incident management systems.
- Security-first mindset with knowledge of vulnerability management and compliance.
- Advantageous: hands-on experience with Kafka, Postgres, and Snowflake.
- Excellent judgment, analytical thinking, and problem-solving skills.
- Ability to quickly identify and drive optimal solutions within constraints.
- Lead least privilege based RBAC for various production services and tool chains.
- Able to perform with cool head under pressure situations without taking any shortcuts.
- Collaboration with solid verbal and oral communication skills are very critical to this role. Strong cross-functional collaboration skills, relationship building skills, and ability to achieve results without direct reporting relationships
- Ability to quickly identify and drive to the optimal solution when presented with a series of constraints.
- Excellent judgment, analytical thinking, and problem-solving skills.
- Self-motivated individual that possesses excellent time management and organizational skills.
- Strong sense of personal responsibility and accountability for delivering high quality work.
-
Site Reliability Engineer
7 days ago
Noida, Uttar Pradesh, India CorroHealth Full time ₹ 15,00,000 - ₹ 25,00,000 per yearWe are seeking a highly skilled Site Reliability Engineer (SRE) to join our team. The ideal candidate will have a deep understanding of both software engineering and systems administration, with a focus on creating scalable and reliable systems. You will work closely with development and operations teams to ensure the reliability, availability, and...
-
Site Reliability Engineer
22 hours ago
Noida, Uttar Pradesh, India Cloud Angles Digital Transformation Full time ₹ 15,00,000 - ₹ 25,00,000 per yearJob SummarySite Reliability Engineers (SRE's) cover the intersection of Software Engineer and Systems Administrator. In other words, they can both create code and manage the infrastructure on which the code runs. This is a very wide skillset, but the end goal of an SRE is always the same: to ensure that all SLAs are met, but not exceeded, so as to balance...
-
Site Reliability Engineer
4 weeks ago
Noida, Uttar Pradesh, India Times Internet Full timeRole: Site Reliability Engineer Experience: 8-14 years Location: Sector 16, Noida Notice Period: Immediate / Serving only About Times Internet At Times Internet, we create premium digital products that simplify and enhance the lives of millions. As India's largest digital products company, we have a significant presence across a wide range of categories,...
-
Site Reliability Engineer
1 week ago
Noida, Uttar Pradesh, India Times Internet Full time ₹ 1,04,000 - ₹ 1,30,878 per yearRole:Site Reliability EngineerExperience:8-14 yearsLocation:Sector 16, NoidaNotice Period:Immediate / Serving onlyAbout Times InternetAt Times Internet, we create premium digital products that simplify and enhance the lives ofmillions. As India's largest digital products company, we have a significant presence across awide range of categories, including...
-
Site Reliability Engineer
4 weeks ago
Noida, Uttar Pradesh, India Times Internet Full timeRole: Site Reliability Engineer Experience: 8-14 years Location: Sector 16, Noida Notice Period: Immediate / Serving only About Times Internet At Times Internet, we create premium digital products that simplify and enhance the lives of millions. As India's largest digital products company, we have a significant presence across a wide...
-
Site Reliability Engineer
1 week ago
Noida, Uttar Pradesh, India ALIQAN Technologies Full time ₹ 20,00,000 - ₹ 25,00,000 per yearGreetings from ALIQAN TechnologiesWe are hiring Site Reliability & DevOps Engineer for one of our client MNCs.Job Title:Devops EngineerExp: 4-6 YrsLocation:Remote Key ResponsibilitiesInfrastructure & Platform Engineering Design, implement, and maintain scalable cloud infrastructure using Infrastructure as Code (IaC) principles Architect and manage...
-
Site Reliability Engineer
7 days ago
Noida, Uttar Pradesh, India Race Consulting Full time ₹ 1,35,000 - ₹ 40,50,000 per yearWe are looking for an accomplished Site Reliability Engineer (SRE) for one of our client, to lead the observability and monitoring strategy for our AI-integrated ASOC platform and its associated products. This role requires a strong foundation in SDLC, agile practices, automated testing, and deep expertise in building reliable, scalable, and data-intensive...
-
Site Reliability Engineer
7 days ago
Noida, Uttar Pradesh, India Thales Full time ₹ 5,00,000 - ₹ 12,00,000 per yearLocation: Noida, IndiaThales people architect identity management and data protection solutions at the heart of digital security. Business and governments rely on us to bring trust to the billons of digital interactions they have with people. Our technologies and services help banks exchange funds, people cross borders, energy become smarter and much more....
-
Senior Site Reliability Engineer
2 weeks ago
Noida, Uttar Pradesh, India TekPillar® Full time ₹ 24,00,000 - ₹ 36,00,000 per yearJob Role:Site Reliability EngineerExperience:5 to 10 YearsLocation:Noida, Gurgaon, Bhubaneswar, Pune, Pollachi, Chennai (5 days work from office)Notice Period:Immediate JoinerCTC:Up to 24 LPARequired Skills & Qualifications5–10 years of relevant experience in Site Reliability Engineering / DevOps .Strong scripting knowledge: Bash, Python .Hands-on...
-
Site Reliability Engineer
5 days ago
Noida, Uttar Pradesh, India Uplers Full time ₹ 8,00,000 - ₹ 18,00,000 per yearSite Reliability EngineerExperience:3 - 5 Years ExpSalary :CompetitivePreferred Notice Period: Within 15 DaysShift: 10:00AM to 6:00PM ISTOpportunity Type:Onsite (Noida)Placement Type:Full-time(*Note: This is a requirement for one of Uplers' Clients)Must have skills required :AWSandTerraformandLinuxUzio (One of Uplers' Clients) is Looking for:Site Reliability...