
Site Reliability Engineer
4 hours ago
We are looking for an accomplished Site Reliability Engineer (SRE) for one of our client, to lead the observability and monitoring strategy for our AI-integrated ASOC platform and its associated products. This role requires a strong foundation in SDLC, agile practices, automated testing, and deep expertise in building reliable, scalable, and data-intensive systems.
What You'll Be Doing
- Design and implement observability and monitoring systems for data analytics, SIEM, and AI platforms using Prometheus, Grafana, and related tools.
- Automate operational tasks with Kubernetes, Python, Terraform, and ArgoCD Workflows to enhance deployment speed and reliability.
- Define and prioritize SLOs, SLIs, and SLAs in collaboration with cross-functional teams.
- Champion proactive incident management with automated alerts, reducing MTTD and MTTR for Sev1/Sev2 incidents.
- Conduct postmortems and root cause analyses (RCA) to drive continuous improvement.
- Ensure secure and gradual deployment practices with strong testing and fail-fast validation.
- Perform capacity planning and performance tuning to support scalable infrastructure.
- Foster collaboration across Engineering, Observability, MonOps, and CloudOps teams.
- Maintain transparent communication about service status, incidents, and resolutions.
- Advocate for and implement cutting-edge automation, observability, and monitoring technologies.
What We Need To See
- 9+ years of experience in Site Reliability Engineering / DevOps with large-scale, data-intensive systems.
- Bachelor's degree in Computer Science, Engineering, or equivalent experience.
- Expertise in observability & monitoring tools (Prometheus, Grafana, Kubernetes).
- Strong knowledge of cloud platforms (AWS preferred).
- Proven experience in Docker, Kubernetes, Python, Terraform, Ansible, CI/CD pipelines (GitLab CI, ArgoCD).
- Solid grasp of SDLC and Agile methodologies with experience defining SLOs, SLIs, and SLAs.
- Strong automation and scripting skills for incident resolution.
- Exceptional analytical and problem-solving skills with hands-on debugging of performance bottlenecks and dependency issues in production.
- Experience with capacity planning and performance monitoring tools (e.g., Locust, Prometheus).
- Familiarity with secure deployment practices and automated testing frameworks.
- Self-starter with a "get things done" attitude, able to work independently.
- (Nice to Have) Background in cybersecurity or data lake platforms.
Job Type: Full-time
Pay: ₹1,350, ₹4,050,000.00 per year
Benefits:
- Food provided
- Health insurance
- Life insurance
- Provident Fund
Work Location: In person
-
Site Reliability Engineer
4 hours ago
Noida, Uttar Pradesh, India CorroHealth Full time ₹ 1,04,000 - ₹ 1,30,878 per yearWe are seeking a highly skilled Site Reliability Engineer (SRE) to join our team. The ideal candidate will have a deep understanding of both software engineering and systems administration, with a focus on creating scalable and reliable systems. You will work closely with development and operations teams to ensure the reliability, availability, and...
-
Site Reliability Engineer
4 weeks ago
Noida, Uttar Pradesh, India HCLTech Full timeJob Title: Site Reliability Engineer (SRE) - LEADDepartment: COEJob Summary:We are seeking a highly skilled and motivated Site Reliability Engineer (SRE) to join our engineering team. As an SRE, you will be responsible for ensuring the reliability, availability, and performance of our systems and services. You will work closely with development and...
-
Site Reliability Engineer
2 days ago
Noida, Uttar Pradesh, India Times Internet Full time ₹ 1,04,000 - ₹ 1,30,878 per yearRole:Site Reliability EngineerExperience:8-14 yearsLocation:Sector 16, NoidaNotice Period:Immediate / Serving onlyAbout Times InternetAt Times Internet, we create premium digital products that simplify and enhance the lives ofmillions. As India's largest digital products company, we have a significant presence across awide range of categories, including...
-
Site Reliability Engineer
2 days ago
Noida, Uttar Pradesh, India ALIQAN Technologies Full time ₹ 9,00,000 - ₹ 12,00,000 per yearGreetings from ALIQAN TechnologiesWe are hiring Site Reliability & DevOps Engineer for one of our client MNCs.Job Title:Devops EngineerExp: 4-6 YrsLocation:Remote Key ResponsibilitiesInfrastructure & Platform Engineering Design, implement, and maintain scalable cloud infrastructure using Infrastructure as Code (IaC) principles Architect and manage...
-
Site Reliability Engineer
2 days ago
Noida, Uttar Pradesh, India Times Internet Full timeRole: Site Reliability Engineer Experience: 8-14 years Location: Sector 16, Noida Notice Period: Immediate / Serving only About Times Internet At Times Internet, we create premium digital products that simplify and enhance the lives of millions. As India's largest digital products company, we have a significant presence across a wide...
-
Lead Site Reliability Engineer
1 week ago
Noida, Uttar Pradesh, India CorroHealth Full timeHiring AlertWe are looking for highly skilled Lead Site Reliability Engineer (SRE) for our Product Development team based out at Noida LocationOnly Immediate Joiners preferredJob DescriptionWe are seeking a highly skilled Site Reliability Engineer (SRE) to join our team. The ideal candidate will have a deep understanding of both software engineering and...
-
Lead Site Reliability Engineer
2 weeks ago
Noida, Uttar Pradesh, India CorroHealth Full timeHiring AlertWe are looking for highly skilled Lead Site Reliability Engineer (SRE) for our Product Development team based out at Noida LocationOnly Immediate Joiners preferredJob DescriptionWe are seeking a highly skilled Site Reliability Engineer (SRE) to join our team. The ideal candidate will have a deep understanding of both software engineering and...
-
Lead Site Reliability Engineer
2 weeks ago
Noida, Uttar Pradesh, India CorroHealth Full timeHiring AlertWe are looking for highly skilled Lead Site Reliability Engineer (SRE) for our Product Development team based out at Noida LocationOnly Immediate Joiners preferredJob DescriptionWe are seeking a highly skilled Site Reliability Engineer (SRE) to join our team. The ideal candidate will have a deep understanding of both software engineering and...
-
Lead Site Reliability Engineer
1 week ago
Noida, Uttar Pradesh, India CorroHealth Full timeHiring AlertWe are looking for highly skilled Lead Site Reliability Engineer (SRE) for our Product Development team based out at Noida LocationOnly Immediate Joiners preferredJob DescriptionWe are seeking a highly skilled Site Reliability Engineer (SRE) to join our team. The ideal candidate will have a deep understanding of both software engineering and...
-
Lead Site Reliability Engineer
2 weeks ago
Noida, Uttar Pradesh, India CorroHealth Full timeHiring AlertWe are looking for highly skilled Lead Site Reliability Engineer (SRE) for our Product Development team based out at Noida LocationOnly Immediate Joiners preferredJob DescriptionWe are seeking a highly skilled Site Reliability Engineer (SRE) to join our team. The ideal candidate will have a deep understanding of both software engineering and...