Site Reliability Engineer

2 weeks ago


India Zensar Technologies Full time ₹ 20,00,000 - ₹ 25,00,000 per year
Description

Candidate having skilled and proactive Site Reliability Engineer (SRE) with 10 Years experience 

The SRE will be responsible for ensuring the reliability, scalability, and performance of our systems and infrastructure.

This role blends software engineering with IT operations to build fault-tolerant, self-healing systems and drive continuous improvement across our technology stack.

Required Skills & Qualifications:

Proficiency in Core Java technology 

Hands-on experience with Kubernetes and container orchestration.

Strong understanding of CI/CD pipelines and tools (GitLab CI/CD, Jenkins).

Familiarity with monitoring tools, Batch processing

Excellent problem-solving and communication skills.

Ability to work in on-call rotations and respond to incidents effectively.

Key Responsibilities:

System Reliability & Availability :Design and maintain fault-tolerant architectures using redundancy, load balancing, and failover mechanisms.Monitor system health using observability tools and respond to incidents to minimize downtime.

Incident Management : Implement automated alerting and response systems. Conduct blameless postmortems and drive long-term improvements.

Automation & Tooling : Automate repetitive tasks using scripting and Infrastructure as Code (IaC) tools like Terraform, Ansible.

Develop and maintain internal tools for deployment, monitoring, and debugging.

Performance Monitoring : Use metrics, logs, and traces to identify and resolve performance bottlenecks. Build monitoring systems that alert on symptoms rather than outages.

Capacity Planning & Scalability :  Analyze traffic patterns and infrastructure load to predict demand. Optimize resource allocation and implement scalable solutions.

Collaboration & Culture : Work closely with development, QA, and operations teams to foster a culture of shared responsibility.

Promote transparency and continuous feedback loops.



Same Posting Description for Internal and External Candidates


  • India Grootan Technologies Full time

    About the Role We are seeking a skilled Site Reliability Engineer (SRE) with 4–5 years of hands-on experience to join our engineering team. In this role, you will be responsible for building and maintaining reliable, scalable, and secure infrastructure to support our applications. You will leverage your expertise in automation, cloud platforms, and...


  • India Datum Technologies Group Full time

    Job Title: Site Reliability Engineer (SRE) – AWS Experience: 8+ years Location: Chennai / Mumbai Work Mode: Hybrid Key Skills: AWS, Terraform, Kubernetes, Docker, Grafana, Prometheus, Datadog Job Summary: We are looking for a skilled Site Reliability Engineer (SRE) with strong AWS experience and a solid background in DevOps, automation, observability, and...


  • India Akamai Technologies Full time

    Job Description Job Description Do you like collaborating across teams to solve complex problems Do you enjoy solving large scale distributed content delivery challenges Join our highly skilled Compute Site Reliability team Our team designs, develops, and manages applications and infrastructure that support Akamai's Compute products and services. We...


  • India Akamai Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Description Do you like collaborating across teams to solve complex problems? Do you enjoy solving large scale distributed content delivery challenges?Join our highly skilled Compute Site Reliability teamOur team designs, develops, and manages applications and infrastructure that support Akamai's Compute products and services. We specialize in creating...


  • Chennai, India Datum Technologies Group Full time

    Job Description Job Title: Site Reliability Engineer (SRE) Azure & AI Experience: 7+ years Work Mode: Hybrid Work Location: Chennai/Mumbai/Gurgaon Job Summary: We are looking for an experienced Site Reliability Engineer (SRE) with strong expertise in Microsoft Azure, AI infrastructure, and automation. The ideal candidate will have a solid background in...


  • Chennai, India Datum Technologies Group Full time

    Job Description Job Title: Site Reliability Engineer (SRE) AWS Experience: 8+ years Location: Chennai / Mumbai Work Mode: Hybrid Key Skills: AWS, Terraform, Kubernetes, Docker, Grafana, Prometheus, Datadog Job Summary: We are looking for a skilled Site Reliability Engineer (SRE) with strong AWS experience and a solid background in DevOps, automation,...


  • India CitNOW Group Full time

    About us Founded in 2008, CitNOW is an innovative, enterprise-level software product suite that allows automotive dealerships globally to sell more vehicles and parts more profitably. CitNOW’s app-based platform provides a secure, brand-compliant solution – for dealers to build trust, transparency and long-lasting relationships. CitNOW Group was formed...


  • Hyderabad, India UBS Full time

    Job Description Job Reference # 322870BR Job Type Full Time Your role Are you an analytic thinker Do you enjoy Site Reliability Engineering initiatives and proactive problem management across on-premises & Cloud Database ensuring high availability & stability of Database infrastructure services Do you want to play a key role in transforming our firm into an...


  • Bengaluru, Karnataka, , India Qure ai Technologies Full time ₹ 12,00,000 - ₹ 24,00,000 per year

    About Qure.AI:Qure.AI is an equal opportunity employer. is a leading Healthcare Artificial Intelligence (AI) company disrupting the 'status quo' by enhancing diagnostic imaging and improving health outcomes with the assistance of machine -supported tools. taps deep learning technology to provide an automated interpretation of radiology exams like X -rays,...


  • India Zensar Technologies Full time ₹ 20,00,000 - ₹ 25,00,000 per year

    Candidate having skilled and proactive Site Reliability Engineer (SRE) with 10 Years experienceThe SRE will be responsible for ensuring the reliability, scalability, and performance of our systems and infrastructure.This role blends software engineering with IT operations to build fault-tolerant, self-healing systems and drive continuous improvement across...