Site Reliability Engineer

1 day ago


Chennai, Tamil Nadu, India Elgebra Full time ₹ 12,00,000 - ₹ 36,00,000 per year

Role Overview :

We are seeking a highly experienced and technically proficient Site Reliability Engineer (SRE) to join our team in support of our client, Qincline. The ideal candidate will have 7 or more years of dedicated experience in Site Reliability Engineering or a closely related discipline. This pivotal role requires a strong focus on ensuring the reliability, scalability, performance, and operational efficiency of large-scale, complex production systems. You'll be instrumental in bridging the gap between development and operations by applying engineering principles to operational challenges.

Key Responsibilities :

Reliability & Performance Engineering :

- System Reliability : Design, build, and maintain robust, fault-tolerant production systems and infrastructure to meet stringent Service Level Objectives (SLOs).

- Performance Tuning : Proactively identify and resolve performance bottlenecks across the entire application stack, from infrastructure to application code.

- Automation : Develop and implement automation for operational tasks, infrastructure provisioning, deployment, and monitoring to eliminate manual toil.

- Capacity Planning : Collaborate with development teams on capacity planning, forecasting demand, and ensuring the infrastructure can scale efficiently to meet future business needs.

Operations & Incident Management :

- Monitoring & Alerting : Establish and maintain comprehensive monitoring, logging, and alerting systems to gain deep visibility into system health and performance (e.g., using Prometheus, Grafana, ELK Stack, etc.).

- Incident Response : Serve as a key responder during critical incidents, performing rapid triage, mitigation, and recovery.

- Post-Mortems & RCA : Lead detailed Post-Mortem and Root Cause Analysis (RCA) processes for all significant incidents, ensuring that permanent fixes and preventative measures are implemented to prevent recurrence.

- On-Call : Participate in a periodic on-call rotation to provide 24/7 support for critical production systems.

Tooling & Infrastructure :

- CI/CD & DevOps : Enhance and manage CI/CD pipelines to facilitate fast, reliable, and automated software releases.

- Containerization & Orchestration : Manage and optimize containerized environments using Docker and Kubernetes.

- Infrastructure as Code (IaC) : Utilize IaC tools (e.g., Terraform, Ansible) to provision and manage infrastructure in a repeatable and documented manner.

Required Skills & Experience :

Core Experience (7 Years) :

- Minimum 7 years of hands-on experience in a Site Reliability Engineer, DevOps Engineer, or Production Engineer role supporting high-availability, mission-critical production environments.

- Deep expertise in establishing and improving system monitoring, logging, alerting, and telemetry practices.

- Demonstrated experience with formal Incident Management processes and leading thorough Root Cause Analysis (RCA).

Technical Expertise :

- Cloud Platforms : Extensive, hands-on experience with at least one major cloud provider (e.g., AWS, Azure, or GCP). This includes managing compute, networking, storage, and managed services.

- Scripting & Programming : Strong proficiency in scripting and programming languages, with mandatory expertise in Python and Shell scripting for automation and tooling.

- DevOps Tooling : Proven experience with CI/CD pipeline tools (e.g., Jenkins, GitLab CI, Azure DevOps), Git, and artifact repositories.

- Containerization : Expert-level knowledge of Docker and robust experience with orchestrating large-scale deployments using Kubernetes.

- Operating Systems : Strong command of Linux/Unix operating systems and networking fundamentals (TCP/IP, DNS, Load Balancing).

Desired Qualifications (Good to Have) :

- Experience with configuration management tools (e.g., Ansible, Chef, Puppet).

- Familiarity with service mesh technologies (e.g., Istio, Linkerd).

- Knowledge of database administration and performance tuning (SQL/NoSQL).

- Certifications related to SRE, Cloud (e.g., AWS Certified DevOps Engineer), or Kubernetes (CKA, CKAD).



  • Chennai, Tamil Nadu, India Ford Global Career Site Full time ₹ 15,00,000 - ₹ 25,00,000 per year

    Be at the Forefront of Mobility's Future: Join Ford as a Site Reliability EngineerEnterprise Technology is the engine driving the future of transportation, and we're looking for a talented Site Reliability Engineer (SRE) to help us redefine mobility. In this role, you'll leverage cutting-edge technology to enhance customer experiences, improve lives, and...


  • Chennai, Tamil Nadu, India Elgebra Full time ₹ 6,00,000 - ₹ 18,00,000 per year

    Hiring: Site Reliability Engineer – 7+ YearsLocation: Bangalore / Chennai Payroll: Elgebra Client: Qincline Joining: Immediate to 15 DaysRole Overview:We are looking for an experienced Site Reliability Engineer (SRE) with over 6 years of expertise to join our team. The ideal candidate will have strong technical skills, a problem-solving mindset, and the...


  • Chennai, Tamil Nadu, India NatWest Group Full time

    Site Reliability Engineer, AVP Join us as a Site Reliability EngineerYou'll manage the provision of stable, resilient, reliable applications with the end goal of minimising disruption to Customer & Colleague Journeys (CCJ) We'll look to you to identify and automate manual tasks and implement observability solutions, ensuring a thorough understanding of...


  • Chennai, Tamil Nadu, India Ford Motor Full time

    SRE - Software Engineer Enterprise Technology plays a critical part in shaping the future of mobility. If you're looking for the chance to leverage advanced technology to redefine the transportation landscape, enhance the customer experience and improve people's lives, this is the opportunity for you. Join us and challenge your IT expertise and analytical...


  • Chennai, Tamil Nadu, India NatWest Group Full time ₹ 9,00,000 - ₹ 12,00,000 per year

    Join us as a Site Reliability EngineerIn this key role, you'll support the improvement of non-functional and operational characteristics such as availability, performance, efficiency, change management, monitoring, security, incident response, and capacity planning of our products and servicesYou'll enjoy significant stakeholder interaction, working in...


  • Chennai, Tamil Nadu, India ACV Full time ₹ 1,04,000 - ₹ 1,30,878 per year

    ACV's mission is to build and enable the most trusted and efficient digital marketplaces for buying and selling used vehicles with transparency and comprehensive data that was previously unimaginable. We are powered by a combination of the world's best people and the industry's best technology.  At ACV, we are driven by an entrepreneurial spirit and...


  • Chennai, Tamil Nadu, India Keuro Life Full time ₹ 10,00,000 - ₹ 25,00,000 per year

    Site Reliability Engineer / DevOps We are seeking an experienced Site Reliability Engineer / DevOps professional with a minimum of 6 years in the industry. The ideal candidate will be adept at managing large-scale, high-traffic production environments and ensuring their reliability. Key Responsibilities : - Manage and optimize production environments...


  • Chennai, Tamil Nadu, India Trimble Full time

    Site Reliability Engineer II Your Title: Site Reliability Engineer -II Job Location: Chennai, India Our Department: Trimble Platform Are you interested in cutting edge cloud technologies, ready to dirt your hands in the cloud world? Do you like to be part of a core team with industry leading site reliability engineering standards? About the...


  • Chennai, Tamil Nadu, India Parkar Digital Full time ₹ 20,00,000 - ₹ 25,00,000 per year

    About Parkar:We love building software products. With a decade of experience and a global presence across four countries, we've established ourselves as a trusted partner for over 100 organizations, helping them leverage technology to drive transformative growth. Staying at the forefront of technological advancements, we actively explore and integrate the...


  • Chennai, Tamil Nadu, India Talent Worx Full time ₹ 1,20,000 - ₹ 3,00,000 per year

    EXP required - 5 to 8 years.Role and ResponsibilitiesReporting to Engineering, the Site Reliability Engineer will play a critical role in driving innovation and growth for the Banking Solutions, Payments and Capital Markets business.  In this role, the candidate will have the opportunity to make a lasting impact on the company's transformation journey,...