Site Reliability Engineer

3 weeks ago


Chennai, Tamil Nadu, India Everstage Inc. Full time

Everstage is looking to hire Site Reliability Engineer. Please write to bharath@everstage.com if the below opportunity excites you. 

We are seeking a skilled and motivated Site Reliability Engineer (SRE) with at least 2 years of experience in maintaining and optimising infrastructure. The ideal candidate will be responsible for ensuring system reliability and meeting our SLOs and SLAs. You will work closely with engineering and operations teams to build, monitor, and maintain scalable, resilient systems.


Key Responsibilities:


System Monitoring & Reliability:

1. Monitor and improve the performance, availability, and reliability of production systems.

2. Implement and maintain monitoring, alerting, and logging tools to ensure early detection and resolution of issues.


Incident Management:

1. Proactively design and maintain infrastructure to prevent issues before they arise, minimising the need for reactive responses.

2. Troubleshoot and resolve issues if they arise, ensuring minimal downtime and impact on end-users.

Contribute to post-incident reviews to improve the response and prevent similar issues.


Automation & Scripting:

1. Automate repetitive tasks and processes to improve efficiency and reduce human error.

2. Write and maintain scripts in Python, Shell, or similar languages to support operational tasks.


Collaboration:

1. Work with developers to optimise application performance, improve code reliability, and troubleshoot production issues.

2. Document processes, configurations, and troubleshooting steps to provide visibility across teams.


Required Skills & Qualifications:

1. Bachelor's degree in Computer Science, Engineering, or a related field.

2. 2+ years of experience in a similar SRE, DevOps, or Operations role.

3. Strong understanding of Linux/Unix operating systems and system administration.

4. Familiarity with monitoring and logging tools such as Datadog or ELK stack.

5. Hands-on experience with Docker / Docker compose.

6. Experience with cloud platforms, preferably with AWS.

7. Proficiency in scripting languages such as Python, Bash or Shell and using cli commands.

8. Excellent troubleshooting skills with a proactive attitude toward problem-solving.


Nice to Have:

1. Familiarity with CI/CD tools like Jenkins, GitLab CI, or similar.

2. Familiarity with StatsD or collectd protocols

3. Knowledge of Infrastructure as Code (IaC) tools like Terraform



  • Chennai, Tamil Nadu, India Bright Vision Technologies Full time

    Bright Vision Technologies has an immediate Full-time opportunity for Site Reliability Engineer (SRE)  Job Role:  Site Reliability Engineer (SRE) Job Type: Full Time Candidates Looking for Visa sponsorship and willing to relocate to USA are encouraged to apply.About Bright Vision Technologies: Bright Vision Technologies is a fast-growing technology company...


  • Chennai, Tamil Nadu, India 10decoders Full time

    JD: Site Reliability Engineer - GCP With Terraform The Role: We are looking for a Senior SRE with 5+ years of experience to work primarily with our Application development team. An ideal candidate would have extensive experience building cloud infrastructure on Google Cloud with Terraform and have strong experience running workloads that scale on Google's...


  • Chennai, Tamil Nadu, India Burgeon It Services Pvt Ltd Full time

    Job Title : SRE EngineerLocation : ChennaiExperience : 8+ YearsJob Description :We are seeking an experienced Site Reliability Engineer (SRE) to join our dynamic team. The ideal candidate will have a strong background in software engineering and operations, with a passion for building scalable and reliable systems.Key Responsibilities :- Design, implement,...


  • Chennai, Tamil Nadu, India Burgeon It Services Pvt Ltd Full time

    Job Title : SRE EngineerLocation : ChennaiExperience : 8+ YearsJob Description :We are seeking an experienced Site Reliability Engineer (SRE) to join our dynamic team. The ideal candidate will have a strong background in software engineering and operations, with a passion for building scalable and reliable systems.Key Responsibilities :- Design, implement,...


  • Chennai, Tamil Nadu, India 10decoders Full time

    JD: Site Reliability Engineer - GCP With TerraformThe Role:We are looking for a Senior SRE with 5+ years of experience to work primarily with ourApplication development team. An ideal candidate would have extensive experiencebuilding cloud infrastructure on Google Cloud with Terraform and have strongexperience running workloads that scale on Google's...


  • Chennai, Tamil Nadu, India 10decoders Full time

    JD: Site Reliability Engineer -GCP With TerraformThe Role:We are looking for a Senior SRE with5+ yearsof experience to work primarily with ourApplication development team. An ideal candidate would have extensive experiencebuilding cloud infrastructure onGoogle Cloud with Terraformand have strongexperience running workloads that scale on Google's Kubernetes...


  • Chennai, Tamil Nadu, India 10decoders Full time

    Job Summary We are seeking a Senior Site Reliability Engineer (SRE) with 5+ years of experience to join our team and work primarily with our Application development team. The ideal candidate will have extensive experience building cloud infrastructure on Google Cloud Platform using Terraform and strong experience running workloads that scale on Google's...


  • Chennai, Tamil Nadu, India Zf Friedrich Full time

    Job DescriptionJob Description :Req ID 77489|GEC Chennai, India,ZF Commercial Vehicle Control Systems India LimitedLong DescriptionAbout the Team:Garuda team is a SRE team responsible for the reliability and operations of our Fleet management services platform. We ensure the availability and performance of the platform through proactive incident management,...


  • Chennai, Tamil Nadu, India Kiash Solutions LLp Full time

    We are hiring a Site Reliability Engineer (SRE) with strong expertise in Azure operations, containerized workflows (Docker), and Python scripting. The ideal candidate will lead efforts to ensure system reliability, automate operational tasks, and optimize cloud-based infrastructure, while collaborating with cross-functional teams to deliver high-performing...


  • Chennai, Tamil Nadu, India 10decoders Full time

    JD: Site Reliability Engineer - GCP With TerraformThe Role:We are looking for a Senior SRE with 5+ years of experience to work primarily with ourApplication development team. An ideal candidate would have extensive experiencebuilding cloud infrastructure on Google Cloud with Terraform and have strongexperience running workloads that scale on Google's...


  • Chennai, Tamil Nadu, India ZF Group Full time

    Job DescriptionJob description:About the Team:Garuda team is a SRE team responsible for the reliability and operations of our Fleet management services platform. We ensure the availability and performance of the platform through proactive incident management, optimization, and continuous improvement while contributing to the development of SCALAR&aposs...


  • Chennai, Tamil Nadu, India 10decoders Full time

    Job Description The Role: We are seeking a Senior Site Reliability Engineer with 5+ years of experience to work closely with our Application Development team. Responsibilities: Contribute to establishing best practices and shaping the SRE culture within our organization. Collaborate with teams to design, build, and improve Google Cloud infrastructure using...


  • Chennai, Tamil Nadu, India Bastion Data Solutions Full time

    Become a part of Bastion Data Solutions' mission to deliver exceptional data solutions.ResponsibilitiesThis on-site role at Bastion Data Solutions in Chennai requires a strong background in Site Reliability Engineering, software development, and system administration.Main duties will include:Ensuring site reliability and performanceDeveloping software...


  • Chennai, Tamil Nadu, India Ascendion Full time

    Job Description :We are looking for an experienced Azure Site Reliability Engineer (SRE) with 6-9 years of experience to support and administer Azure Kubernetes Service (AKS) clusters running critical middleware handling thousands of transactions per second (TPS). The ideal candidate will have a strong background in Infrastructure as Code (IaC), cloud...


  • Chennai, Tamil Nadu, India triSys Full time

    Job DescriptionExperience: 5-8yrsJob Location : Chennai/Pune/Gurgaon/KolkataWe are seeking a highly skilled and experienced Site Reliability Engineer (SRE) with a deep understanding of SRE principles and practices. This role will be instrumental in shaping and guiding the SRE journey, ensuring high availability, reliability, and performance. The ideal...


  • Chennai, Tamil Nadu, India Tredence Inc. Full time

    Site Reliability Engineer (SRE) Experience: 8-12yrs Pune/ Chennai/ Gurgaon/ Kolkata We are seeking a highly skilled and experienced Site Reliability Engineer (SRE) with a deep understanding of SRE principles and practices. This role will be instrumental in shaping and guiding the SRE journey, ensuring high availability, reliability, and performance. The...


  • Chennai, Tamil Nadu, India Natobotics Technologies Pvt Limited Full time

    Site Reliability Engineer - Server Support (SRE - SES)Location : Chennai, Hyderabad, Pune, BangaloreExperience : 4-7 YearsNotice Period : 0-30 DaysAbout the Role :We are urgently seeking experienced Site Reliability Engineers - Server Support (SRE - SES) to join our growing team. As an SRE - SES, you will be responsible for ensuring the high availability,...


  • Chennai, Tamil Nadu, India Tredence Inc. Full time

    Site Reliability Engineer (SRE) Experience: 8-12yrsPune/ Chennai/ Gurgaon/ KolkataWe are seeking a highly skilled and experienced Site Reliability Engineer (SRE) with a deep understanding of SRE principles and practices. This role will be instrumental in shaping and guiding the SRE journey, ensuring high availability, reliability, and performance. The ideal...


  • Chennai, Tamil Nadu, India Tredence Inc. Full time

    **Job Title:** Site Reliability Engineer (SRE) **Experience Level:** 8-12 years **Locations:** Pune, Chennai, Gurgaon, Kolkata We are seeking a highly skilled and experienced Site Reliability Engineer (SRE) to shape and guide our SRE journey. The ideal candidate will bring both technical expertise and SRE knowledge to establish robust observability, incident...


  • Chennai, Tamil Nadu, India Zuora Full time

    As a Site Reliability Engineering Manager at Zuora, you will be responsible for leading a team of talented engineers to leverage their expertise in cloud technologies, system design, troubleshooting, automation, and AI to scale and work across Product Engineering, Customer Support, Product Management, and Global Services to deliver Site and Customer...