Senior Site Reliability Engineer

2 weeks ago


Chennai, Tamil Nadu, India Tredence Inc. Full time

Site Reliability Engineer (SRE)

Experience: 8-12 years

Locations: Pune / Chennai / Gurgaon / Kolkata

We are seeking a highly skilled and experienced Site Reliability Engineer (SRE) with a deep understanding of SRE principles and practices. This role will be instrumental in shaping and guiding the SRE journey, ensuring high availability, reliability, and performance. The ideal candidate will bring both technical expertise and SRE expertise to establish robust observability, incident management, and automation practices.

Technical Expertise and Experience:

  • Deep understanding of SRE concepts, including SLIs, SLOs, SLAs, error budgets, and reliability engineering best practices.
  • Expertise in observability tools such as Prometheus, Thanos, and Grafana is mandatory.
  • Strong hands-on experience with PromQL and Alertmanager, with a proven ability to set up and manage monitoring and alerting systems.
  • Proficiency in cloud platforms (Azure is mandatory).
  • Strong scripting and automation skills, with proficiency in Python and Bash.
  • Hands-on experience with infrastructure operations and observability.
  • Extensive knowledge and hands-on experience across IT infrastructure, cloud platforms, and networking.
  • Significant experience with Kubernetes, including running, managing, and troubleshooting containerized workloads.
  • Experience with version control platforms such as GitHub and with implementing CI/CD pipelines is a plus.
  • Familiarity with infrastructure-as-code (IaC) tools like Terraform or ARM templates is a plus.

SRE Expertise:

  • Ability to define and implement SRE best practices for data platforms and data-driven applications, ensuring alignment with organizational goals.
  • Provide mentorship and guidance to teams in adopting SRE principles and improving operational excellence.
  • Collaborate with cross-functional teams to drive reliability, scalability, and performance across data engineering, data science, and platform engineering projects.

Soft Skills:

  • Strong planning and organizational skills to manage individual and team responsibilities efficiently.
  • Excellent problem-solving and troubleshooting skills, with the ability to analyze complex issues and implement effective solutions.
  • Effective real-time communication, ensuring clear and concise updates for both technical and non-technical stakeholders.
  • Ability to work under pressure and manage incidents effectively, ensuring timely resolutions and minimal downtime.
  • Collaborative mindset with the ability to foster a culture of ownership, accountability, and continuous improvement.


  • Chennai, Tamil Nadu, India 10decoders Full time

    JD: Site Reliability Engineer - GCP With Terraform The Role: We are looking for a Senior SRE with 5+ years of experience to work primarily with our Application development team. An ideal candidate would have extensive experience building cloud infrastructure on Google Cloud with Terraform and have strong experience running workloads that scale on Google's...


  • Chennai, Tamil Nadu, India NexionPro Services Full time

    Role: Site Reliability Engineer. Job Title: Senior Infrastructure Engineer (Observability & Monitoring). Location: Bangalore / Pune. Experience: Minimum 6 Years. Job Description: We are looking for a highly skilled Senior Infrastructure Engineer with extensive experience in Infrastructure as Code, Observability, and Monitoring. The ideal candidate will...


  • Chennai, Tamil Nadu, India HCLTech Full time

    Job Description: The Site Reliability Engineer, in Application/Cloud Support, will be responsible for ensuring the reliability, scalability, and performance of critical business systems, and will be involved in identifying and solving issues within multiple components of these systems, utilizing expertise in Site Reliability Engineering. Roles & Responsibilities: 1. Lead...


  • Chennai, Tamil Nadu, India HCLTech Full time

    The Site Reliability Engineer, in Application/Cloud Support, will be responsible for ensuring the reliability, scalability, and performance of critical business systems, and will be involved in identifying and solving issues within multiple components of these systems, utilizing expertise in Site Reliability Engineering. Roles & Responsibilities: Lead efforts to improve the...


  • Chennai, Tamil Nadu, India 10decoders Full time

    Job Summary We are seeking a Senior Site Reliability Engineer (SRE) with 5+ years of experience to join our team and work primarily with our Application development team. The ideal candidate will have extensive experience building cloud infrastructure on Google Cloud Platform using Terraform and strong experience running workloads that scale on Google's...


  • Chennai, Tamil Nadu, India 10decoders Full time

    JD: Site Reliability Engineer - GCP With Terraform. The Role: We are looking for a Senior SRE with 5+ years of experience to work primarily with our Application development team. An ideal candidate would have extensive experience building cloud infrastructure on Google Cloud with Terraform and have strong experience running workloads that scale on Google's Kubernetes...


  • Chennai, Tamil Nadu, India Bright Vision Technologies Full time

    Bright Vision Technologies has an immediate full-time opportunity for a Site Reliability Engineer (SRE). Job Role: Site Reliability Engineer (SRE). Job Type: Full Time. Candidates looking for visa sponsorship and willing to relocate to the USA are encouraged to apply. About Bright Vision Technologies: Bright Vision Technologies is a fast-growing technology company...


  • Chennai, Tamil Nadu, India 10decoders Full time

    Job Description The Role: We are seeking a Senior Site Reliability Engineer with 5+ years of experience to work closely with our Application Development team. Responsibilities: Contribute to establishing best practices and shaping the SRE culture within our organization. Collaborate with teams to design, build, and improve Google Cloud infrastructure using...


  • Chennai, Tamil Nadu, India 5100 Kyndryl Solutions Private Limited Full time

    Who We Are At Kyndryl, we design, build, manage and modernize the mission-critical technology systems that the world depends on every day. So why work at Kyndryl? We are always moving forward – always pushing ourselves to go further in our efforts to build a more equitable, inclusive world for our employees, our customers and our communities. The...


  • Chennai, Tamil Nadu, India Athenahealth Technology Private Limited Full time

    Job Description: Join us as we work to create a thriving ecosystem that delivers accessible, high-quality, and sustainable healthcare for all. We are looking for a Senior Site Reliability Engineer to join our Service Operations, Site Reliability Engineering team within the Cloud Infrastructure Engineering division. This team is newly formed and is responsible...


  • Chennai, Tamil Nadu, India Tredence Inc. Full time

    Job Title: Site Reliability Engineer (SRE). Experience Level: 8-12 years. Locations: Pune, Chennai, Gurgaon, Kolkata. We are seeking a highly skilled and experienced Site Reliability Engineer (SRE) to shape and guide our SRE journey. The ideal candidate will bring both technical expertise and SRE knowledge to establish robust observability, incident...


  • Chennai, Tamil Nadu, India Burgeon It Services Pvt Ltd Full time

    Job Title: SRE Engineer. Location: Chennai. Experience: 8+ Years. Job Description: We are seeking an experienced Site Reliability Engineer (SRE) to join our dynamic team. The ideal candidate will have a strong background in software engineering and operations, with a passion for building scalable and reliable systems. Key Responsibilities: - Design, implement,...


  • Chennai, Tamil Nadu, India Athenahealth Full time

    Join us as we work to create a thriving ecosystem that delivers accessible, high-quality, and sustainable healthcare for all. We are looking for a Senior Site Reliability Engineer to join our Service Operations, Site Reliability Engineering team within the Cloud Infrastructure Engineering division. This team is newly formed and is responsible for managing...


  • Chennai, Tamil Nadu, India Zf Friedrich Full time

    Job Description: Req ID 77489 | GEC Chennai, India, ZF Commercial Vehicle Control Systems India Limited. About the Team: The Garuda team is an SRE team responsible for the reliability and operations of our Fleet management services platform. We ensure the availability and performance of the platform through proactive incident management,...


  • Chennai, Tamil Nadu, India HARP Technologies and Services Full time

    Experience: 8+ Years. Location: Mumbai, Chennai (other cities remote). Notice period: Immediate to 30 days max. Responsibilities of Senior SRE: - The Site Reliability Engineering (SRE) team is responsible for the reliability, scalability, stability and performance of systems and services. - They work with cross-functional teams to design, build and maintain...


  • Chennai, Tamil Nadu, India Kiash Solutions LLp Full time

    We are hiring a Site Reliability Engineer (SRE) with strong expertise in Azure operations, containerized workflows (Docker), and Python scripting. The ideal candidate will lead efforts to ensure system reliability, automate operational tasks, and optimize cloud-based infrastructure, while collaborating with cross-functional teams to deliver high-performing...


  • Chennai, Tamil Nadu, India ZF Group Full time

    Job Description: About the Team: The Garuda team is an SRE team responsible for the reliability and operations of our Fleet management services platform. We ensure the availability and performance of the platform through proactive incident management, optimization, and continuous improvement while contributing to the development of SCALAR's...