Senior Site Reliability Engineer

5 days ago


Delhi, India iVoyant Full time

One of our clients is looking for an experienced Senior Site Reliability Engineer (SRE) - Mission-Critical SaaS Cloud Products to join their team.

Key Responsibilities:

Reliability and Performance Management:
- Design, implement, and maintain highly available, scalable, and resilient cloud-native architectures for mission-critical SaaS products.
- Develop and implement SLOs, SLIs, and SLAs to measure and improve service reliability.
- Continuously optimize system performance and resource utilization across multiple cloud platforms.
- Fine-tune and optimize application performance by analyzing code, traces, and database queries.

Incident Management and Troubleshooting:
- Lead incident response efforts, effectively troubleshooting complex issues to minimize downtime and impact.
- Reduce Mean Time to Recover (MTTR) through proactive monitoring, automated alerting, and efficient problem-solving techniques.
- Conduct thorough Root Cause Analysis (RCA) for all major incidents and implement preventive measures.

Observability and Monitoring:
- Design and implement end-to-end observability solutions across our distributed systems.
- Develop and maintain comprehensive monitoring strategies using tools like ELK Stack, Prometheus, Grafana.
- Create and optimize product status dashboards to provide real-time visibility into system health and performance.

Automation and Infrastructure as Code (IaC):
- Implement Infrastructure as Code practices using tools like Terraform.
- Develop and maintain automated deployment pipelines and CI/CD workflows.
- Create self-healing systems and automate routine operational tasks to reduce manual intervention.

Cloud-Agnostic Architecture:
- Design and implement cloud-agnostic solutions that can operate efficiently across multiple cloud providers.
- Develop expertise in event-driven architecture and related technologies (e.g., Apache Kafka/EventHub, Redis, Mongo Atlas, IoTHub).
- Implement and manage containerized applications using Kubernetes across different cloud environments.

Continuous Improvement:
- Regularly review and refine operational practices to enhance efficiency and reliability.
- Stay updated with the latest industry trends and technologies in SRE, cloud computing, and DevOps.
- Contribute to the development of internal tools and frameworks to support SRE practices.

Requirements:
- Strong knowledge of cloud platforms (Azure) and their associated services.
- Expert in observability tools (ELK Stack, Dynatrace, Prometheus).
- Expertise in containerization technologies such as Docker and Kubernetes.
- Understanding of event-driven architecture and database technologies (Mongo Atlas, Azure SQL, Postgres DB).
- Proficient in IaC tools such as Terraform and GitHub Actions.
- Proficiency in one or more programming languages: Python, .NET, or Java.
- Strong understanding of networking concepts, load balancing, and security practices.



  • New Delhi, India Tata Consultancy Services Full time

    Dear Candidates, Greetings from TCS!!! TCS is looking for Senior Site Reliability Engineer – AWS. Experience: 8-12 years. Location: Chennai. Must have skills: Design, implement, and maintain scalable, secure, and highly available infrastructure on AWS. Develop and improve CI/CD pipelines, Infrastructure as Code (IaC) using Terraform, Harness. Own and implement...

  • Site Engineer

    1 week ago


    Delhi, Delhi, India Engineer Department Full time ₹ 6,00,000 - ₹ 12,00,000 per year

    Company Description: Engineer Department is a company dedicated to providing efficient and effective engineering solutions for public infrastructure and services. Our team is committed to ensuring the highest standards in project management and execution, serving the community with integrity and professionalism. Role Description: This is a full-time...


  • Delhi, India Sonata Software Full time

    We're Hiring: Senior Site Reliability Engineer. Location: Onsite (Office: Hyderabad – Mandatory from Day 1). Employment Type: Full-time. Notice Period: Immediate to 15 Days Only. Experience: 8+ Years. About the Role: We’re looking for a Senior Site Reliability Engineer (SRE) to lead reliability initiatives across our production systems. This is a high-impact role...


  • New Delhi, India WhiteLotus Talent Partners Full time

    We are looking for an L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes. In this role, you will focus on monitoring, basic troubleshooting, and incident response, helping to maintain high system...


  • New Delhi, India SID Global Solutions Full time

    Job Role: Site Reliability Engineer (SRE) – GCP. Experience: 3+ years. Location: Hyderabad. About SIDGS: SIDGS is a premium global systems integrator and global implementation partner of Google corporation, providing Digital Solutions & Services to Fortune 500 companies. Our digital solutions span the following domains: User Experience, CMS, API Management,...


  • New Delhi, India Movius Full time

    Senior Staff Site Reliability Engineer. Location: Bengaluru, KA, 560076. Job Description: We are seeking a highly skilled Senior Staff Site Reliability Engineer with extensive experience in DevOps/SRE roles and large-scale distributed systems. The ideal candidate will have a proven background in cloud operations, automation, and CI/CD, with a preference for...

