Site Reliability Engineer
10 hours ago
Senior Site Reliability Engineer (GCP | Terraform | Ansible | SRE | On-Call)We are looking for ahigh-impact Site Reliability Engineer (SRE)who will play a key role in ensuring the reliability, availability, and scalability of our production systems onGoogle Cloud Platform (GCP) . If you thrive in fast-paced environments, excel in incident management, and love building automated, scalable infrastructure—this role is for you.Responsibilities Production Reliability & On-Call Excellence Act as a primary responder in a24×7 rotational on-call schedule . Rapidly identify, mitigate, and resolvehigh-severity production incidentsimpacting GCP services. Conduct detailedRoot Cause Analysis (RCA)and implement long-term corrective actions. Infrastructure-as-Code (IaC) Design, build, and maintainlarge-scale, multi-environment infrastructureusingTerraform . Develop reusable modules, follow best practices, and maintain version-controlled infrastructure deployments. Configuration Management Build and optimizeAnsible playbooks and rolesfor configuration consistency, patching, and environment provisioning. Automation & Tooling Develop automation usingPython, Go, or Bashto eliminate operational toil and accelerate engineering productivity. Drive automation-first culture across the SRE team. Monitoring, Observability & Tooling Enhance monitoring, logging, and alerting using tools likePrometheus, Grafana, Stackdriver , or similar. Improve observability for proactive detection of service health degradation. Containers & Orchestration Manage and troubleshootKubernetes (GKE)clusters for deployment, scaling, and reliability of containerized applications. SRE Best Practices Define and measureSLIs/SLOs , engineer reliability, and reduce toil through automation. Collaborate closely with DevOps, Cloud, and Engineering teams for continuous improvement.Requirements Must Have 3+ years of hands-on experience on GCP , including GKE, GCE, VPC networking, IAM, load balancers, security, and networking fundamentals. Advanced expertise in Terraformfor production-grade infrastructure deployments. Strong Ansible experiencefor configuration management. Proven experience inon-call rotations , incident response, and handling critical production issues. Proficiency inPython, Go, or Bashfor automation. Strong understanding ofSRE principles : SLIs/SLOs, error budgets, incident management, RCA. Experience withKubernetes , containerization, and troubleshooting distributed systems.Nice to Have Exposure toservice mesh(Istio/Linkerd). Experience withCI/CD pipelines(Jenkins, GitLab CI, Cloud Build). Networking and security certifications (GCP Associate Cloud Engineer / Professional Cloud DevOps Engineer).What We Offer Opportunity to work onhigh-scale, mission-critical systems . A culture of ownership, innovation, and automation. Competitive compensation + on-call benefits. Growth opportunities in SRE, Cloud, and Platform Engineering tracks.How to Apply Share your updated resume at:deepika.balijepally@eminds.ai
-
Site Reliability Engineer
3 weeks ago
New Delhi, India Tata Consultancy Services Full timeRole: Site Reliability Engineer Experience: 4 to 7 Years Locations: Chennai/Pune/Kolkata
-
Site Reliability Engineer
3 weeks ago
New Delhi, India Datum Technologies Group Full timeJob Title: Site Reliability Engineer (SRE) – Azure & AIExperience: 7+ yearsWork Mode: HybridWork Location: Chennai/Mumbai/GurgaonJob Summary:We are looking for an experienced Site Reliability Engineer (SRE) with strong expertise in Microsoft Azure, AI infrastructure, and automation. The ideal candidate will have a solid background in managing cloud...
-
Site Reliability Engineer
1 week ago
New Delhi, India Datum Technologies Group Full timeJob Title: Site Reliability Engineer (SRE) – AWSExperience: 8+ years Location: Chennai / Mumbai Work Mode: HybridKey Skills:AWS, Terraform, Kubernetes, Docker, Grafana, Prometheus, DatadogJob Summary:We are looking for a skilled Site Reliability Engineer (SRE) with strong AWS experience and a solid background in DevOps, automation, observability, and...
-
Site Reliability Engineer
2 weeks ago
New Delhi, India Grootan Technologies Full timeAbout the RoleWe are seeking a skilled Site Reliability Engineer (SRE) with 4–5 years of hands-on experience to join our engineering team. In this role, you will be responsible for building and maintaining reliable, scalable, and secure infrastructure to support our applications. You will leverage your expertise in automation, cloud platforms, and...
-
Site Reliability Engineer
1 week ago
New Delhi, India Elios Talent Full timeSite Reliability EngineerKey Highlights️ Build, automate, and support cloud-native infrastructure powering high-availability platforms⚡ Contribute to automation-first engineering across AWS, Terraform, CI/CD, and observability toolingImprove reliability, uptime, system health, and performance across production environmentsStrengthen DevSecOps...
-
Site Reliability Engineer
5 days ago
New Delhi, India Synechron Full timeWe have immediate opportunity forSRE (Senior Site Reliability Engineer) 5 to 9 years. Synechron –PuneJob Role: -SRE (Senior Site Reliability Engineer) Job Location: -PuneAbout Synechron We began life in 2001 as a small, self-funded team of technology specialists. Since then, we’ve grown our organization to 14,500+ people, across 58 offices, in 21...
-
Site Reliability Engineer
2 weeks ago
New Delhi, India Tata Consultancy Services Full timeRole: Site Reliability Engineer Location: Chennai/Bangalore/HyderabadExp- 5-11 years 1.Exposure to any APM tool like Dynatrace, Appdynamics, Splunk, etc 2.DBA or Infra admin 3.Gremlin or Chaos Monkey or Simian Army or Litmus expertise 4.Exposure to ITSM tools like Service Now, etc 5.Understanding of Automation and Chaos Engineering 6.Exposure to Devops tools...
-
Site Reliability Engineer
2 weeks ago
New Delhi, India VXI Global Solutions Full timeWe are looking for a Site Reliability Engineer with 3+ years for Experience into design, implement, and manage robust observability solutions across our cloud infrastructure and applications. The ideal candidate will have hands-on experience withPrometheus ,Grafana , along with exposure toSolarWinds . You should be comfortable working withmetrics, logs, and...
-
Site Reliability Engineer
1 week ago
New Delhi, India VXI Global Solutions Full timeWe are looking for a Site Reliability Engineer with 3+ years for Experience into design, implement, and manage robust observability solutions across our cloud infrastructure and applications. The ideal candidate will have hands-on experience with Prometheus, Grafana, along with exposure to SolarWinds. You should be comfortable working with metrics, logs, and...
-
Site Reliability Engineer
1 week ago
New Delhi, India Andor Tech Full timeHiring!! About AndorTech AndorTech is aglobal IT services and consulting firmfounded in 2009, headquartered in Bangalore. The company specializes insoftware engineering, AI-enabled IT services, application support, analytics, and test automation. With a presence across India, the USA, Europe, and the UAE, AndorTech partners withGlobal Capability Centers...