Site Reliability Engineer
2 hours ago
Senior Site Reliability Engineer (GCP | Terraform | Ansible | SRE | On-Call)We are looking for a high-impact Site Reliability Engineer (SRE) who will play a key role in ensuring the reliability, availability, and scalability of our production systems on Google Cloud Platform (GCP).If you thrive in fast-paced environments, excel in incident management, and love building automated, scalable infrastructure—this role is for you.🔧 ResponsibilitiesProduction Reliability & On-Call ExcellenceAct as a primary responder in a 24×7 rotational on-call schedule.Rapidly identify, mitigate, and resolve high-severity production incidents impacting GCP services.Conduct detailed Root Cause Analysis (RCA) and implement long-term corrective actions.Infrastructure-as-Code (IaC)Design, build, and maintain large-scale, multi-environment infrastructure using Terraform.Develop reusable modules, follow best practices, and maintain version-controlled infrastructure deployments.Configuration ManagementBuild and optimize Ansible playbooks and roles for configuration consistency, patching, and environment provisioning.Automation & ToolingDevelop automation using Python, Go, or Bash to eliminate operational toil and accelerate engineering productivity.Drive automation-first culture across the SRE team.Monitoring, Observability & ToolingEnhance monitoring, logging, and alerting using tools like Prometheus, Grafana, Stackdriver, or similar.Improve observability for proactive detection of service health degradation.Containers & OrchestrationManage and troubleshoot Kubernetes (GKE) clusters for deployment, scaling, and reliability of containerized applications.SRE Best PracticesDefine and measure SLIs/SLOs, engineer reliability, and reduce toil through automation.Collaborate closely with DevOps, Cloud, and Engineering teams for continuous improvement.🔍 RequirementsMust Have3+ years of hands-on experience on GCP, including GKE, GCE, VPC networking, IAM, load balancers, security, and networking fundamentals.Advanced expertise in Terraform for production-grade infrastructure deployments.Strong Ansible experience for configuration management.Proven experience in on-call rotations, incident response, and handling critical production issues.Proficiency in Python, Go, or Bash for automation.Strong understanding of SRE principles: SLIs/SLOs, error budgets, incident management, RCA.Experience with Kubernetes, containerization, and troubleshooting distributed systems.Nice to HaveExposure to service mesh (Istio/Linkerd).Experience with CI/CD pipelines (Jenkins, GitLab CI, Cloud Build).Networking and security certifications (GCP Associate Cloud Engineer / Professional Cloud DevOps Engineer).🌟 What We OfferOpportunity to work on high-scale, mission-critical systems.A culture of ownership, innovation, and automation.Competitive compensation + on-call benefits.Growth opportunities in SRE, Cloud, and Platform Engineering tracks.📨 How to ApplyShare your updated resume at: deepika.balijepally@eminds.ai
-
Site Reliability Engineer
1 week ago
bangalore, India super Full timeSite Reliability Engineer (SRE) Level 3Overview:A Site Reliability Engineer (SRE) Level 3 is a senior technical leadership role focused on designing, implementing, and maintaining large-scale, complex, and highly reliable systems. This role emphasizes a blend of software and systems engineering to ensure the availability, latency, performance, and capacity...
-
Site Reliability Engineer
1 day ago
bangalore, India Pagos Consultants Full timewe are looking for experienced site reliability engineers to join a founding team of startup-minded individuals that will lay the groundwork for our new fintech offering. This team will play a pivotal role in spearheading innovation. As such, you will have the opportunity to shape the early architecture and design of the system and set the trajectory for its...
-
Site Reliability Engineer
1 week ago
bangalore, India Datum Technologies Group Full timeJob Title: Site Reliability Engineer (SRE) – AWS Experience: 8+ years Location: Chennai / Mumbai Work Mode: Hybrid Key Skills: AWS, Terraform, Kubernetes, Docker, Grafana, Prometheus, Datadog Job Summary: We are looking for a skilled Site Reliability Engineer (SRE) with strong AWS experience and a solid background in DevOps, automation, observability, and...
-
Site Reliability Engineer
2 weeks ago
Bangalore, India CodeKarma Full timeSite Reliability Engineer (Multi-Cloud Deployments) Location: Bangalore / Remote Experience: 4–10 years Type: Full-time (6-month probation) About CodeKarma CodeKarma is redefining how engineering teams understand and evolve complex systems — bringing production context directly into the developer’s workflow. Our platform runs both as SaaS and as sub-
-
Site Reliability Engineer
2 weeks ago
Bangalore, India Flipkart Full timeHiring Site Reliability Engineers Exp : 2.5 +years (Excluding internship) Location : Bangalore Apply Here : The engineer will work in the Reliability and Productivity Engineering team and is responsible for building industry standard large scale platforms to be utilised across FK that helps to significantly improve the reliability of systems and bring...
-
Site Reliability Engineer
1 week ago
bangalore, India Andor Tech Full timeHiring!!🏢 About AndorTechAndorTech is a global IT services and consulting firm founded in 2009, headquartered in Bangalore. The company specializes in software engineering, AI-enabled IT services, application support, analytics, and test automation. With a presence across India, the USA, Europe, and the UAE, AndorTech partners with Global Capability...
-
Site Reliability Engineer
1 week ago
Bangalore, India Andor Tech Full timeHiring!! About AndorTech AndorTech is a global IT services and consulting firm founded in 2009, headquartered in Bangalore. The company specializes in software engineering, AI-enabled IT services, application support, analytics, and test automation . With a presence across India, the USA, Europe, and the UAE, AndorTech partners with Global Capability Centers...
-
Site Reliability Engineer
1 week ago
bangalore, India Karix Full timeRole: Site Reliability Engineer Location: Bangalore (WFO) About the role: We are seeking an experienced professional Site Reliability Engineer who acts as a bridge between development and IT operations, taking operational tasks to ensure the efficient functioning of Service platforms. They are responsible for monitoring, automating, and improving the...
-
Site Reliability Engineer
1 week ago
bangalore, India Karix Full timeRole: Site Reliability Engineer Location: Bangalore (WFO) About the role: We are seeking an experienced professional Site Reliability Engineer who acts as a bridge between development and IT operations, taking operational tasks to ensure the efficient functioning of Service platforms. They are responsible for monitoring, automating, and improving the...
-
Site Reliability Engineer
7 days ago
bangalore, India Karix Full timeRole: Site Reliability Engineer Location: Bangalore (WFO) About the role: We are seeking an experienced professional Site Reliability Engineer who acts as a bridge between development and IT operations, taking operational tasks to ensure the efficient functioning of Service platforms. They are responsible for monitoring, automating, and improving the...