Site Reliability Engineer

2 hours ago


Bangalore, India Enterprise Minds, Inc Full time

Senior Site Reliability Engineer (GCP | Terraform | Ansible | SRE | On-Call)

We are looking for a high-impact Site Reliability Engineer (SRE) who will play a key role in ensuring the reliability, availability, and scalability of our production systems on Google Cloud Platform (GCP).

If you thrive in fast-paced environments, excel in incident management, and love building automated, scalable infrastructure, this role is for you.

🔧 Responsibilities

Production Reliability & On-Call Excellence
  • Act as a primary responder in a 24×7 rotational on-call schedule.
  • Rapidly identify, mitigate, and resolve high-severity production incidents impacting GCP services.
  • Conduct detailed Root Cause Analysis (RCA) and implement long-term corrective actions.

Infrastructure-as-Code (IaC)
  • Design, build, and maintain large-scale, multi-environment infrastructure using Terraform.
  • Develop reusable modules, follow best practices, and maintain version-controlled infrastructure deployments.

Configuration Management
  • Build and optimize Ansible playbooks and roles for configuration consistency, patching, and environment provisioning.

Automation & Tooling
  • Develop automation using Python, Go, or Bash to eliminate operational toil and accelerate engineering productivity.
  • Drive an automation-first culture across the SRE team.

Monitoring, Observability & Tooling
  • Enhance monitoring, logging, and alerting using tools like Prometheus, Grafana, Stackdriver, or similar.
  • Improve observability for proactive detection of service health degradation.

Containers & Orchestration
  • Manage and troubleshoot Kubernetes (GKE) clusters for deployment, scaling, and reliability of containerized applications.

SRE Best Practices
  • Define and measure SLIs/SLOs, engineer reliability, and reduce toil through automation.
  • Collaborate closely with DevOps, Cloud, and Engineering teams for continuous improvement.

🔍 Requirements

Must Have
  • 3+ years of hands-on experience with GCP, including GKE, GCE, VPC networking, IAM, load balancers, security, and networking fundamentals.
  • Advanced expertise in Terraform for production-grade infrastructure deployments.
  • Strong Ansible experience for configuration management.
  • Proven experience in on-call rotations, incident response, and handling critical production issues.
  • Proficiency in Python, Go, or Bash for automation.
  • Strong understanding of SRE principles: SLIs/SLOs, error budgets, incident management, RCA.
  • Experience with Kubernetes, containerization, and troubleshooting distributed systems.

Nice to Have
  • Exposure to service mesh (Istio/Linkerd).
  • Experience with CI/CD pipelines (Jenkins, GitLab CI, Cloud Build).
  • Networking and security certifications (GCP Associate Cloud Engineer / Professional Cloud DevOps Engineer).

🌟 What We Offer
  • Opportunity to work on high-scale, mission-critical systems.
  • A culture of ownership, innovation, and automation.
  • Competitive compensation + on-call benefits.
  • Growth opportunities in SRE, Cloud, and Platform Engineering tracks.

📨 How to Apply

Share your updated resume at deepika.balijepally@eminds.ai



  • Bangalore, India super Full time

    Site Reliability Engineer (SRE) Level 3. Overview: A Site Reliability Engineer (SRE) Level 3 is a senior technical leadership role focused on designing, implementing, and maintaining large-scale, complex, and highly reliable systems. This role emphasizes a blend of software and systems engineering to ensure the availability, latency, performance, and capacity...


  • Bangalore, India Pagos Consultants Full time

    We are looking for experienced site reliability engineers to join a founding team of startup-minded individuals that will lay the groundwork for our new fintech offering. This team will play a pivotal role in spearheading innovation. As such, you will have the opportunity to shape the early architecture and design of the system and set the trajectory for its...


  • Bangalore, India Datum Technologies Group Full time

    Job Title: Site Reliability Engineer (SRE) – AWS Experience: 8+ years Location: Chennai / Mumbai Work Mode: Hybrid Key Skills: AWS, Terraform, Kubernetes, Docker, Grafana, Prometheus, Datadog Job Summary: We are looking for a skilled Site Reliability Engineer (SRE) with strong AWS experience and a solid background in DevOps, automation, observability, and...


  • Bangalore, India CodeKarma Full time

    Site Reliability Engineer (Multi-Cloud Deployments) Location: Bangalore / Remote Experience: 4–10 years Type: Full-time (6-month probation) About CodeKarma CodeKarma is redefining how engineering teams understand and evolve complex systems — bringing production context directly into the developer’s workflow. Our platform runs both as SaaS and as sub-


  • Bangalore, India Flipkart Full time

    Hiring Site Reliability Engineers. Experience: 2.5+ years (excluding internship). Location: Bangalore. The engineer will work in the Reliability and Productivity Engineering team and is responsible for building industry-standard, large-scale platforms to be utilised across FK that help to significantly improve the reliability of systems and bring...


  • Bangalore, India Andor Tech Full time

    Hiring!! 🏢 About AndorTech: AndorTech is a global IT services and consulting firm founded in 2009, headquartered in Bangalore. The company specializes in software engineering, AI-enabled IT services, application support, analytics, and test automation. With a presence across India, the USA, Europe, and the UAE, AndorTech partners with Global Capability...


  • Bangalore, India Andor Tech Full time

    Hiring!! About AndorTech: AndorTech is a global IT services and consulting firm founded in 2009, headquartered in Bangalore. The company specializes in software engineering, AI-enabled IT services, application support, analytics, and test automation. With a presence across India, the USA, Europe, and the UAE, AndorTech partners with Global Capability Centers...


  • Bangalore, India Karix Full time

    Role: Site Reliability Engineer Location: Bangalore (WFO) About the role: We are seeking an experienced Site Reliability Engineer who acts as a bridge between development and IT operations, taking on operational tasks to ensure the efficient functioning of service platforms. They are responsible for monitoring, automating, and improving the...

