Site Reliability Engineer

21 hours ago


bangalore, India Enterprise Minds, Inc Full time

Senior Site Reliability Engineer (GCP | Terraform | Ansible | SRE | On-Call) We are looking for a high-impact Site Reliability Engineer (SRE) who will play a key role in ensuring the reliability, availability, and scalability of our production systems on Google Cloud Platform (GCP) . If you thrive in fast-paced environments, excel in incident management, and love building automated, scalable infrastructure—this role is for you. Responsibilities Production Reliability & On-Call Excellence Act as a primary responder in a 24×7 rotational on-call schedule . Rapidly identify, mitigate, and resolve high-severity production incidents impacting GCP services. Conduct detailed Root Cause Analysis (RCA) and implement long-term corrective actions. Infrastructure-as-Code (IaC) Design, build, and maintain large-scale, multi-environment infrastructure using Terraform . Develop reusable modules, follow best practices, and maintain version-controlled infrastructure deployments. Configuration Management Build and optimize Ansible playbooks and roles for configuration consistency, patching, and environment provisioning. Automation & Tooling Develop automation using Python, Go, or Bash to eliminate operational toil and accelerate engineering productivity. Drive automation-first culture across the SRE team. Monitoring, Observability & Tooling Enhance monitoring, logging, and alerting using tools like Prometheus, Grafana, Stackdriver , or similar. Improve observability for proactive detection of service health degradation. Containers & Orchestration Manage and troubleshoot Kubernetes (GKE) clusters for deployment, scaling, and reliability of containerized applications. SRE Best Practices Define and measure SLIs/SLOs , engineer reliability, and reduce toil through automation. Collaborate closely with DevOps, Cloud, and Engineering teams for continuous improvement. Requirements Must Have 3+ years of hands-on experience on GCP , including GKE, GCE, VPC networking, IAM, load balancers, security, and networking fundamentals. Advanced expertise in Terraform for production-grade infrastructure deployments. Strong Ansible experience for configuration management. Proven experience in on-call rotations , incident response, and handling critical production issues. Proficiency in Python, Go, or Bash for automation. Strong understanding of SRE principles : SLIs/SLOs, error budgets, incident management, RCA. Experience with Kubernetes , containerization, and troubleshooting distributed systems. Nice to Have Exposure to service mesh (Istio/Linkerd). Experience with CI/CD pipelines (Jenkins, GitLab CI, Cloud Build). Networking and security certifications (GCP Associate Cloud Engineer / Professional Cloud DevOps Engineer). What We Offer Opportunity to work on high-scale, mission-critical systems . A culture of ownership, innovation, and automation. Competitive compensation + on-call benefits. Growth opportunities in SRE, Cloud, and Platform Engineering tracks. How to Apply Share your updated resume at:



  • bangalore, India super Full time

    Site Reliability Engineer (SRE) Level 3Overview:A Site Reliability Engineer (SRE) Level 3 is a senior technical leadership role focused on designing, implementing, and maintaining large-scale, complex, and highly reliable systems. This role emphasizes a blend of software and systems engineering to ensure the availability, latency, performance, and capacity...


  • bangalore, India Pagos Consultants Full time

    we are looking for experienced site reliability engineers to join a founding team of startup-minded individuals that will lay the groundwork for our new fintech offering. This team will play a pivotal role in spearheading innovation. As such, you will have the opportunity to shape the early architecture and design of the system and set the trajectory for its...


  • bangalore, India Tata Consultancy Services Full time

    TCS has been a great pioneer in feeding the fire of young techies like you. We are a global leader in the technology arena and there’s nothing that can stop us from growing together. What we are looking for Role: Site Reliability Engineering (SRE) Experience Range: 5 – 15 Years Location: Chennai/Pune candidates should come to office for Walk in...


  • bangalore, India Tata Consultancy Services Full time

    TCS has been a great pioneer in feeding the fire of young techies like you. We are a global leader in the technology arena and there’s nothing that can stop us from growing together.What we are looking forRole: Site Reliability Engineering (SRE)Experience Range: 5 – 15 YearsLocation: Chennai/Punecandidates should come to office for Walk in Drive(Face to...


  • bangalore, India Enterprise Minds, Inc Full time

    Senior Site Reliability Engineer (GCP | Terraform | Ansible | SRE | On-Call)We are looking for a high-impact Site Reliability Engineer (SRE) who will play a key role in ensuring the reliability, availability, and scalability of our production systems on Google Cloud Platform (GCP).If you thrive in fast-paced environments, excel in incident management, and...


  • bangalore, India Datum Technologies Group Full time

    Job Title: Site Reliability Engineer (SRE) – AWS Experience: 8+ years Location: Chennai / Mumbai Work Mode: Hybrid Key Skills: AWS, Terraform, Kubernetes, Docker, Grafana, Prometheus, Datadog Job Summary: We are looking for a skilled Site Reliability Engineer (SRE) with strong AWS experience and a solid background in DevOps, automation, observability, and...


  • bangalore, India Enterprise Minds, Inc Full time

    Senior Site Reliability Engineer (GCP | Terraform | Ansible | SRE | On-Call) We are looking for a high-impact Site Reliability Engineer (SRE) who will play a key role in ensuring the reliability, availability, and scalability of our production systems on Google Cloud Platform (GCP) . If you thrive in fast-paced environments, excel in incident management, and...


  • bangalore, India Insight Global Full time

    Company: Insight GlobalDuration: Approved for 1 year📍 Location: Remote (India)💼 Type: Contract with Insight Global Client💰 Compensation: 14 LPA – 20 LPA🕒 Working Hours: Normal IST hours🚀 Start Date: Immediate (No notice period)About the RoleJoin our Site Reliability Engineering (SRE) team as a Fullstack Developer, focused on building and...


  • bangalore, India Hydrolix Full time

    About the jobAt Hydrolix, we are revolutionizing the world of data management and analytics with our innovative cloud data platform, purpose-built for petabyte-scale datasets. Our mission is to help organizations drastically reduce data costs while increasing their data retention.We are looking for a Site Reliability Engineer (SRE) with 8 to 10+ years...


  • bangalore, India Andor Tech Full time

    Hiring!!🏢 About AndorTechAndorTech is a global IT services and consulting firm founded in 2009, headquartered in Bangalore. The company specializes in software engineering, AI-enabled IT services, application support, analytics, and test automation. With a presence across India, the USA, Europe, and the UAE, AndorTech partners with Global Capability...