Site Reliability Engineer

18 hours ago


bangalore, India Weekday AI Full time

This role is for one of the Weekday's clientsSalary range: Rs Rs ie INR 12-20 LPA)Min Experience: 1 yearsLocation: BengaluruJobType: full-timeAs an SRE, you will work closely with product engineering, DevOps, and platform teams to build resilient services, improve deployment processes, and drive operational excellence across the organization. You will be responsible for maintaining the health of our applications and infrastructure, strengthening reliability practices, and ensuring optimal system performance.RequirementsKey Responsibilities1. Reliability & Performance Ensure high availability, resilience, and performance of production systems. Conduct root cause analysis, implement long-term fixes, and reduce recurring incidents. Develop and tune SLIs, SLOs, and error budgets in collaboration with engineering teams. 2. Infrastructure & Operations Build, maintain, and optimize cloud infrastructure (AWS/Azure/GCP). Implement Infrastructure-as-Code (IaC) using tools like Terraform, CloudFormation, or similar. Manage compute, storage, networking, load balancers, and container orchestration systems. Maintain CI/CD pipelines to streamline deployments and operational workflows. 3. Automation & Tooling Automate operational tasks, scaling, failover, monitoring, and configuration management. Develop tooling to improve engineering efficiency and reduce manual interventions. Implement proactive alerting, self-healing mechanisms, and automated remediation workflows. 4. Observability & Incident Management Build end-to-end observability using logs, metrics, traces, and dashboards. Respond to production issues, participate in on-call rotations, and manage incident lifecycle. Establish and improve incident response processes, runbooks, and reliability best practices. 5. Collaboration & Continuous Improvement Partner with developers to design reliable architectures and production-ready solutions. Promote SRE principles, reliability mindset, and performance culture across teams. Contribute to capacity planning, cost optimization, and system scalability initiatives. Required Skills & Qualifications 1–4 years of experience in Site Reliability Engineering, DevOps, or Infrastructure Engineering. Strong understanding of SRE fundamentals, including SLIs, SLOs, error budgets, and operational maturity models. Hands-on experience with cloud platforms (AWS/Azure/GCP) and distributed systems. Proficiency in Linux systems, OS internals, networking concepts, and performance troubleshooting. Experience with Infrastructure-as-Code tools such as Terraform, CloudFormation, or Pulumi. Familiarity with containerization (Docker) and orchestration (Kubernetes). Good understanding of CI/CD pipelines, Git workflows, and release engineering. Exposure to monitoring and observability tools (Prometheus, Grafana, ELK, Datadog, etc.). Scripting proficiency in Bash, Python, or Go (preferred). Strong analytical, problem-solving, and incident-management skills.



  • bangalore, India super Full time

    Site Reliability Engineer (SRE) Level 3Overview:A Site Reliability Engineer (SRE) Level 3 is a senior technical leadership role focused on designing, implementing, and maintaining large-scale, complex, and highly reliable systems. This role emphasizes a blend of software and systems engineering to ensure the availability, latency, performance, and capacity...


  • Bangalore, India Aqilea (formerly Soltia) Full time

    We are a consulting company with a bunch of technology-interested and happy people! We love technology, we love design and we love quality. Our diversity makes us unique and creates an inclusive and welcoming workplace where each individual is highly valued.With us, each individual is her/himself and respects others for who they are and we believe that when...


  • bangalore, India Datum Technologies Group Full time

    Job Title: Site Reliability Engineer (SRE) – AWS Experience: 8+ years Location: Chennai / Mumbai Work Mode: Hybrid Key Skills: AWS, Terraform, Kubernetes, Docker, Grafana, Prometheus, Datadog Job Summary: We are looking for a skilled Site Reliability Engineer (SRE) with strong AWS experience and a solid background in DevOps, automation, observability, and...


  • Bangalore, India CodeKarma Full time

    Site Reliability Engineer (Multi-Cloud Deployments) Location: Bangalore / Remote Experience: 4–10 years Type: Full-time (6-month probation) About CodeKarma CodeKarma is redefining how engineering teams understand and evolve complex systems — bringing production context directly into the developer’s workflow. Our platform runs both as SaaS and as sub-


  • Bangalore, India Flipkart Full time

    Hiring Site Reliability Engineers Exp : 2.5 +years (Excluding internship) Location : Bangalore Apply Here : The engineer will work in the Reliability and Productivity Engineering team and is responsible for building industry standard large scale platforms to be utilised across FK that helps to significantly improve the reliability of systems and bring...


  • bangalore, India Andor Tech Full time

    Hiring!!🏢 About AndorTechAndorTech is a global IT services and consulting firm founded in 2009, headquartered in Bangalore. The company specializes in software engineering, AI-enabled IT services, application support, analytics, and test automation. With a presence across India, the USA, Europe, and the UAE, AndorTech partners with Global Capability...


  • bangalore, India Cyberhaven Full time

    About the roleWe're looking for an experienced Site Reliability engineer for making sure systems are reliable, scalable, and performing well especially in production environments. Our technology is new and rapidly evolving as an early member on the team, you'll play a key role in shaping the reliability architecture, building scalable infrastructure, and...


  • Bangalore, India Andor Tech Full time

    Hiring!! About AndorTech AndorTech is a global IT services and consulting firm founded in 2009, headquartered in Bangalore. The company specializes in software engineering, AI-enabled IT services, application support, analytics, and test automation . With a presence across India, the USA, Europe, and the UAE, AndorTech partners with Global Capability Centers...


  • bangalore, India Karix Full time

    Role: Site Reliability Engineer Location: Bangalore (WFO) About the role: We are seeking an experienced professional Site Reliability Engineer who acts as a bridge between development and IT operations, taking operational tasks to ensure the efficient functioning of Service platforms. They are responsible for monitoring, automating, and improving the...


  • bangalore, India Karix Full time

    Role: Site Reliability Engineer Location: Bangalore (WFO) About the role: We are seeking an experienced professional Site Reliability Engineer who acts as a bridge between development and IT operations, taking operational tasks to ensure the efficient functioning of Service platforms. They are responsible for monitoring, automating, and improving the...