Site Reliability Engineering Manager

4 days ago


bangalore, India Tata Consultancy Services Full time

Role**: Manager, Site Reliability EngineeringRequired Technical Skill Set: Manager, Site Reliability EngineeringDesired Experience Range: 12 - 18 yrsNotice Period: Immediate to 90Days onlyLocation of Requirement: BangaloreWe are currently planning to do a Virtual Interview Job Description:Describe what the person will do in the role - how he/she will impact the organization.As the Manager of Site Reliability Engineering on the Infrastructure Reliability team, you will be responsible for building and leading a high-performing team dedicated to ensuring our infrastructure is reliable, scalable, and efficient. Your primary focus will be on people management, strategic planning, and technical leadership. You will mentor and guide your team members, fostering their professional growth and creating a culture of ownership and operational excellence. You will define the team's vision and roadmap, aligning it with the company's broader goals, and work with cross-functional partners to prioritize and execute projects. You will oversee the development of SRE solutions across our globally distributed environments and empowering your team to improve service resiliency, automate processes, and conduct effective incident response and capacity planning to guarantee the highest level of uptime and Quality of Service (QoS) for our internal customers.Responsibilities and Duties of the Role:Summarize job responsibilities, core deliverables and major duties. What is required for the position to exist?-Focus on major areas of work, typically 20% or more of role% of TimeLead, mentor, and grow a team of software and infrastructure automation engineers.Develop and execute the roadmap for the Infrastructure Reliability Engineering team.Collaborate with engineering and operations teams to identify and prioritize reliability improvements.Drive the design and implementation of tools and automation for infrastructure testing and self-healing.Establish and monitor key performance indicators (KPIs) for infrastructure reliability.10%Minimum and Preferred. Inclusive of Licenses/Certs (include functional experience as well as behavioral attributes and/or leadership capabilities)Basic QualificationsBachelor's degree in Computer Science, Engineering, or a related field, or equivalent experience.12+ years of experience in a software engineering or infrastructure role.5+ years of experience in a leadership or management role.Lead a team of Infrastructure Reliability Engineers on projects for users and be directly responsible for uptime.Own end-to-end availability and performance of key services and build automation to prevent problem recurrence. Automate response to all non-exceptional service conditions.Set the standard for excellence by mentoring team members and establishing trust through superior technical delivery.Proficiency in Kubernetes administration and modern CI/CD techniques and Infrastructure as Code (IaC).Deep understanding of Linux operating systems and TCP/IP fundamentals.Experience with monitoring, metrics gathering, APM, container management, and log collection tools.Creative problem solver with excellent debugging skills and great documentation abilities.Strong understanding of networking, storage, security, and compute technologies.Preferred QualificationsExperience building and leading a Site Reliability Engineering (SRE) or Infrastructure Reliability team.Expertise with complex system architectures and infrastructures.Proficiency in one or more programming languages (e.g., Python, Go, Java).Passion for automation, scalability, and building reliable systems from the ground up.



  • bangalore, India CloudHire Full time

    Job Summary The Technical Manager for Site Reliability Engineering (SRE) will lead a remote team of Site Reliability Engineers, ensuring operational excellence and fostering a high-performing team culture. Reporting to the US-based Director of Systems and Security, this role is responsible for overseeing day-to-day operations, technical mentorship, and...


  • bangalore, India CloudHire Full time

    Job SummaryThe Technical Manager for Site Reliability Engineering (SRE) will lead a remote team of Site Reliability Engineers, ensuring operational excellence and fostering a high-performing team culture. Reporting to the US-based Director of Systems and Security, this role is responsible for overseeing day-to-day operations, technical mentorship, and...


  • IND - Karnataka - BANGALORE, India Globalfoundries Engineering Private Limited Full time ₹ 12,00,000 - ₹ 24,00,000 per year

    Site Reliability Engineer About GlobalFoundries GlobalFoundries is a leading full-service semiconductor foundry providing a unique combination of design, development, and fabrication services to some of the world's most inspired technology companies. With a global manufacturing footprint spanning three continents, GlobalFoundries makes possible the...


  • Bangalore, India Aqilea (formerly Soltia) Full time

    We are a consulting company with a bunch of technology-interested and happy people!We love technology, we love design and we love quality. Our diversity makes us unique and creates an inclusive and welcoming workplace where each individual is highly valued.With us, each individual is her/himself and respects others for who they are and we believe that when a...


  • bangalore, India ViewSonic Full time

    Job Requirements:Bachelor's degree in Computer Science, Engineering, or a related field.3+ year of experience in a relevant role, such as Site Reliability Engineer, DevOps Engineer, or similar, is preferred but not mandatory.Basic understanding of AWS solutions including EC2, S3, CloudWatch, Lambda, and RDS.Interest and understanding of Platform Engineering...


  • bangalore district, India IntraEdge Full time

    Job Title: Site Reliability Engineer (SRE) – Production Support Location: Bengaluru Job Summary: We are looking for a skilled Site Reliability Engineer (SRE) with strong experience in production support, DevOps practices, and cloud infrastructure management . The ideal candidate will be responsible for maintaining the reliability, performance, and...


  • bangalore, India JRD Systems Full time

    Site Reliability Engineer (Windows / Cloud / Automation)Job Summary:We are seeking an experienced Site Reliability Engineer with a strong background in managing Windows infrastructure and cloud environments. The ideal candidate will be responsible for designing, implementing, automating, and maintaining scalable infrastructure solutions across AWS, Azure,...


  • bangalore, India WhiteLotus Talent Partners Full time

    We are looking for a L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes. In this role, you will focus on monitoring, basic troubleshooting, and incident response, helping to maintain high system...


  • bangalore, India JRD Systems Full time

    Site Reliability Engineer (Windows / Cloud / Automation) Job Summary: We are seeking an experienced Site Reliability Engineer with a strong background in managing Windows infrastructure and cloud environments. The ideal candidate will be responsible for designing, implementing, automating, and maintaining scalable infrastructure solutions across AWS, Azure,...


  • Bangalore, India JRD Systems Full time

    Site Reliability Engineer (Windows / Cloud / Automation) Job Summary: We are seeking an experienced Site Reliability Engineer with a strong background in managing Windows infrastructure and cloud environments. The ideal candidate will be responsible for designing, implementing, automating, and maintaining scalable infrastructure solutions across AWS, Azure,...