Site Reliability Engineering Manager

14 hours ago


New Delhi, India Tata Consultancy Services Full time

Role**: Manager, Site Reliability EngineeringRequired Technical Skill Set: Manager, Site Reliability EngineeringDesired Experience Range: 12 - 18 yrsNotice Period: Immediate to 90Days onlyLocation of Requirement: BangaloreWe are currently planning to do a Virtual InterviewJob Description:Describe what the person will do in the role - how he/she will impact the organization.As the Manager of Site Reliability Engineering on the Infrastructure Reliability team, you will be responsible for building and leading a high-performing team dedicated to ensuring our infrastructure is reliable, scalable, and efficient. Your primary focus will be on people management, strategic planning, and technical leadership. You will mentor and guide your team members, fostering their professional growth and creating a culture of ownership and operational excellence. You will define the team's vision and roadmap, aligning it with the company's broader goals, and work with cross-functional partners to prioritize and execute projects. You will oversee the development of SRE solutions across our globally distributed environments and empowering your team to improve service resiliency, automate processes, and conduct effective incident response and capacity planning to guarantee the highest level of uptime and Quality of Service (QoS) for our internal customers.Responsibilities and Duties of the Role:Summarize job responsibilities, core deliverables and major duties. What is required for the position to exist?-Focus on major areas of work, typically 20% or more of role% of Time- Lead, mentor, and grow a team of software and infrastructure automation engineers. - Develop and execute the roadmap for the Infrastructure Reliability Engineering team. - Collaborate with engineering and operations teams to identify and prioritize reliability improvements. - Drive the design and implementation of tools and automation for infrastructure testing and self-healing. - Establish and monitor key performance indicators (KPIs) for infrastructure reliability.10%Minimum and Preferred. Inclusive of Licenses/Certs (include functional experience as well as behavioral attributes and/or leadership capabilities)Basic Qualifications- Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent experience. - 12+ years of experience in a software engineering or infrastructure role. - 5+ years of experience in a leadership or management role. - Lead a team of Infrastructure Reliability Engineers on projects for users and be directly responsible for uptime. - Own end-to-end availability and performance of key services and build automation to prevent problem recurrence. Automate response to all non-exceptional service conditions. - Set the standard for excellence by mentoring team members and establishing trust through superior technical delivery. - Proficiency in Kubernetes administration and modern CI/CD techniques and Infrastructure as Code (IaC). - Deep understanding of Linux operating systems and TCP/IP fundamentals. - Experience with monitoring, metrics gathering, APM, container management, and log collection tools. - Creative problem solver with excellent debugging skills and great documentation abilities. - Strong understanding of networking, storage, security, and compute technologies.Preferred Qualifications- Experience building and leading a Site Reliability Engineering (SRE) or Infrastructure Reliability team. - Expertise with complex system architectures and infrastructures. - Proficiency in one or more programming languages (e.g., Python, Go, Java). - Passion for automation, scalability, and building reliable systems from the ground up.



  • New Delhi, India Endpoint Clinical Full time

    About Us:Endpoint is an interactive response technology (IRT®) systems and solutions provider that supports the life sciences industry. Since 2009, we have been working with a single vision in mind, to help sponsors and pharmaceutical companies achieve clinical trial success. Our solutions, realized through the proprietary PULSE® platform, have proven to...


  • New Delhi, India Endpoint Clinical Full time

    About Us:Endpoint is an interactive response technology (IRT®) systems and solutions provider that supports the life sciences industry. Since 2009, we have been working with a single vision in mind, to help sponsors and pharmaceutical companies achieve clinical trial success. Our solutions, realized through the proprietary PULSE® platform, have proven to...


  • New Delhi, India HDFC Limited Full time

    Hiring for Lead / Sr Site Reliability Engineer for Mumbai & Bangalore Location Experience - 8 - 14 YearsJob Purpose Analysing, troubleshooting, and designing vital services, platforms, and infrastructure on GCP while always thinking about reliability, scalability, resilience, security, and performance.Job Responsibilities:Help build a Site Reliability...


  • New Delhi, India IntraEdge Full time

    Job Title: Site Reliability Engineer (SRE) – Production Support Location: BengaluruJob Summary: We are looking for a skilledSite Reliability Engineer (SRE)with strong experience inproduction support, DevOps practices, and cloud infrastructure management . The ideal candidate will be responsible for maintaining the reliability, performance, and scalability...


  • New Delhi, India SID Global Solutions Full time

    Job Role: Site Reliability Engineer (SRE) – GCP Experience: 3+ years Location: HyderabadAbout SIDGS: SIDGS is a premium global systems integrator and global implementation partner of Google corporation, providing Digital Solutions & Services to Fortune 500 companies. Our Digital solutions go across following domains: User Experience, CMS, API Management,...


  • New Delhi, India Batch Systems Inc Full time

    Batch is a brand-first technology platform designed to amplify customer engagement, enable frictionless transactions, defend product authenticity, elevate customer loyalty, and ignite customer growth. Our mission is to provide seamless solutions that help businesses build stronger connections with their customers. With a focus on enhancing the customer...


  • New Delhi, India Resource Algorithm Full time

    Senior SRE (Engineering & Reliability) Job Summary: We are seeking an experienced and dynamic Site Reliability Engineering (SRE) Lead to oversee the reliability, scalability, and performance of our critical systems.As an SeniorSRE, you will play a pivotal role in establishing and implementing SRE practices, leading a team of engineers, and driving...


  • New Delhi, India JRD Systems Full time

    Position:Site Reliability Engineer (SRE) Role Overview: We are seeking an experienced Site Reliability Engineer (SRE) with a strong background inWindows infrastructureto manage and optimize our cloud and on-premises environments. The ideal candidate will partner with development teams to improve service reliability, implement automation, and ensure...


  • New Delhi, India Trantor Full time

    Job Title - Site Reliability Engineer Role- Contract (9 Months- Extendable) Exp- 5+ years Loc- Bangalore ( Hybrid) Notice- Immediate joiner onlyDuties: Responsible for maintaining and scaling production services and servers across multiple data centers for complex and data-intensive cloud services Improve scalability, service reliability, capacity, and...


  • New Delhi, India Insight Global Full time

    Job Description: Title: Site Reliability Engineer Location: Hyderabad (4 days onsite and 1 day remote)Required Skills & Experience: Bachelor's degree in computer science, Engineering, or related field 5+ years of experience in SRE or related roles Proficiency in Python and experience with Kubernetes and Kafka Experience with Ignition SCADA and RESTful APIs...