Site Reliability Engineering Manager

3 weeks ago


bangalore, India CloudHire Full time

Job Summary The Technical Manager for Site Reliability Engineering (SRE) will lead a remote team of Site Reliability Engineers, ensuring operational excellence and fostering a high-performing team culture. Reporting to the US-based Director of Systems and Security, this role is responsible for overseeing day-to-day operations, technical mentorship, and strategic alignment with the company’s goals. The Technical Manager will act as a bridge between the team and senior leadership, ensuring clear communication, efficient issue resolution, and continuous improvement in service delivery. Job Category Technology Solutions Responsibilities: ● Provide leadership and management to a remote team of Site Reliability Engineers, ensuring alignment with organizational priorities and goals. ● Oversee team operations, including incident management, technical support, and infrastructure maintenance. ● Act as the primary point of escalation for complex technical issues, collaborating with the Director of Systems and Security, Quality Assurance and Product teams as needed. ● Ensure the team adheres to established SLAs for issue resolution and maintains high customer satisfaction levels. ● Mentor and develop team members, fostering growth in technical skills, problem-solving abilities, and customer engagement. ● Lead initiatives to improve operational processes, tools, and workflows, driving greater efficiency and reliability. ● Collaborate with cross-functional teams, including Product, Engineering, and Operations, to address customer needs and improve platform performance. ● Facilitate regular team meetings, performance reviews, and one-on-one sessions to ensure clear communication and ongoing development. ● Maintain and report on key performance metrics, providing insights and recommendations to senior leadership. ● Stay informed on industry trends and best practices, ensuring the team is equipped with the latest tools and methodologies. ● Participate in strategic planning and contribute to the continuous improvement of the SRE function. Qualifications: ● Proven experience managing technical teams, preferably in Site Reliability Engineering, DevOps, or a related field. ● Strong technical background in cloud computing and infrastructure management, particularly with AWS and Linux-based systems. ● Demonstrated ability to lead and mentor teams in remote and distributed environments. ● Excellent written and oral English communication and interpersonal skills, with the ability to engage effectively with both technical and non-technical stakeholders. ● Strong problem-solving and decision-making abilities, with a focus on root cause analysis and long-term solutions. ● Experience with automation tools (Terraform, Ansible, CloudFormation) and CI/CD pipelines. ● Familiarity with incident management practices and tools, as well as ticketing systems. ● High attention to detail and a commitment to operational excellence. ● Bachelor’s degree in a technical or quantitative science field, or equivalent work experience. Preferred Qualifications: ● AWS certification (any level). ● Experience leading customer-facing technical teams, with a focus on improving service delivery. ● Knowledge of security best practices and governance in cloud environments. ● Strong understanding of networking concepts and system architecture. Key Attributes: ● Empathetic leader who values collaboration, transparency, and accountability. ● Proactive mindset with a focus on continuous improvement and innovation. ● Ability to prioritize and manage multiple initiatives in a fast-paced environment. ● Strategic thinker who can align team efforts with broader organizational objectives. ● Passion for enabling team growth and fostering a culture of learning and development. Job Location: Kolkata Rotational Shift Budget: 12 LPA



  • bangalore, India super Full time

    Site Reliability Engineer (SRE) Level 3Overview:A Site Reliability Engineer (SRE) Level 3 is a senior technical leadership role focused on designing, implementing, and maintaining large-scale, complex, and highly reliable systems. This role emphasizes a blend of software and systems engineering to ensure the availability, latency, performance, and capacity...


  • Bangalore, India Aqilea (formerly Soltia) Full time

    We are a consulting company with a bunch of technology-interested and happy people! We love technology, we love design and we love quality. Our diversity makes us unique and creates an inclusive and welcoming workplace where each individual is highly valued.With us, each individual is her/himself and respects others for who they are and we believe that when...


  • bangalore, India Datum Technologies Group Full time

    Job Title: Site Reliability Engineer (SRE) – AWS Experience: 8+ years Location: Chennai / Mumbai Work Mode: Hybrid Key Skills: AWS, Terraform, Kubernetes, Docker, Grafana, Prometheus, Datadog Job Summary: We are looking for a skilled Site Reliability Engineer (SRE) with strong AWS experience and a solid background in DevOps, automation, observability, and...


  • bangalore, India Andor Tech Full time

    Hiring!!🏢 About AndorTechAndorTech is a global IT services and consulting firm founded in 2009, headquartered in Bangalore. The company specializes in software engineering, AI-enabled IT services, application support, analytics, and test automation. With a presence across India, the USA, Europe, and the UAE, AndorTech partners with Global Capability...


  • Bangalore, India Flipkart Full time

    Hiring Site Reliability Engineers Exp : 2.5 +years (Excluding internship) Location : Bangalore Apply Here : The engineer will work in the Reliability and Productivity Engineering team and is responsible for building industry standard large scale platforms to be utilised across FK that helps to significantly improve the reliability of systems and bring...


  • Bangalore, India Andor Tech Full time

    Hiring!! About AndorTech AndorTech is a global IT services and consulting firm founded in 2009, headquartered in Bangalore. The company specializes in software engineering, AI-enabled IT services, application support, analytics, and test automation . With a presence across India, the USA, Europe, and the UAE, AndorTech partners with Global Capability Centers...


  • bangalore, India JRD Systems Full time

    Site Reliability Engineer (Windows / Cloud / Automation) Job Summary: We are seeking an experienced Site Reliability Engineer with a strong background in managing Windows infrastructure and cloud environments. The ideal candidate will be responsible for designing, implementing, automating, and maintaining scalable infrastructure solutions across AWS, Azure,...


  • bangalore, India Cyberhaven Full time

    About the roleWe're looking for an experienced Site Reliability engineer for making sure systems are reliable, scalable, and performing well especially in production environments. Our technology is new and rapidly evolving as an early member on the team, you'll play a key role in shaping the reliability architecture, building scalable infrastructure, and...


  • bangalore, India Glocomms Full time

    We are currently looking for an SRE Lead - to join our customer - an IT consultancy with urgent projects on board.This will be a 6 month contract initially with an option to extend further.Must have 10+ years exp.Responsibilities:- Assess application architecture and implement patterns for reliability and performance.- Automate workflows and reduce manual...


  • bangalore, India Weekday AI Full time

    This role is for one of the Weekday's clientsSalary range: Rs Rs ie INR 12-20 LPA)Min Experience: 1 yearsLocation: BengaluruJobType: full-timeAs an SRE, you will work closely with product engineering, DevOps, and platform teams to build resilient services, improve deployment processes, and drive operational excellence across the organization. You will be...