Site Reliability Engineering Manager

2 weeks ago


Delhi, Delhi, India CloudHire Full time
The Technical Manager for Site Reliability Engineering (SRE) will lead a team of Site Reliability Engineers, ensuring operational excellence and fostering a high-performing team culture. Reporting to the US-based Director of Systems and Security, this role is responsible for overseeing day-to-day operations, technical mentorship, and strategic alignment with the company's goals. The Technical Manager will act as tier 2 support and also as a bridge between the team and senior leadership, ensuring clear communication, efficient issue resolution, and continuous improvement in service delivery.
This is a client-facing support role for a highly technical product offered as a Platform-as-a-Service (PaaS) to our clients.

Responsibilities:
● Provide leadership and management to a remote team of Site Reliability Engineers, ensuring alignment with organizational priorities and goals.
● Oversee team operations, including incident management, technical support, and infrastructure maintenance.
● Act as the primary point of escalation for complex technical issues (tier 2,) collaborating with the Director of Systems and Security, Quality Assurance, and Product teams as needed.
● Ensure the team adheres to established SLAs for issue resolution and maintains high customer satisfaction levels.
● Mentor and develop team members, fostering growth in technical skills, problem-solving abilities, and customer engagement.
● Lead initiatives to improve operational processes, tools, and workflows, driving greater efficiency and reliability.
● Collaborate with cross-functional teams, including Product, Engineering, and Operations, to address customer needs and improve platform performance.
● Facilitate regular team meetings, performance reviews, and one-on-one sessions to ensure clear communication and ongoing development.
● Maintain and report on key performance metrics, providing insights and recommendations to senior leadership.
● Stay informed on industry trends and best practices, ensuring the team is equipped with the latest tools and methodologies.
● Participate in strategic planning and contribute to the continuous improvement of the SRE function.

Qualifications:
● Experience leading customer-facing technical teams, with a focus on improving service delivery.
● Proven experience managing technical teams, preferably in Customer Support or a related field.
● Strong technical background in cloud computing and infrastructure management, particularly with AWS and Linux-based systems.
● Strong understanding of networking concepts and system architecture, including but not limited to VPN connectivity, routing, and basic firewall rules
● Demonstrated ability to lead and mentor teams in remote and distributed environments.
● Excellent written and oral English communication and interpersonal skills, with the ability to engage effectively with both technical and non-technical stakeholders.
● Strong problem-solving and decision-making abilities, with a focus on root cause analysis and long-term solutions.
● Familiarity with incident management practices and tools, as well as ticketing systems.
● High attention to detail and a commitment to operational excellence.
● Bachelor's degree in a technical or quantitative science field, or equivalent work experience

Preferred Qualifications:
● AWS certification (any level).
● Knowledge of security best practices and governance in cloud environments.

Key Attributes:
● Empathetic leader who values collaboration, transparency, and accountability.
● Proactive mindset with a focus on continuous improvement and innovation.
● Strategic thinker who can align team efforts with broader organizational objectives.
● Passion for enabling team growth and fostering a culture of learning and development.

Location : Kolkata (Onsite)
Rotational Shift Timing Range between : 5:00 AM to 9:00 PM
Salary Range : 12 LPA to 14 LPA

  • Delhi, Delhi, India Cricbuzz Full time

    Site Reliability EngineerWe are looking for a highly skilled and motivated Web Server Site Reliability Engineer to join our team. As a Web Server Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our web server infrastructure and CDN services.Experience - 3 - 5 yearsResponsibilities:● Design,...


  • Delhi, Delhi, India FourthPointer Services Pvt. Ltd. Full time

    Job Title : Site Reliability Engineer (SRE)Experience Required : 5+ yearsLocation : Noida (Remote)Job Description :We are looking for an experienced Infrastructure Site Reliability Engineer (SRE) to join our team. This role involves managing and optimizing infrastructure with a primary focus on Kafka, OpenSearch, and multi-cloud environments.Key...


  • Delhi, Delhi, India FourthPointer Services Pvt. Ltd. Full time

    Job Title : Site Reliability Engineer (SRE)Experience Required : 5+ yearsLocation : Noida (Remote)Job Description :We are looking for an experienced Infrastructure Site Reliability Engineer (SRE) to join our team. This role involves managing and optimizing infrastructure with a primary focus on Kafka, OpenSearch, and multi-cloud environments.Key...


  • Delhi, Delhi, India Bright Vision Technologies Full time

    B Exciting Opportunity for Site Reliability Engineer- H1B Sponsorship for 2025 at Bright Vision Technologies Join the Bright Vision Technologies Team: Where Innovation Meets Opportunity  www.bvteck.com As we approach the 2025 H1B filing season, we are excited to offer a unique opportunity for talented professionals like you to work with our direct clients...


  • Delhi, Delhi, India BigRio Full time

    Job Title: Site Reliability EngineerLocation: Remote with Quarterly visits to Chennai, Tamil Nadu, IndiaDuration: Full-TimeAbout BigRio:BigRio is a remote-based, technology consulting firm headquartered in Boston, MA. We deliver software solutions ranging from custom development and software implementation to data analytics and machine learning/AI...


  • Delhi, Delhi, India SITA Full time

    SITA is seeking a skilled Security Site Reliability Engineer (SRE) to join its team. In this role, you will play a key part in maintaining and enhancing systems operational efficiency by leveraging your expertise in security engineering and site reliability.ResponsibilitiesCreate and review Infrastructure as Code to meet regulatory and market security...


  • Delhi, Delhi, India NEXUS HR CONSULTANTS PVT. LTD. Full time

    Job Summary :As the Head of Site Reliability Engineering (SRE), you will lead a team of talented engineers responsible for ensuring the reliability, availability, and performance of our organization's technology infrastructure and systems. You will play a critical role in defining and implementing the SRE strategy, establishing best practices, and driving...


  • Delhi, Delhi, India CloudHire Full time

    The Site Reliability Engineer (SRE) is responsible for maintaining high standards of quality customer service and support. In this role, you will be providing front-line customer support for our flagship product, Metworx. The Metworx product is delivered as a Platform-as-a-Service to our clients and provides a stable, scalable, and reproducible computing...


  • Delhi, Delhi, India Stackave Solutions Full time

    Job Title: Core Banking Senior Platform Engineer(Site Reliability Engineer)Experience Range: 5+ yearsLocation: BangaloreBusiness and Private Banking - Core Banking ModernizationKey ResponsibilitiesRun the production environment by monitoring availability and taking a holistic view of system healthRun core banking ledger and switch systems with the domain...


  • Delhi, Delhi, India Antal International Full time

    Job Description Summary role description: Hiring for a Site Reliability Engineer for a fastest-growing energy technology company. Company description: Our client is one of the fastest-growing energy technology companies in India, founded by some of the leaders in this space. They lead technological innovation for the most effective energy...


  • Delhi, Delhi, India Zeta Full time

    About Zeta Zeta is a Next-Gen Banking Tech company that empowers banks and fintechs to launch banking products for the future. It was founded by Bhavin Turakhia and Ramki Gaddipati in 2015. Our flagship processing platform - Zeta Tachyon - is the industry's first modern, cloud-native, and fully API-enabled stack that brings together issuance, processing,...


  • Delhi, Delhi, India Zeta Full time

    About ZetaZeta is a Next-Gen Banking Tech company that empowers banks and fintechs to launch banking products for the future. It was founded by Bhavin Turakhia and Ramki Gaddipati in 2015.Our flagship processing platform - Zeta Tachyon - is the industry's first modern, cloud-native, and fully API-enabled stack that brings together issuance, processing,...


  • Delhi, Delhi, India Agivant Technologies Full time

    Job Description : We are looking for a highly skilled Site Reliability Engineer (SRE) with strong engineering and architectural expertise to design, implement, and manage large-scale, mission-critical infrastructure across multiple data centers and cloud providers. As an SRE, you will be responsible for architecting and optimizing our global infrastructure,...


  • Delhi, Delhi, India Agivant Technologies Full time

    Job Description : We are looking for a highly skilled Site Reliability Engineer (SRE) with strong engineering and architectural expertise to design, implement, and manage large-scale, mission-critical infrastructure across multiple data centers and cloud providers. As an SRE, you will be responsible for architecting and optimizing our global infrastructure,...


  • Delhi, Delhi, India Zeta Full time

    About ZetaZeta is a Next-Gen Banking Tech company that empowers banks and fintechs to launch banking products for the future. It was founded by Bhavin Turakhia and Ramki Gaddipati in 2015.Our flagship processing platform - Zeta Tachyon - is the industry's first modern, cloud-native, and fully API-enabled stack that brings together issuance, processing,...


  • Delhi, Delhi, India Antal International Full time

    Job Description Summary role description:  Hiring for a Site Reliability Engineer for a fastest-growing energy technology company. Company description: Our client is one of the fastest-growing energy technology companies in India, founded by some of the leaders in this space. They lead technological innovation for the most effective energy...


  • Delhi, Delhi, India Buncha Full time

    About the Role:We are seeking a passionate and detail-oriented Site Reliability Engineer to join our dynamic team. The ideal candidate will have 3+ years of experience in system monitoring, reliability, and troubleshooting applications. You will play a crucial role in ensuring the availability, performance, and scalability of our systems.Key...


  • Delhi, Delhi, India Tekgence Inc Full time

    Datacenter Observability and Site Reliability EngineerLocation: Remote, Indiacontract Duration: 6 months+working hours: 5.30 am to 2.30 pm ISTRoles and Responsibilities:Observability and Monitoring:Design, implement, and maintain observability solutions for datacenter infrastructure.Develop, deploy, and maintain the operational and reliability components of...


  • Delhi, Delhi, India Tata Consultancy Services Full time

    Skill :Site Reliability EngineerJob Location : New DelhiExperience : 8 to 12 YearsJob Description :You will design and architect distributed systems in the cloud and understand how to move systems from on-prem data centers to the cloudYou will create monitoring, alerting and dashboarding solutions that improve visibility into EA's application performance and...


  • Delhi, Delhi, India Tata Consultancy Services Limited Full time

    Job DescriptionSite Reliability Engineering (SRE)Job DescriptionResponsibilities:- You will design and architect distributed systems in the cloud and understand how to move systems from on-prem data centers to the cloud- You will create monitoring, alerting and dashboarding solutions that improve visibility into EA's application performance and business...