Senior Site Reliability Engineer

2 days ago


Bengaluru, Karnataka, India CloudHire Full time

Job Summary

The Technical Manager for Site Reliability Engineering (SRE) will lead a remote team of Site Reliability Engineers, ensuring operational excellence and fostering a high-performing team culture. Reporting to the US-based Director of Systems and Security, this role is responsible for overseeing day-to-day operations, technical mentorship, and strategic alignment with the company's goals. The Technical Manager will act as a bridge between the team and senior leadership, ensuring clear communication, efficient issue resolution, and continuous improvement in service delivery.

Job Responsibilities:

● Provide leadership and management to a remote team of Site Reliability Engineers, ensuring alignment with organizational priorities and goals.

● Oversee team operations, including incident management, technical support, and infrastructure maintenance.

● Act as the primary point of escalation for complex technical issues, collaborating with the Director of Systems and Security, Quality Assurance and Product teams as needed.

● Ensure the team adheres to established SLAs for issue resolution and maintains high customer satisfaction levels.

● Mentor and develop team members, fostering growth in technical skills, problem-solving abilities, and customer engagement.

● Lead initiatives to improve operational processes, tools, and workflows, driving greater efficiency and reliability.

● Collaborate with cross-functional teams, including Product, Engineering, and Operations, to address customer needs and improve platform performance.

● Facilitate regular team meetings, performance reviews, and one-on-one sessions to ensure clear communication and ongoing development.

● Maintain and report on key performance metrics, providing insights and recommendations to senior leadership.

● Stay informed on industry trends and best practices, ensuring the team is equipped with the latest tools and methodologies.

● Participate in strategic planning and contribute to the continuous improvement of the SRE function.

Qualifications:

● 6+ Years of proven experience managing technical teams, preferably in Site Reliability Engineering, DevOps, or a related field.

● Strong technical background in cloud computing and infrastructure management, particularly with AWS and Linux-based systems.

● Demonstrated ability to lead and mentor teams in remote and distributed environments.

● Excellent written and oral English communication and interpersonal skills, with the ability to engage effectively with both technical and non-technical stakeholders.

● Strong problem-solving and decision-making abilities, with a focus on root cause analysis and long-term solutions.

● Experience with automation tools (Terraform, Ansible, CloudFormation) and CI/CD pipelines.

● Familiarity with incident management practices and tools, as well as ticketing systems.

● High attention to detail and a commitment to operational excellence.

● Bachelor's degree in a technical or quantitative science field, or equivalent work experience.

Preferred Qualifications:

● AWS certification (any level).

● Experience leading customer-facing technical teams, with a focus on improving service delivery.

● Knowledge of security best practices and governance in cloud environments.

● Strong understanding of networking concepts and system architecture.

Key Attributes:

● Empathetic leader who values collaboration, transparency, and accountability.

● Proactive mindset with a focus on continuous improvement and innovation.

● Ability to prioritize and manage multiple initiatives in a fast-paced environment.

● Strategic thinker who can align team efforts with broader organizational objectives.

● Passion for enabling team growth and fostering a culture of learning and development.

Job Location: On-site Kolkata

Rotating three shifts range between 5:00 AM and 9:00 PM

Salary Budget: 12LPA



  • Bengaluru, Karnataka, India Akamai Full time

    Job Category Site Reliability Would you like to lead modernization initiatives while building a public cloud platform from scratch Would you like to own critical services in a new public cloud platform Join our IaaS Site Reliability Engineering SRE team We design develop and operate infrastructure and services that power the backbone of our...


  • Bengaluru, Karnataka, India WhiteLotus Talent Partners Full time

    We are looking for a L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes . In this role, you will focus on monitoring , basic troubleshooting , and incident response , helping to maintain high...


  • Bengaluru, Karnataka, India WhiteLotus Talent Partners Full time

    We are looking for a L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes. In this role, you will focus on monitoring, basic troubleshooting, and incident response, helping to maintain high system...


  • Bengaluru, Karnataka, India WhiteLotus Talent Partners Full time

    We are looking for a L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes. In this role, you will focus on monitoring, basic troubleshooting, and incident response, helping to maintain high system...


  • Bengaluru, Karnataka, India Josys Full time US$ 1,50,000 - US$ 2,00,000 per year

    Senior Site Reliability Engineer (SRE)About JOSYSJosys, a dynamic B2B SaaS platform startup, has embarked on a mission to revolutionize IT operations globally, following an exceptional launch in Japan and securing $125 million in Series A and B funding. Our platform enables businesses to conquer the complexities of work-from-anywhere setups, rapid digital...


  • Bengaluru, Karnataka, India LanceSoft, Inc. Full time ₹ 6,00,000 - ₹ 8,00,000 per year

    Role DescriptionThis is a full-time on-site role for a Senior Site Reliability Engineer based in Bangalore/Chennai/Pune. The Senior Site Reliability Engineer will be responsible for maintaining and enhancing the reliability and performance of the company's IT infrastructure & Development. Daily tasks include troubleshooting system issues, ensuring system...


  • Bengaluru, Karnataka, India beBeeSiteReliability Full time ₹ 20,00,000 - ₹ 30,00,000

    As a senior site reliability engineer, you will play a critical role in ensuring the stability and scalability of financial platforms.Key Responsibilities:Ensure defined SLAs, SLOs, and SLIs are met for performance, reliability, and uptime.Build automation for deployments, monitoring, scaling, and self-healing capabilities to reduce manual effort and...


  • Bengaluru, Karnataka, India HireAlpha Full time ₹ 15,00,000 - ₹ 20,00,000 per year

    We're Hiring | Senior Site Reliability Engineer (SRE)Bangalore | HybridPermanent RoleAre you ready to help shape the future of cloud contact centers? we're building scalable, reliable, and cutting-edge infrastructure for world-class customer experiences — and we're looking for aSenior SREto join our teamWhat you'll do:Lead efforts in building a seamless ...


  • Bengaluru, Karnataka, India Aerospike Full time US$ 1,50,000 - US$ 2,00,000 per year

    About AerospikeAt Aerospike, we dream big. Our focus is helping companies tackle seemingly insurmountable problems and doing what's never been done before. That is why we developed the world's leading real-time data platform that powers mission-critical applications at the world's most innovative, category-disrupting companies. Aerospike companies have...


  • Bengaluru, Karnataka, India Luxoft Full time ₹ 9,00,000 - ₹ 12,00,000 per year

    Project descriptionLuxoft partner with next-generation digital bank, built from the ground up to deliver seamless, secure, and scalable financial services. Our platform is cloud-native, API-first, and focused on reliability, speed, and security. We are growing fast and looking for top-tier Site Reliability / Ops Engineers to join our core team and help run...