
Site Reliability Engineering Manager
4 weeks ago
Job Summary
The Technical Manager for Site Reliability Engineering (SRE) will lead a remote team of Site Reliability Engineers, ensuring operational excellence and fostering a high-performing team culture. Reporting to the US-based Director of Systems and Security, this role is responsible for overseeing day-to-day operations, technical mentorship, and strategic alignment with the company's goals. The Technical Manager will act as a bridge between the team and senior leadership, ensuring clear communication, efficient issue resolution, and continuous improvement in service delivery.
Job Responsibilities:
● Provide leadership and management to a remote team of Site Reliability Engineers, ensuring alignment with organizational priorities and goals.
● Oversee team operations, including incident management, technical support, and infrastructure maintenance.
● Act as the primary point of escalation for complex technical issues, collaborating with the Director of Systems and Security and with the Quality Assurance and Product teams as needed.
● Ensure the team adheres to established SLAs for issue resolution and maintains high customer satisfaction levels.
● Mentor and develop team members, fostering growth in technical skills, problem-solving abilities, and customer engagement.
● Lead initiatives to improve operational processes, tools, and workflows, driving greater efficiency and reliability.
● Collaborate with cross-functional teams, including Product, Engineering, and Operations, to address customer needs and improve platform performance.
● Facilitate regular team meetings, performance reviews, and one-on-one sessions to ensure clear communication and ongoing development.
● Maintain and report on key performance metrics, providing insights and recommendations to senior leadership.
● Stay informed on industry trends and best practices, ensuring the team is equipped with the latest tools and methodologies.
● Participate in strategic planning and contribute to the continuous improvement of the SRE function.
Qualifications:
● 6+ years of proven experience managing technical teams, preferably in Site Reliability Engineering, DevOps, or a related field.
● Strong technical background in cloud computing and infrastructure management, particularly with AWS and Linux-based systems.
● Demonstrated ability to lead and mentor teams in remote and distributed environments.
● Excellent written and oral English communication and interpersonal skills, with the ability to engage effectively with both technical and non-technical stakeholders.
● Strong problem-solving and decision-making abilities, with a focus on root cause analysis and long-term solutions.
● Experience with automation tools (Terraform, Ansible, CloudFormation) and CI/CD pipelines.
● Familiarity with incident management practices and tools, as well as ticketing systems.
● High attention to detail and a commitment to operational excellence.
● Bachelor's degree in a technical or quantitative science field, or equivalent work experience.
Preferred Qualifications:
● AWS certification (any level).
● Experience leading customer-facing technical teams, with a focus on improving service delivery.
● Knowledge of security best practices and governance in cloud environments.
● Strong understanding of networking concepts and system architecture.
Key Attributes:
● Empathetic leader who values collaboration, transparency, and accountability.
● Proactive mindset with a focus on continuous improvement and innovation.
● Ability to prioritize and manage multiple initiatives in a fast-paced environment.
● Strategic thinker who can align team efforts with broader organizational objectives.
● Passion for enabling team growth and fostering a culture of learning and development.
Job Location: On-site, Kolkata
Shifts: Three rotating shifts between 5:00 AM and 9:00 PM
Salary Budget: 12 LPA (fixed)
-
Site Reliability Engineering Manager
3 weeks ago
India Coinbase Full time
Job Description: Ready to be pushed beyond what you think you're capable of? At Coinbase, our mission is to increase economic freedom in the world. It's a massive, ambitious opportunity that demands the best of us, every day, as we build the emerging onchain platform and with it, the future global financial system. To achieve our mission, we're seeking a very...
-
India AionNimbius Full time
We are looking for a Site Reliability Engineering Manager – Cloud Engineering to join our team in Bengaluru. This role will lead operations for a 24x7 cloud environment, ensuring our systems stay reliable, resilient, and ready to scale. You'll be the one making sure incidents are handled quickly, systems are well-documented, and automation is in place to...
-
Senior Site Reliability Engineer
4 weeks ago
India BQE Software Full time
We are seeking a Senior Site Reliability Engineer to lead reliability efforts across our application stack, focusing on high availability, performance, and scalability. This role will own the health and uptime of our mission-critical application, cloud infrastructure, database system, and monitoring infrastructure. About Us: At BQE, our mission...
-
Site Reliability Engineer
4 weeks ago
India CES Full time
We're looking for a highly skilled Site Reliability Engineer to help us build, manage, and scale modern infrastructure systems for high-availability applications. If you're passionate about automation, cloud platforms, and solving tough operational challenges, we would love to hear from you. Key Skills and Competencies: 3+ years of extensive experience with...
-
Junior Site Reliability Engineer
3 weeks ago
India JoVE Full time
JoVE is the world-leading producer and provider of science video solutions with the mission to improve scientific research and education. Millions of scientists, educators and students use JoVE for their research, teaching and learning. Our institutional clients comprise over 1,000 universities, colleges, and biopharma companies, including such leaders as...
-
Junior Site Reliability Engineer
4 weeks ago
India JoVE Full time
JoVE is the world-leading producer and provider of video solutions with the mission to improve scientific research and education. Millions of scientists, educators and students use JoVE for their research, teaching and learning. Our institutional clients comprise over 1,000 universities, colleges, and biopharma companies, including such leaders as Harvard,...
-
Site Reliability Engineer
2 days ago
Remote, India Rackspace Technology Full time
Job Description: Site Reliability Engineer / Observability Engineer. Public Cloud - Offerings and Delivery - Workforce Mgmt & Delivery Ops / Full-Time / Remote. Rackspace is building up its Professional Services Center of Excellence on Application Performance Monitoring Suites. If you enjoy solving complex business problems and can contribute to building next...
-
Site Reliability Engineer
1 day ago
India Xebia Full time
We are looking for a highly skilled AWS Engineer with strong Python development and Chaos Engineering expertise to design, build, and validate resilient, scalable, and automated cloud-native environments. The ideal candidate will combine cloud engineering, DevOps, and chaos experimentation to improve reliability, fault tolerance, and operational efficiency...
-
Urgent Search Site Reliability Engineer
3 weeks ago
India Pythian Full time
Remote Site Reliability Engineering - Site Reliability Engineering. Full Time, Remote. Site Reliability Engineer, India, Multiple Timezones. Remote, Work from Home. Why Pythian: At Pythian, we are experts in strategic database and analytics services, driving digital transformation and operational excellence. Pythian, a multinational company, was...
-
Senior Site Reliability Engineer- ELK Expert
4 weeks ago
India iVedha Inc. Full time
Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering Practice
Location: India (Remote) - Must be available to work in the EST (US/Canada) Time Zone.
Role Summary: Are you a Senior Site Reliability Engineer (SRE) with deep ELK expertise, ready to take ownership of large-scale observability infrastructure? We're looking for an SRE with 7+...