Cloud Reliability Engineering Director
2 weeks ago
We're seeking an experienced Site Reliability Engineering (SRE) Manager to lead our SRE team in India.
The SRE Manager will manage the team's day-to-day operations, drive strategic initiatives, and ensure system reliability and performance.
This role requires a strong technical background, leadership experience, and a strategic mindset to align the SRE team's efforts with business objectives.
The role also includes people management responsibilities for a team size of 10+ SREs.Key Responsibilities:
Lead and manage the SRE team in India, ensuring high availability and reliability of critical services and infrastructure.
Develop, implement, and manage SRE practices such as monitoring, incident management, capacity planning, and service level objectives (SLOs).
Collaborate with global SRE teams to define and align on SRE best practices, incident response, and service reliability strategies.
Oversee and manage the performance of systems in production and non-production environments, ensuring proactive monitoring, alerting, and capacity management.
Drive initiatives to enhance system reliability and performance, including automation, Infrastructure as Code (IaC), and continuous integration/continuous deployment (CI/CD) improvements.
Manage 24/7 on-call rotations and ensure the team is equipped to handle incidents and provide timely responses and escalations.Partner with engineering, DevOps, and security teams to ensure seamless integration of SRE practices and security compliance across the organization.
Mentor and coach team members to develop their skills and promote a culture of continuous improvement.Prepare and present regular updates on system performance, incidents, and project statuses to leadership and key stakeholders.
Ensure cost-efficient utilization of resources and drive infrastructure cost optimization initiatives.
Handle people management responsibilities, including recruitment, onboarding, performance reviews, and overall team development
Requirements:
Education and Experience:
Bachelor's or Master's in Computer Science, Information Technology, or a related field.
12+ years of experience in IT or software engineering, with at least 6 years in SRE, DevOps, or infrastructure management roles.
Proven experience in leading and managing SRE, DevOps, or infrastructure teams, preferably in a global or multi-regional setting.Prior experience in managing and mentoring a team of at least 5-10 engineers with strong people management skills.
Technical Skills:
Strong understanding of cloud platforms such as AWS, GCP, or Azure, with experience in cloud-native architectures and services.
Hands-on experience with automation tools, Infrastructure as Code (Terraform, CloudFormation), CI/CD pipelines, and configuration management tools (e.g., Ansible, Puppet).
Expertise in monitoring and observability tools (e.g., Prometheus, Grafana, Datadog, ELK stack).
Familiarity with containerization and orchestration technologies (e.g., Docker, Kubernetes).
Strong programming/scripting skills (Python, Go, Bash, NodeJS, or similar languages).
Leadership and Soft Skills:
Proven ability to lead, mentor, and develop high-performing teams of at least 10 engineers.
Excellent problem-solving and analytical skills, with a strong focus on system reliability and operational excellence.
Strong communication and collaboration skills, with experience working across multiple teams and stakeholders.
Ability to navigate ambiguity and drive strategic initiatives with limited resources and information.
Additional Qualifications:
Experience with security practices and compliance standards (ISO 27001, SOC 2, etc.) is a plus.
Experience with cost management and optimization strategies in cloud environments.
Previous experience managing incident response and on-call rotations for a 24/7 support environment.
-
Site Reliability Engineer
1 month ago
Delhi, Delhi, India Boost-IT Full timeAbout the RoleWe are seeking an experienced Site Reliability Engineer to join our team at Boost-IT. As a key member of our technical leadership team, you will be responsible for ensuring the reliability and scalability of our cloud infrastructure on GCP and Azure.You will lead and guide our team technically, providing technical guidance and mentorship while...
-
Delhi, Delhi, India Cloud Destinations Full timeLead DevOps EngineerAbout the Role : We are seeking a highly skilled and experienced Lead DevOps Engineer to join our dynamic team at Cloud Destinations.Job OverviewIn this role, you will be responsible for leading and driving our DevOps initiatives, automating infrastructure, and ensuring the reliability and scalability of our applications. Our ideal...
-
Reliability Engineer
3 weeks ago
Delhi, Delhi, India mccainfood Full timeJob Title: SRE EngineerJob Summary:We are seeking a highly skilled SRE Engineer to join our team. The successful candidate will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key Responsibilities:Design, implement, and maintain scalable and reliable cloud infrastructureDevelop and maintain...
-
Site Reliability Engineer
3 weeks ago
Delhi, Delhi, India mccainfood Full timeJob Title: Site Reliability EngineerJob Summary:We are seeking a highly skilled Site Reliability Engineer to join our team at McCain Foods. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and resilience of our systems and infrastructure. This is a critical role that requires a strong understanding of software development,...
-
Site Reliability Engineering Manager
1 month ago
Delhi, Delhi, India Boost IT Full timeJob Title: Site Reliability Engineering ManagerBoost IT is a company that brings people together to solve complex problems and create innovative solutions. We are passionate about technology and its potential to make a positive impact on society.We are seeking a talented Site Reliability Engineering Manager to join our team. The ideal candidate will have a...
-
Site Reliability Engineer
4 weeks ago
Delhi, Delhi, India mccainfood Full timeJob Summary:As a Site Reliability Engineer at McCain Foods, you will play a crucial role in ensuring the reliability and resilience of our systems and applications. Your primary responsibility will be to design, build, and maintain scalable and efficient infrastructure to support our business growth.Key Responsibilities:Collaborate with cross-functional...
-
Senior Data Engineering Director
1 day ago
Delhi, Delhi, India SourceBae Full timeJob Title: Senior Data Engineering DirectorAbout SourceBae:We are a forward-thinking organization dedicated to harnessing the power of data. Our mission is to drive innovation through cutting-edge technology and expertise.Salary: ₹2500000 - ₹3500000 per annum, depending on experienceJob Description:We are seeking an experienced Senior Data Engineering...
-
Cloud Infrastructure Engineer
3 weeks ago
Delhi, Delhi, India Boost-IT Full timeJob Title: Cloud Infrastructure EngineerAbout the Role:We are seeking an experienced Cloud Infrastructure Engineer to join our team at Boost-IT. As a key member of our technical team, you will be responsible for designing, deploying, and maintaining highly reliable and scalable cloud environments on GCP and Azure.Key Responsibilities:Design and deploy cloud...
-
Data Engineer
2 weeks ago
Delhi, Delhi, India Syren Cloud Inc Full timeAbout the RoleWe are seeking a highly skilled Data Engineer - Cloud Computing Expert to join our team at Syren Cloud Inc. In this role, you will be responsible for designing, developing, and implementing distributed applications and systems on the Azure Cloud platform.Key ResponsibilitiesDevelop and implement databases, data collection systems, data...
-
Site Reliability Engineer
2 weeks ago
Delhi, Delhi, India Mrsool Full timeAbout UsAt Mrsool, we are revolutionizing the delivery experience by providing unparalleled convenience and flexibility to our customers. Our mission is to empower users to get what they need, when they need it, through our seamless and user-centric platform. As a key member of our team, you will play a critical role in ensuring the stability and reliability...
-
Engineering Director
4 weeks ago
Delhi, Delhi, India Hyatt Corporation Full timeJob SummaryThe Engineering Manager will be responsible for the efficient operation of the Engineering Department in support of all other operating departments. The successful candidate will assist the Director of Engineering in ensuring the department's goals are met while maintaining a high level of service quality.Key ResponsibilitiesAssist the Director of...
-
Senior Sales Director
2 weeks ago
Delhi, Delhi, India Oracle Full timeAt Oracle, we are empowering businesses to turn untapped potential into real business value. We are looking for a Senior Sales Director to lead and manage the West region sales team, focusing on cloud technology. The ideal candidate will have 15+ years of experience in leading teams in the IT sector, with a proven track record of delivering business...
-
Senior Site Reliability Engineer
3 weeks ago
Delhi, Delhi, India Norstella Full timeSite Reliability Engineer Job DescriptionAt Norstella, we're on a mission to improve patient access to lifesaving therapies. As a Site Reliability Engineer, you'll play a critical role in empowering our users with a rich feature set, high availability, and stellar performance.About the Role:We're looking for a motivated, driven, and passionate Site...
-
Site Reliability Engineer
2 weeks ago
Delhi, Delhi, India Vionsys IT Solutions India Pvt. Ltd Full timeJob Title: Site Reliability EngineerJob Summary:As a Site Reliability Engineer at Vionsys IT Solutions India Pvt. Ltd, you will play a crucial role in maintaining and enhancing the security, stability, scalability, and cost-effectiveness of our systems. You will leverage your expertise in tools like Terraform, Ansible, Kubernetes, and AWS to build and manage...
-
Cloud Infrastructure Engineer
2 weeks ago
Delhi, Delhi, India Tata Consultancy Services Full timeGreetings from TCS.Tata Consultancy Services is hiring for a Site Reliability Engineer. Key responsibilities include collaborating with cloud platform engineers to design, develop, and implement solutions in Azure, as well as understanding service level indicators to proactively resolve issues.Key Skills:Site Reliability EngineerCloud Platform EngineerAzure...
-
SRE Engineer
2 weeks ago
Delhi, Delhi, India mccainfood Full timeJob SummaryWe are seeking a highly skilled Site Reliability Engineer (SRE) to join our team at McCain Foods. As an SRE, you will be responsible for ensuring the reliability, scalability, and performance of our cloud and infrastructure systems.Key ResponsibilitiesCollaborate with cross-functional teams to design, develop, and deploy scalable and reliable...
-
Senior Cloud Infrastructure Engineer
5 days ago
Delhi, Delhi, India Tata Consultancy Services Full timeAbout the RoleWe are seeking an experienced Senior Cloud Infrastructure Engineer to join our team at Tata Consultancy Services. This is a unique opportunity to leverage your technical expertise and passion for cloud computing to design, deploy, and manage enterprise-level Citrix/Azure infrastructure.Job DescriptionThis role requires a strong background in...
-
Cloud Infrastructure Engineer
4 weeks ago
Delhi, Delhi, India Boost-IT Full timeBoost IT is a technology consultancy company integrated into a group of entrepreneurs with investments in over 30 companies.We strive to be known for being a dynamic, energetic, and reliable company to operate in the market, and for that, we need a skilled Site Reliability Engineer (SRE) with hands-on expertise in Google Cloud Platform (GCP) and Microsoft...
-
Site Reliability Engineering Manager
3 weeks ago
Delhi, Delhi, India Mrsool Full timeAbout the RoleWe are seeking an experienced Engineering Manager to lead and grow a team of Site Reliability Engineers (SREs) at Mrsool. As a key member of our engineering organization, you will be responsible for ensuring platform stability and reliability while actively contributing to strategy, prioritization, and mission setting for the SRE team.This role...
-
Site Reliability Engineering Manager
4 weeks ago
Delhi, Delhi, India Mrsool Full timeAbout MrsoolMrsool is a leading on-demand delivery platform in the Middle East and North Africa region, known for its seamless user experience and high ratings on major app stores. We're committed to providing an unparalleled 'order anything from anywhere' experience, empowering users to get what they need when they need it.The JobWe're seeking an...