Cloud Reliability Engineering Director

2 weeks ago


Delhi, Delhi, India ViewSonic Full time
Job Summary:
We're seeking an experienced Site Reliability Engineering (SRE) Manager to lead our SRE team in India.

The SRE Manager will manage the team's day-to-day operations, drive strategic initiatives, and ensure system reliability and performance.

This role requires a strong technical background, leadership experience, and a strategic mindset to align the SRE team's efforts with business objectives.

The role also includes people management responsibilities for a team of 10+ SREs.

Key Responsibilities:
Lead and manage the SRE team in India, ensuring high availability and reliability of critical services and infrastructure.
Develop, implement, and manage SRE practices such as monitoring, incident management, capacity planning, and service level objectives (SLOs).
Collaborate with global SRE teams to define and align on SRE best practices, incident response, and service reliability strategies.
Oversee and manage the performance of systems in production and non-production environments, ensuring proactive monitoring, alerting, and capacity management.

Drive initiatives to enhance system reliability and performance, including automation, Infrastructure as Code (IaC), and continuous integration/continuous deployment (CI/CD) improvements.

Manage 24/7 on-call rotations and ensure the team is equipped to handle incidents and provide timely responses and escalations.

Partner with engineering, DevOps, and security teams to ensure seamless integration of SRE practices and security compliance across the organization.

Mentor and coach team members to develop their skills and promote a culture of continuous improvement.
Prepare and present regular updates on system performance, incidents, and project statuses to leadership and key stakeholders.
Ensure cost-efficient utilization of resources and drive infrastructure cost optimization initiatives.
Handle people management responsibilities, including recruitment, onboarding, performance reviews, and overall team development.

Requirements:

Education and Experience:
Bachelor's or Master's degree in Computer Science, Information Technology, or a related field.

12+ years of experience in IT or software engineering, with at least 6 years in SRE, DevOps, or infrastructure management roles.

Proven experience in leading and managing SRE, DevOps, or infrastructure teams, preferably in a global or multi-regional setting.
Prior experience managing and mentoring a team of 5-10 or more engineers, with strong people management skills.

Technical Skills:
Strong understanding of cloud platforms such as AWS, GCP, or Azure, with experience in cloud-native architectures and services.
Hands-on experience with automation tools, Infrastructure as Code (Terraform, CloudFormation), CI/CD pipelines, and configuration management tools (e.g., Ansible, Puppet).
Expertise in monitoring and observability tools (e.g., Prometheus, Grafana, Datadog, ELK stack).
Familiarity with containerization and orchestration technologies (e.g., Docker, Kubernetes).
Strong programming/scripting skills (Python, Go, Bash, Node.js, or similar).

Leadership and Soft Skills:
Proven ability to lead, mentor, and develop high-performing teams of at least 10 engineers.
Excellent problem-solving and analytical skills, with a strong focus on system reliability and operational excellence.
Strong communication and collaboration skills, with experience working across multiple teams and stakeholders.
Ability to navigate ambiguity and drive strategic initiatives with limited resources and information.

Additional Qualifications:
Experience with security practices and compliance standards (ISO 27001, SOC 2, etc.) is a plus.
Experience with cost management and optimization strategies in cloud environments.
Previous experience managing incident response and on-call rotations for a 24/7 support environment.

  • Delhi, Delhi, India Boost-IT Full time

    About the Role: We are seeking an experienced Site Reliability Engineer to join our team at Boost-IT. As a key member of our technical leadership team, you will be responsible for ensuring the reliability and scalability of our cloud infrastructure on GCP and Azure. You will lead and guide our team technically, providing technical guidance and mentorship while...


  • Delhi, Delhi, India Cloud Destinations Full time

    Lead DevOps Engineer. About the Role: We are seeking a highly skilled and experienced Lead DevOps Engineer to join our dynamic team at Cloud Destinations. Job Overview: In this role, you will be responsible for leading and driving our DevOps initiatives, automating infrastructure, and ensuring the reliability and scalability of our applications. Our ideal...

  • Reliability Engineer

    3 weeks ago


    Delhi, Delhi, India mccainfood Full time

    Job Title: SRE Engineer. Job Summary: We are seeking a highly skilled SRE Engineer to join our team. The successful candidate will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure. Key Responsibilities: Design, implement, and maintain scalable and reliable cloud infrastructure. Develop and maintain...


  • Delhi, Delhi, India mccainfood Full time

    Job Title: Site Reliability Engineer. Job Summary: We are seeking a highly skilled Site Reliability Engineer to join our team at McCain Foods. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and resilience of our systems and infrastructure. This is a critical role that requires a strong understanding of software development,...


  • Delhi, Delhi, India Boost IT Full time

    Job Title: Site Reliability Engineering Manager. Boost IT is a company that brings people together to solve complex problems and create innovative solutions. We are passionate about technology and its potential to make a positive impact on society. We are seeking a talented Site Reliability Engineering Manager to join our team. The ideal candidate will have a...


  • Delhi, Delhi, India mccainfood Full time

    Job Summary: As a Site Reliability Engineer at McCain Foods, you will play a crucial role in ensuring the reliability and resilience of our systems and applications. Your primary responsibility will be to design, build, and maintain scalable and efficient infrastructure to support our business growth. Key Responsibilities: Collaborate with cross-functional...


  • Delhi, Delhi, India SourceBae Full time

    Job Title: Senior Data Engineering Director. About SourceBae: We are a forward-thinking organization dedicated to harnessing the power of data. Our mission is to drive innovation through cutting-edge technology and expertise. Salary: ₹2500000 - ₹3500000 per annum, depending on experience. Job Description: We are seeking an experienced Senior Data Engineering...


  • Delhi, Delhi, India Boost-IT Full time

    Job Title: Cloud Infrastructure Engineer. About the Role: We are seeking an experienced Cloud Infrastructure Engineer to join our team at Boost-IT. As a key member of our technical team, you will be responsible for designing, deploying, and maintaining highly reliable and scalable cloud environments on GCP and Azure. Key Responsibilities: Design and deploy cloud...

  • Data Engineer

    2 weeks ago


    Delhi, Delhi, India Syren Cloud Inc Full time

    About the Role: We are seeking a highly skilled Data Engineer - Cloud Computing Expert to join our team at Syren Cloud Inc. In this role, you will be responsible for designing, developing, and implementing distributed applications and systems on the Azure Cloud platform. Key Responsibilities: Develop and implement databases, data collection systems, data...


  • Delhi, Delhi, India Mrsool Full time

    About Us: At Mrsool, we are revolutionizing the delivery experience by providing unparalleled convenience and flexibility to our customers. Our mission is to empower users to get what they need, when they need it, through our seamless and user-centric platform. As a key member of our team, you will play a critical role in ensuring the stability and reliability...

  • Engineering Director

    4 weeks ago


    Delhi, Delhi, India Hyatt Corporation Full time

    Job Summary: The Engineering Manager will be responsible for the efficient operation of the Engineering Department in support of all other operating departments. The successful candidate will assist the Director of Engineering in ensuring the department's goals are met while maintaining a high level of service quality. Key Responsibilities: Assist the Director of...

  • Senior Sales Director

    2 weeks ago


    Delhi, Delhi, India Oracle Full time

    At Oracle, we are empowering businesses to turn untapped potential into real business value. We are looking for a Senior Sales Director to lead and manage the West region sales team, focusing on cloud technology. The ideal candidate will have 15+ years of experience in leading teams in the IT sector, with a proven track record of delivering business...


  • Delhi, Delhi, India Norstella Full time

    Site Reliability Engineer Job Description: At Norstella, we're on a mission to improve patient access to lifesaving therapies. As a Site Reliability Engineer, you'll play a critical role in empowering our users with a rich feature set, high availability, and stellar performance. About the Role: We're looking for a motivated, driven, and passionate Site...


  • Delhi, Delhi, India Vionsys IT Solutions India Pvt. Ltd Full time

    Job Title: Site Reliability Engineer. Job Summary: As a Site Reliability Engineer at Vionsys IT Solutions India Pvt. Ltd, you will play a crucial role in maintaining and enhancing the security, stability, scalability, and cost-effectiveness of our systems. You will leverage your expertise in tools like Terraform, Ansible, Kubernetes, and AWS to build and manage...


  • Delhi, Delhi, India Tata Consultancy Services Full time

    Greetings from TCS. Tata Consultancy Services is hiring for a Site Reliability Engineer. Key responsibilities include collaborating with cloud platform engineers to design, develop, and implement solutions in Azure, as well as understanding service level indicators to proactively resolve issues. Key Skills: Site Reliability Engineer, Cloud Platform Engineer, Azure...

  • SRE Engineer

    2 weeks ago


    Delhi, Delhi, India mccainfood Full time

    Job Summary: We are seeking a highly skilled Site Reliability Engineer (SRE) to join our team at McCain Foods. As an SRE, you will be responsible for ensuring the reliability, scalability, and performance of our cloud and infrastructure systems. Key Responsibilities: Collaborate with cross-functional teams to design, develop, and deploy scalable and reliable...


  • Delhi, Delhi, India Tata Consultancy Services Full time

    About the Role: We are seeking an experienced Senior Cloud Infrastructure Engineer to join our team at Tata Consultancy Services. This is a unique opportunity to leverage your technical expertise and passion for cloud computing to design, deploy, and manage enterprise-level Citrix/Azure infrastructure. Job Description: This role requires a strong background in...


  • Delhi, Delhi, India Boost-IT Full time

    Boost IT is a technology consultancy company integrated into a group of entrepreneurs with investments in over 30 companies. We strive to be known for being a dynamic, energetic, and reliable company to operate in the market, and for that, we need a skilled Site Reliability Engineer (SRE) with hands-on expertise in Google Cloud Platform (GCP) and Microsoft...


  • Delhi, Delhi, India Mrsool Full time

    About the Role: We are seeking an experienced Engineering Manager to lead and grow a team of Site Reliability Engineers (SREs) at Mrsool. As a key member of our engineering organization, you will be responsible for ensuring platform stability and reliability while actively contributing to strategy, prioritization, and mission setting for the SRE team. This role...


  • Delhi, Delhi, India Mrsool Full time

    About Mrsool: Mrsool is a leading on-demand delivery platform in the Middle East and North Africa region, known for its seamless user experience and high ratings on major app stores. We're committed to providing an unparalleled 'order anything from anywhere' experience, empowering users to get what they need when they need it. The Job: We're seeking an...