
Deputy Director Azure SRE
2 days ago
Overview
We are seeking a highly motivated and experienced Manager of Site Reliability Engineering (SRE) to lead our Azure-focused SRE team. The ideal candidate will combine technical expertise in Azure cloud services with strong leadership skills to ensure the reliability, scalability, and performance of our applications and infrastructure. As a manager, you will oversee a team of SREs, driving automation, incident management, and operational excellence while collaborating with cross-functional teams to achieve business goals.
Responsibilities
Key Responsibilities
• Team Leadership and Development:
o Lead, mentor, and grow a team of SREs, fostering a culture of collaboration, continuous learning, and operational excellence.
o Define team goals, metrics, and performance objectives aligned with organizational priorities.
• Operational Reliability:
o Ensure the reliability, availability, and performance of Azure-hosted services through proactive monitoring and alerting.
o Develop and enforce best practices for incident response, root cause analysis, and postmortem reporting.
o Establish SLAs, SLOs, and error budgets in collaboration with product and engineering teams.
• Automation and Tooling:
o Drive the adoption of automation tools to reduce manual operational tasks and improve system reliability.
o Implement Infrastructure as Code (IaC) principles using tools such as Terraform, ARM templates, or Bicep for Azure resources.
• Performance and Scalability:
o Optimize system performance, capacity planning, and scalability to support growth and evolving business needs.
o Leverage Azure services such as Azure Monitor, Application Insights, and Log Analytics to gain insights into system health.
• Collaboration and Stakeholder Management:
o Partner with development, product, and infrastructure teams to align on technical strategies and priorities.
o Communicate operational health, risks, and opportunities to executive stakeholders.
• Risk and Security Management:
o Ensure compliance with security best practices, standards, and policies within Azure environments.
o Identify and mitigate risks related to cloud infrastructure and applications.
Qualifications
• Minimum 11-14 Y of experience
• Bachelor’s degree in Computer Science, Engineering, or a related field (or equivalent experience).
• 8+ years of experience in cloud-based infrastructure and operations, with at least 3 years in a leadership role.
• Strong expertise in Microsoft Azure services, including compute, storage, networking, security, and monitoring tools.
• Proven experience in managing and scaling infrastructure using SRE principles.
• Proficiency in automation and scripting (e.g., Python, PowerShell) and Infrastructure as Code (e.g., Terraform, ARM templates).
• Hands-on experience with CI/CD pipelines and DevOps practices.
• Strong understanding of incident management, change management, and ITIL practices.
• Minimum 11-14 Y of experience
• Bachelor’s degree in Computer Science, Engineering, or a related field (or equivalent experience).
• 8+ years of experience in cloud-based infrastructure and operations, with at least 3 years in a leadership role.
• Strong expertise in Microsoft Azure services, including compute, storage, networking, security, and monitoring tools.
• Proven experience in managing and scaling infrastructure using SRE principles.
• Proficiency in automation and scripting (e.g., Python, PowerShell) and Infrastructure as Code (e.g., Terraform, ARM templates).
• Hands-on experience with CI/CD pipelines and DevOps practices.
• Strong understanding of incident management, change management, and ITIL practices.
Key Responsibilities
• Team Leadership and Development:
o Lead, mentor, and grow a team of SREs, fostering a culture of collaboration, continuous learning, and operational excellence.
o Define team goals, metrics, and performance objectives aligned with organizational priorities.
• Operational Reliability:
o Ensure the reliability, availability, and performance of Azure-hosted services through proactive monitoring and alerting.
o Develop and enforce best practices for incident response, root cause analysis, and postmortem reporting.
o Establish SLAs, SLOs, and error budgets in collaboration with product and engineering teams.
• Automation and Tooling:
o Drive the adoption of automation tools to reduce manual operational tasks and improve system reliability.
o Implement Infrastructure as Code (IaC) principles using tools such as Terraform, ARM templates, or Bicep for Azure resources.
• Performance and Scalability:
o Optimize system performance, capacity planning, and scalability to support growth and evolving business needs.
o Leverage Azure services such as Azure Monitor, Application Insights, and Log Analytics to gain insights into system health.
• Collaboration and Stakeholder Management:
o Partner with development, product, and infrastructure teams to align on technical strategies and priorities.
o Communicate operational health, risks, and opportunities to executive stakeholders.
• Risk and Security Management:
o Ensure compliance with security best practices, standards, and policies within Azure environments.
o Identify and mitigate risks related to cloud infrastructure and applications.
-
Deputy Director Azure SRE
2 weeks ago
Hyderabad, Telangana, India Pepsico Full time ₹ 1,04,000 - ₹ 1,30,878 per yearOverview We are seeking a highly motivated and experienced Manager of Site Reliability Engineering (SRE) to lead our Azure-focused SRE team. The ideal candidate will combine technical expertise in Azure cloud services with strong leadership skills to ensure the reliability, scalability, and performance of our applications and infrastructure. As a manager,...
-
Deputy Director Azure SRE
2 weeks ago
Hyderabad, Telangana, India PepsiCo Full time ₹ 15,00,000 - ₹ 20,00,000 per yearOverviewWe are seeking a highly motivated and experienced Manager of Site Reliability Engineering (SRE) to lead our Azure-focused SRE team. The ideal candidate will combine technical expertise in Azure cloud services with strong leadership skills to ensure the reliability, scalability, and performance of our applications and infrastructure. As a manager, you...
-
Deputy Director Azure SRE
57 minutes ago
Hyderabad, India PepsiCo Full timeOverviewWe are seeking a highly motivated and experienced Manager of Site Reliability Engineering (SRE) to lead our Azure-focused SRE team. The ideal candidate will combine technical expertise in Azure cloud services with strong leadership skills to ensure the reliability, scalability, and performance of our applications and infrastructure. As a manager, you...
-
Deputy Director Azure SRE
33 minutes ago
Hyderabad, India Pepsico Full timeOverview We are seeking a highly motivated and experienced Manager of Site Reliability Engineering (SRE) to lead our Azure-focused SRE team. The ideal candidate will combine technical expertise in Azure cloud services with strong leadership skills to ensure the reliability, scalability, and performance of our applications and infrastructure. As a...
-
Deputy Director Azure SRE
2 weeks ago
Hyderabad, Telangana, India Pepsico Full timeOverviewWe are seeking a highly motivated and experienced Manager of Site Reliability Engineering (SRE) to lead our Azure-focused SRE team. The ideal candidate will combine technical expertise in Azure cloud services with strong leadership skills to ensure the reliability, scalability, and performance of our applications and infrastructure. As a manager, you...
-
Azure SRE
3 weeks ago
Hyderabad, Telangana, India LTIMindtree Full timeJob Title: Azure SREExperience: 6 to 8 Years OnlyLocation: Pune/Hyderabad OnlyHybrid ModeNotice Period: Immediate to 30 days maxFull-time employment with LTIMindtreeMandatory Skills: Azure SREThanks & Regards,Prabal PandeyPrabal.Pandey@alphacom.in
-
Only 24h Left: Deputy Director Azure SRE
2 weeks ago
Hyderabad, Telangana, India PepsiCo Full timeJob DescriptionOverviewWe are seeking a highly motivated and experienced Manager of Site Reliability Engineering (SRE) to lead our Azure-focused SRE team. The ideal candidate will combine technical expertise in Azure cloud services with strong leadership skills to ensure the reliability, scalability, and performance of our applications and infrastructure. As...
-
Azure SRE
2 weeks ago
Hyderabad, Telangana, India LTIMindtree Full timeJob Title: Azure SRE Experience: 6 to 8 Years Only Location: Pune/Hyderabad Only Hybrid Mode Notice Period: Immediate to 30 days max Full-time employment with LTIMindtree Mandatory Skills: Azure SRE Thanks & Regards, Prabal Pandey
-
Azure SRE
1 week ago
Hyderabad, Telangana, India LTIMindtree Full timeJob Title: Azure SREExperience: 6 to 8 Years OnlyLocation: Pune/Hyderabad OnlyHybrid ModeNotice Period: Immediate to 30 days maxFull-time employment with LTIMindtreeMandatory Skills: Azure SREThanks & Regards,Prabal PandeyPrabal.Pandey@alphacom.in
-
Azure AI_ML SRE
1 week ago
Bengaluru, Hyderabad, India NTT DATA, Inc. Full time ₹ 15,00,000 - ₹ 28,00,000 per yearProvide Azure infrastructure Sustain/Operations supportManage all activities of SRE- Site Reliability EngineerSupport for deployments/change requests requiring SRE assistance365 Days coverage and 1st or 2nd shiftOverall, 6to 8Years of experience with working knowledge of SRE with Strong Databricks and Terraform as musts3+ years of Experience with Azure...