Principal Site Reliability Engineer Manager

1 week ago


Hyderabad, India Microsoft Full time
Overview

Microsoft Digital (MSD)’s mission is to power, protect, and transform the employee experience at Microsoft across the globe. Come build community, explore your passions, pursue your AI and ML aspirations, do your best work, and be a part of the team within Microsoft’s Data Platform & Growth (DPG) organization and Experiences & Devices (E+D) division. Microsoft Digital (MSD) is the team that innovates, creates, and delivers the vision for Microsoft’s employee experience, human resources, corporate and legal affairs, and global real estate products; runs Microsoft’s internal network and infrastructure; and builds campus modernization and hybrid solutions. You will leverage Gen AI, AI, ML, and other current technologies to focus on empowering Microsoft employees with the tools and services that define both the physical and digital future of work.

Microsoft’s mission is to empower every person and every organization on the planet to achieve more, and we’re dedicated to this mission across every aspect of our company. Our culture is centered on embracing a growth mindset and encouraging teams and leaders to bring their best each day. Join us and help shape the future of the world.

The Site Reliability Engineering (SRE) team provides leadership, direction, and accountability for application architecture, system design, and end-to-end implementation. As Senior SRE Manager, you build and develop a team to identify and deliver software improvements using expertise in software development, complexity analysis, and scalable system design. You will need strong collaboration skills to work closely with other engineering teams to ensure services and systems are highly stable and performant, meeting the expectations of our government customers and users. You provide vision and clarity to a team of SREs who build, monitor, and maintain the systems and infrastructure that ensure our customers can quickly access their data and run workloads whenever and wherever they need to.
You drive engineering excellence practices to identify service problems and areas for improvement, and we follow up by fixing those problems.

At Microsoft, we can offer you an amazing team, exciting challenges, and a fun place to work. The work environment empowers you to have a positive impact on millions of end users.

The right candidate for this job is:

  • Passionate about distributed systems and working with highly scalable services
  • Fulfilled by developing others and building a positive and collaborative team culture
  • Eager to take on new technological challenges and motivated to solve them
  • Excited about making better software and continuously improving the development, integration, and deployment processes
  • A smart, highly motivated self-starter who thrives in a bottom-up, fast-paced, highly technical environment
  • An effective collaborator, experienced in creating technical partnerships across teams
  • Passionate about excellence and efficiency in day-to-day operations

Qualifications

Required Qualifications

  • 6+ years technical experience in software engineering, network engineering, or systems administration
    OR Bachelor's Degree in Computer Science, Information Technology, or related field AND 3+ years technical experience in software engineering, network engineering, or systems administration
    OR Master's Degree in Computer Science, Information Technology, or related field AND 2+ years technical experience in software engineering, network engineering, or systems administration
  • 7+ years of Software, Site Reliability, Systems, or Service Engineering experience
  • Current software development expertise in multiple programming languages (C#, C++, Python, Java, etc.)
  • Proven experience effectively driving improvement and delivering solutions with stakeholders across all levels of an organization

Preferred Qualifications

  • Exposure to AIOps and automation at scale
  • 3+ years technical experience working with large-scale cloud or distributed systems
  • 3+ years people management experience
  • Experience designing, building, servicing, and driving ongoing improvement of service infrastructure and systems
  • Proven track record of improving the reliability, availability, and performance of cloud services
  • Technical understanding of network as code and automation, as well as AIOps in the network space

Non-Technical Skills

  • Problem solving: the ability to clearly understand problems and decompose them into smaller problems, with the technical articulation skills to make it easy for the team to solve them collectively
  • Creative, curious/inquisitive, and associative thinking: a desire to go beneath the surface of the problem, find the questions (‘how’, ‘what’, ‘why’, ‘when’) at its heart, and distill them into a very clear set of hypotheses that can be tested and validated
  • Ability to work both independently and collectively in a fun team environment with minimal supervision
  • Good communication and stakeholder management skills
  • High capacity to learn and adapt to new technologies and engineering processes quickly

Responsibilities

  • Uphold a high organizational standard of employee and team satisfaction.
  • Manage stakeholders across a large, distributed systems ecosystem.
  • Bring expertise in escalation management and drive resource allocation for a large-scale, high-criticality operations ecosystem.
  • Stay agile, adapting to changing requirements and aligning investments with priorities.
  • Provide technical leadership to a team of highly passionate and skilled engineers.
  • Recruit, onboard, and grow a team of Software Engineers focused on Site Reliability.
  • Build, run, and improve critical public-sector service environments.
  • Coordinate planning and execution with internal engineering teams, business partners, and technical leaders across the division.
  • Own deployment, availability, reliability, performance, and customer escalation targets for these environments.
  • Proactively identify and reduce issues through the design, testing, and implementation of software.
  • Guide and mentor a team of SREs and drive best-in-class systems engineering practices.
  • Architect and review designs for large-scale integrated systems.
  • Act as a guiding force for creating and maintaining large-scale architectures, looking after the end-to-end experience of the customer, and otherwise exercising excellent engineering judgment at a level above isolated methods.
  • Drive a culture of creating resilient architectures capable of surviving the failure of any individual component, and of painstakingly reconstructing the causal chain of an outage to figure out how the system can be improved.
  • Identify efficient operations practices and drive a culture of automating repetitive tasks with scripting or applications.
  • Bring experience in process automation.
  • Bring an in-depth understanding of application infrastructure, the middleware tier, microservices, and system integrations.
  • Knowledge of AIOps implementation is a plus.

Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.

  • Industry leading healthcare
  • Educational resources
  • Discounts on products and services
  • Savings and investments
  • Maternity and paternity leave
  • Generous time away
  • Giving programs
  • Opportunities to network and connect
