Senior Service Engineer
1 week ago
Are you passionate about cloud computing, obsessed with customer experience, and skilled at translating complex technical issues into clear, transparent communication? Do you thrive in high-stakes, fast-paced environments and want to play a pivotal role in how Microsoft shows up for customers during moments that matter most? If so, the Azure Customer Experience (CXP) team has the opportunity for you.
Microsoft Azure is one of the most exciting and strategic products at Microsoft—powering mission-critical workloads for enterprises, governments, and startups around the world. Azure delivers on-demand, hyper-scale infrastructure and platforms via Microsoft's global data centers, enabling customers to build, host, and scale their applications with confidence.
The Customer Reliability Engineering (CRE) team within Azure CXP is a top-level pillar of Azure Engineering responsible for world-class live-site management, customer reliability engagements, and modern customer-first experiences at scale. Our "no dead-ends" philosophy ensures that every customer, regardless of size or scale, can realize their full potential through the Microsoft Cloud.
We are seeking a decisive, detail-oriented Service Engineer who will serve as the customer's voice and advocate during high-severity incidents across Microsoft Azure. While predominantly focused on livesite customers communications, this hybrid role will also support service engineering, program and project management, and continual service improvement. You will work closely with incident managers, engineering responders, and field stakeholders to shape and deliver clear, timely, and action-oriented communications during outages, security events, service retirements, and other high-impact scenarios.
This is a critical, customer-facing role requiring exceptional writing skills, calm leadership during ambiguity, and a passion for building customer trust through transparency and clarity. You'll work at the intersection of customer support, technical operations, and communications—and you'll help shape how Microsoft communicates during crises, preemptively and retrospectively.
As part of the Azure CXP CRE team, your responsibilities include:
On-call Communication Management during regular on-call rotations
• Join incident bridges and work with engineering to obtain real-time outage details.
• Understand incident scope, impact, and mitigation to translate complex technical findings into clear, professional, and decisive updates for customers and stakeholders.
• Keep communications consistent and fact-based throughout the incident; confirm information with engineering and leadership before sharing.
• Assist with publishing Public Incident Reports and RCA summaries.
• Support live site incident (LSI) operations, including triage, resolution, and post-incident analysis.
• Shares details related to incidents and their resolution through post-mortem reports and during regular review meetings.
Problem Management & Data Analytics
• Design and implement automated detection systems to identify impacted resources in real time.
• Collaborate with engineering and operations teams to enhance telemetry, monitoring, and alerting accuracy while reducing false positives.
• Develop dashboards and visualizations in Power BI and Azure Data Explorer to support data-driven insights.
• Build scalable data collection and analysis frameworks to improve service reliability and incident response.
• Participate in incident resolution workflows and provide actionable insights to drive platform and process improvements.
• Communicate technical findings and recommendations to stakeholders through clear, data-backed reporting.
Tooling & Automation
• Develop tools and analytics pipelines to automatically assess incident impact and blast radius across services, regions, and customers in real time.
• Design and maintain automation solutions that enhance incident detection, monitoring, communication, and remediation while reducing operational toil and repeat issues.
• Identify recurring problems, propose preventive solutions, and collaborate with engineers and teams to implement fixes.
• Design and orchestrate automation workflows using Microsoft Copilot Studio, Power Automate, and Azure AI Foundry.
• Build and support no-code/low-code solutions to optimize operations and improve team efficiency.
• Collaborate with product, infrastructure, and operations teams to align automation initiatives with organizational reliability and customer trust goals.
Required Qualifications
• Bachelor's degree in computer science, Information Technology, Data Science, Cybersecurity, or a related field AND 5+ years of technical experience in software engineering, network engineering, service engineering, systems engineering, or industrial controls; OR equivalent hands-on experience.
• Hands-on experience implementing AI-driven solutions and automation, with proficiency in one or more programming/automation languages (e.g., C#, Java, JavaScript, Python) or equivalent expertise is a plus.
• Certifications in cloud technologies (Azure, AWS, GCP), ITIL, or SRE frameworks are desirable.
• Strategic thinking and a customer-first mindset; able to advocate for improvements in platform transparency and experience.
• Excellent problem-solving, judgment, and decision-making skills, communication and collaboration skills.
• Understanding of SRE principles, including SLAs/SLOs, telemetry, and monitoring.
• Proven experience in cloud operations, incident & crisis management, or large-scale systems engineering ideally within platforms such as Azure, AWS, or GCP.
• Contribute to a data-driven culture as well as a culture of experimentation across the organization.
• Own and drive projects and features by working towards the team's defined goals and milestones.
• Creating prototypes and proof-of-concepts for iterative development.
• Be curious and willing to learn and grow.
Preferred:
5+ Years of demonstrated experience as an Incident Commander or Crisis Manager for critical, high-severity incidents in high-availability, distributed environments.
Experience with SRE (Site Reliability Engineering) principles and practices.
Exposure to chaos engineering, fault injection, or high availability architecture.
AI/ML Experience: [Beginner to Intermediate]
Familiarity with how AI/ML models are integrated into cloud infrastructure and their potential failure modes.
Experience using AI-powered tools for incident analysis, log correlation, or predictive alerting.
An understanding of the challenges and risks associated with AI/ML systems in a production environment.
Certifications:
Relevant cloud certifications (e.g., AWS Certified DevOps Engineer, Azure Solutions Architect, GCP Professional Cloud Architect).
Certifications in ITIL, SRE, or other relevant frameworks.
Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter.
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request via the Accommodation request form.
Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.
-
Commissioning & Service Engineer (Hyderabad)
2 weeks ago
Hyderabad, Telangana, India Gowork placement service Full time ₹ 3,00,000 - ₹ 6,00,000 per yearUrgent Looking for Commissioning & Service Engineer for Fire & safety Equipments in Hyderabad Location -Qualification - Diploma/Degree in Electrical/Electronics/InstrumentationExp - 3-4 Years in Fire & Safety Commissioning & ServicingLocation - HyderabadJob role -Perform , Testing , Commissioning & servicing of Fire & Security System such as -FIRE ALARM...
-
Senior Service Engineer
5 days ago
Hyderabad, Telangana, India Microsoft Full time ₹ 8,00,000 - ₹ 12,00,000 per yearSenior Service EngineerHyderabad, Telangana, IndiaDate postedOct 27, 2025Job number1901482Work site3 days / week in-officeTravelNoneRole typeIndividual ContributorProfessionSoftware EngineeringDisciplineService EngineeringEmployment typeFull-TimeOverviewAre you passionate about cloud computing, obsessed with customer experience, and skilled at translating...
-
Senior Service Engineer
5 days ago
Hyderabad, Telangana, India Microsoft Full time ₹ 8,00,000 - ₹ 24,00,000 per yearAre you passionate about cloud computing, obsessed with customer experience, and skilled at translating complex technical issues into clear, transparent communication? Do you thrive in high-stakes, fast-paced environments and want to play a pivotal role in how Microsoft shows up for customers during moments that matter most? If so, the Azure Customer...
-
Senior Manager Service Desk Operations, SRE
2 days ago
Hyderabad, Telangana, India Jobaaj (a Unit Of Nishtya Infotech ) Full time ₹ 15,00,000 - ₹ 25,00,000 per yearSeeking a Senior Manager to lead Service Desk Operations, Engineering,and SRE initiatives focused on modernization,automation,and AI transformation.Drive ITIL governance via ServiceNow,enhance reliability,and lead global teams for service excellence.
-
Senior Rust Engineer
2 weeks ago
Hyderabad, Telangana, India Maple Software Full time ₹ 20,00,000 - ₹ 25,00,000 per yearJob Title: Senior Rust Engineer (or) Senior Backend EngineerLocation: RemoteEmployment Type: Fulltime/ContractHeadcount: 10Where they will be used: They will form the main development force building all backend services, implementing cryptographic protocols, and ensuring the platform is secure and scalable.Responsibilities:Implement features for the CA core,...
-
Services Senior Consultant
2 days ago
Hyderabad, Telangana, India Aveva Full time ₹ 12,00,000 - ₹ 36,00,000 per yearJob Title: Services Senior ConsultantLocation: HyderabadEmployment Type: Full TimeThe jobWe are seeking a highly skilled and experienced UOC Senior Consultant to lead the design, implementation, and maintenance of Unified Operations Center (UOC) systems across complex infrastructures and multiple domains.Essential requirementsNon-Technical:Excellent written,...
-
Senior Software Engineer
2 weeks ago
Hyderabad, Telangana, India Microsoft Full time ₹ 20,00,000 - ₹ 25,00,000 per yearWith continued growth in digital data and the desire to leverage data to address problems that touch all aspects of our lives, Azure Storage is growing to meet these challenges The Azure Storage team is hiring experienced Senior Software Engineer to join agile and diverse engineering teams for deploying Data Processing Unit (DPU) technology.As a Senior...
-
Senior Data Engineer
5 days ago
Hyderabad, Telangana, India WalkingTree Technologies Full time ₹ 8,00,000 - ₹ 24,00,000 per yearCompany DescriptionFounded in 2008, WalkingTree Technologies is a pioneering IT services company specializing in digital and data solutions. We collaborate with our clients to shape and execute their data and digital strategies, unlocking their full potential. Our core strengths in Product Engineering, Digital Transformation, and Modernization create a...
-
Senior Engineering Manager
5 days ago
Hyderabad, Telangana, India Microsoft Full time ₹ 12,00,000 - ₹ 36,00,000 per yearAre you an Engineering Leader with a passion for building high scale microservices? Do you thrive on solving complex and ambiguous challenges? Are you able to generate energy throughout the teams that you lead? If so, come join us as a Senior Engineering ManagerWe are Windows Notification Services team, a part of the Windows India engineering...
-
Senior Data Engineer
1 week ago
Hyderabad, Telangana, India Unique IT Solutions, Inc Full time ₹ 15,00,000 - ₹ 25,00,000 per yearCompany DescriptionUnique IT Solutions is a professional information technology services company with deep SAP roots, providing digital transformation solutions and consulting across a wide range of industries. We are known for providing affordable SAP installations and updates utilizing a blended team of onshore and offshore resources. Our Business...