Principal Site Reliability Engineer

4 weeks ago


Hyderabad, India Microsoft Full time
Overview

Every minute of every day, customers stake their entire business and reputation on the Microsoft Cloud. The Azure Customer Experience (CXP) team believes that when we meet our high standards for quality and reliability, our customers win. If we falter, our customers fail their end-customers. Our vision is to turn Microsoft Cloud customers into fans.We are customer obsessed problem-solvers. We orchestrate deep engagements in areas like incident management, support and enablement. We analyze and amplify those customer voices, both within our own team, and across the Cloud + AI team, bringing the customer connection to the Quality vision for Azure. We innovate ways to scale what we learn across our customer base. Diversity and inclusion are central to who we are, how we work, and what we enable our customers to achieve. We know that empowering our customers starts with empowering our team to show up authentically, work in ways that are best for them, and achieve their career goals.Would you like to join one of the fastest-growing teams within Microsoft Azure Engineering? Are you constantly customer-obsessed, and focused on enhancing customer experience? Are you passionate about cloud computing and love the challenge of solving the most complex technical problems? Are you interested in a start-up like environment, passionate about building automations, observability, proactive & SLO monitoring experiences?Our organization is looking for you, a customer obsessed Principal Site Reliability Engineer with extensive experience in implementing Service Level Objectives (SLOs) monitoring solutions to top Azure customers. As a key member of our Observability team, you will play a critical role in ensuring the reliability, availability, and performance of customer applications hosted in Microsoft Azure. You will be responsible for designing, implementing, and maintaining robust SLO monitoring systems to track and meet the service level objectives defined in our offerings, customer engagement agreements. This position is critical to the success of our team's charter and embodies our inclusive culture, growth & learning mindsets, and unwavering dedication to diversity.Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.“Customer obsession”, “measure what matters”, “no dead-ends”, “get it done”, “collaboration” “teamwork” , “whatever it takes” are few characteristics we look for in this role. We are growing fast but remain agile.

Qualifications

Experience

: At least 10+ years of experience with designing, implementing, debugging and launching commercial software products or web services. 3+ years of SRE experience in cloud - Azure (or AWS/GCP)Degree:

Bachelor’s or master’s degree in computer engineering (or equivalent)Customer Obsession

: Passion for customers and focus on delivering the right customer experience.Growth Mindset

: Openness and ability to learn new skills and technologies in a fast-paced environment.Excellent Communication

: Must have the ability to empathize with customers and convey confidence. Able to explain highly technical issues to varied audiences. Able to prioritize and advocate customer’s needs to the proper channels. Take ownership and work towards a resolution.Technical Skills

:Proven expertise in implementing and managing Service Level Objectives (SLOs) and Service Level Indicators (SLIs) for cloud customers. Extensive experience with SLO monitoring tools and platformsAdvanced certifications in SRE or related fields.Experience in observability, SRE OpenTelemetry, Prometheus, Grafana, Dynatrace, Datadog, AzureMonitor, AI, ML#AZCXP #AZCXPACE #ACES500 #AZCXPSUPPORT, #AzureCXP

Responsibilities

Responsibilities include:Collaborate with customers to jointly define and establish SLOs and SLIs that align with their business goals and expectations.Instrument code to measure SLOs , develop solutions to detect SLO breachesDevelop automated solutions and troubleshooting guides to remediate or mitigate SLO breaches.Collaborate closely with service engineering teams to develop solutions for corelating customer-defined SLOs with relevant platform SLOs, signals to effectively pinpoint, address, and resolve customer-impacting issues.Ensure customer-centric SLOs are consistently exceeded through cross-functional collaboration.Analyze SLO data for trends, improvements, and reliability risks, proposing remediation plans.Proactively engage customers on SLO performance, addressing concerns and offering insights.Lead optimization efforts for system performance, scalability, and efficiency to exceed SLOs.Develop and maintain documentation related to customer-specific SLOs, SLIs, and monitoring processes.Exemplify Microsoft culture and foster a diverse, inclusive work environment.Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.Industry leading healthcareEducational resourcesDiscounts on products and servicesSavings and investmentsMaternity and paternity leaveGenerous time awayGiving programsOpportunities to network and connect

  • hyderabad, India Microsoft Full time

    Overview Every minute of every day, customers stake their entire business and reputation on the Microsoft Cloud. The Azure Customer Experience (CXP) team believes that when we meet our high standards for quality and reliability, our customers win. If we falter, our customers fail their end-customers. Our vision is to turn Microsoft Cloud customers...


  • Hyderabad, India Microsoft Full time

    Overview Every minute of every day, customers stake their entire business and reputation on the Microsoft Cloud. The Azure Customer Experience (CXP) team believes that when we meet our high standards for quality and reliability, our customers win. If we falter, our customers fail their end-customers. Our vision is to turn Microsoft Cloud customers into...


  • Hyderabad, India Microsoft Full time

    Overview Every minute of every day, customers stake their entire business and reputation on the Microsoft Cloud. The Azure Customer Experience (CXP) team believes that when we meet our high standards for quality and reliability, our customers win. If we falter, our customers fail their end-customers. Our vision is to turn Microsoft Cloud customers into...


  • hyderabad, India Microsoft Full time

    Overview Every minute of every day, customers stake their entire business and reputation on the Microsoft Cloud. The Azure Customer Experience (CXP) team believes that when we meet our high standards for quality and reliability, our customers win. If we falter, our customers fail their end-customers. Our vision is to turn Microsoft Cloud customers...


  • hyderabad, India Insight Global Full time

    Required Skills and Experience *- Bachelor's or master's degree in computer science, Software Engineering, or a related field.- Proven experience (7+ years) in SRE, automation testing- Strong skills in developing and implementing automation testing strategies and frameworks.- Solid understanding of site reliability principles and best practices.- Leadership...


  • Hyderabad, India Insight Global Full time

    Required Skills and Experience * - Bachelor's or master's degree in computer science, Software Engineering, or a related field. - Proven experience (7+ years) in SRE, automation testing - Strong skills in developing and implementing automation testing strategies and frameworks. - Solid understanding of site reliability principles and best practices. -...


  • Hyderabad, India Insight Global Full time

    Required Skills and Experience *- Bachelor's or master's degree in computer science, Software Engineering, or a related field.- Proven experience (7+ years) in SRE, automation testing- Strong skills in developing and implementing automation testing strategies and frameworks.- Solid understanding of site reliability principles and best practices.- Leadership...


  • hyderabad, India Virtusa Full time

    Site Reliability engineer - CREQ188641 Description Position : SRE Primary skills: devops CI/CD pipeline Location: Hyderabad Should have proficiency in understanding of application monitoring stack(Logs, Events, Metrics and Alerts) and ability to visualize and setup end-to-end observability.Should have proficiency in industry standard monitoring...


  • Hyderabad, India Virtusa Full time

    Site Reliability engineer - CREQ188641 Description Position : SRE Primary skills: devops CI/CD pipeline Location: Hyderabad Should have proficiency in understanding of application monitoring stack(Logs, Events, Metrics and Alerts) and ability to visualize and setup end-to-end observability. Should have proficiency in industry standard monitoring tools...


  • hyderabad, India Virtusa Full time

    Site Reliability engineer - CREQ188641 Description Position : SRE Primary skills: devops CI/CD pipeline Location: Hyderabad Should have proficiency in understanding of application monitoring stack(Logs, Events, Metrics and Alerts) and ability to visualize and setup end-to-end observability.Should have proficiency in industry standard monitoring...


  • Hyderabad, India Virtusa Full time

    Site Reliability engineer - CREQ188641 Description Position : SRE Primary skills: devops CI/CD pipeline Location: Hyderabad Should have proficiency in understanding of application monitoring stack(Logs, Events, Metrics and Alerts) and ability to visualize and setup end-to-end observability. Should have proficiency in industry standard monitoring tools...


  • hyderabad, India Korn Ferry Full time

    Role - Site Reliability EngineerExp - 5+ years RequiredLocation - Hyderabad ( Work from Office-Hybrid)Shift Timings - 5AM -1 PM ISTWe are looking for a Site Reliability Engineer with strong development background to join our team. In this role, you will be responsible for ensuring the reliability and performance of our systems. You will work closely to our...


  • Hyderabad, India Snaphunt Full time

    The OfferWork within a company with a solid track record of successGreat work environmentAttractive salary & benefitsThe Job You will be responsible for : Gathering and evaluating user feedback.Providing code documentation and other inputs to technical documents.Supporting continuous improvement by investigating alternatives and new technologies and...


  • hyderabad, India Snaphunt Full time

    The Offer Work within a company with a solid track record of success Great work environment Attractive salary & benefits The Job You will be responsible for : Gathering and evaluating user feedback. Providing code documentation and other inputs to technical documents. Supporting continuous improvement by investigating alternatives and new technologies...


  • hyderabad, India Microsoft Full time

    Overview Are you interested in working for one of the most exciting products at Microsoft, passionate about exceeding customer expectations and advancing Microsoft's cloud first strategy? Are you interested in a start-up like the environment, passionate about cloud computing technology and driving growth in one of Microsoft's core businesses? If...


  • Hyderabad, India Microsoft Full time

    Overview Are you interested in working for one of the most exciting products at Microsoft, passionate about exceeding customer expectations and advancing Microsoft's cloud first strategy? Are you interested in a start-up like the environment, passionate about cloud computing technology and driving growth in one of Microsoft's core businesses? If so,...


  • hyderabad, India Microsoft Full time

    Overview Are you interested in working for one of the most exciting products at Microsoft, passionate about exceeding customer expectations and advancing Microsoft's cloud first strategy? Are you interested in a start-up like the environment, passionate about cloud computing technology and driving growth in one of Microsoft's core businesses? If...


  • Hyderabad, India Microsoft Full time

    Overview Are you interested in working for one of the most exciting products at Microsoft, passionate about exceeding customer expectations and advancing Microsoft's cloud first strategy? Are you interested in a start-up like the environment, passionate about cloud computing technology and driving growth in one of Microsoft's core businesses? If so,...


  • hyderabad, India Microsoft Full time

    Overview Microsoft Digital (MSD)’s  mission is to power, protect, and transform the employee experience at Microsoft across the globe. Come, build community, explore your passions, pursue your AI and ML aspirations, do your best work and be a part of the team within Microsoft’s Data Platform & Growth (DPG) organization and Experiences &...


  • Hyderabad, India Microsoft Full time

    Overview Microsoft Digital (MSD)’s  mission is to power, protect, and transform the employee experience at Microsoft across the globe. Come, build community, explore your passions, pursue your AI and ML aspirations, do your best work and be a part of the team within Microsoft’s Data Platform & Growth (DPG) organization and Experiences & Devices...