Site Reliability Engineer II
1 week ago
The Production Engineering and Artificial Intelligence (AI) Group, part of the Linux Systems Group within Microsoft, plays a critical role in powering Azure Cloud. This team ensures that Azure operates with the latest version of Linux software at the highest levels of quality and performance, serving as the gatekeeper for production software. The team achieves this at Azure scale through efficient automation and by leveraging artificial intelligence to reduce the human effort required for these responsibilities. This is an excellent opportunity to join the Production Engineering and AI Group and contribute to the growth of Microsoft's Azure Cloud infrastructure.
As a Site Reliability Engineer II, you will be responsible for ensuring that software deployments follow safe rollout processes while driving operational excellence. You will leverage technical expertise, telemetry analysis, and advanced artificial intelligence to maintain reliability and performance across large-scale systems.
Microsoft's mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
- Independently write code or scripts that automate the performance of scalable operations processes (e.g., monitoring, alerting, deploying products and updates) across components and features of products.
- Create, test and deploy changes through a safe deployment process (SDP) and improve the observability, security, reliability and operability of the systems operating at hyper scale.
- Use tools and processes to troubleshoot problems affecting the availability, security, reliability, performance of components, leveraging the AI capabilities
- Enable the team to increase the velocity in which changes can reliably and safely deployed in production and monitors the effects of these changes.
- Respond to incidents during regular on-call rotations and take appropriate action to mitigate impact. You will develop alerts and automated monitoring infrastructure to notify degradation in performance or availability and draw insights from this data to manage infrastructure in an optimal way
Required Qualifications:
- 4+ years technical experience in software engineering, network engineering, or systems administration
- OR Bachelor's Degree in Computer Science, Information Technology, or related field AND 1+ year(s) technical experience in software engineering, network engineering, or systems administration
- OR Master's Degree in Computer Science, Information Technology, or related field.
1+ years experience in Cloud Infrastructure and Data Center Expertise
- Managing public cloud infrastructure or large-scale data center setups.
- Site Reliability Engineering (SRE) principles.
- Safe deployment practices in hyper-scale data centers.
- Distributed systems designed for high availability and incident handling protocols.
1+ years experience in Programming and Automation Skills
- Python and Bash or PowerShell scripting and advances in cloud technologies.
Other Qualifications:
- Ability to meet Microsoft, customer and/or govenment security screening requirements are required for this role.
These requirements include, but are not limited to the following specialized security screenings:- Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter.
Preferred Qualifications:
- 5+ years technical experience in software engineering, network engineering,
- OR systems administration OR Bachelor's Degree in Computer Science, Information Technology,
- OR related field AND 2+ years technical experience in software engineering, network engineering,
- OR systems administration
- OR Master's Degree in Computer Science, Information Technology,
- OR related field AND 1+ year(s) technical experience in software engineering, network engineering,
- 1+ year(s) people management experience.
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request via the Accommodation request form.
Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.
#azurecorejobs
-
Site Reliability Engineer II
3 days ago
Bengaluru, Karnataka, India Microsoft Full time ₹ 8,00,000 - ₹ 24,00,000 per yearThe Production Engineering and Artificial Intelligence (AI) Group, part of the Linux Systems Group within Microsoft, plays a critical role in powering Azure Cloud. This team ensures that Azure operates with the latest version of Linux software at the highest levels of quality and performance, serving as the gatekeeper for production software. The team...
-
Site Reliability Engineer II
1 week ago
Bengaluru, Karnataka, India JPMorganChase Full time US$ 80,000 - US$ 1,20,000 per yearDescriptionPlay a key role in ensuring system reliability at one of the world's most iconic and largest financial institutions.As a Site Reliability Engineer II at JPMorgan Chase within the Corporate Technology, Finance Last Mile Reporting team, you will use technology to solve business problems and leverage software engineering best practices as we strive...
-
Site Reliability Engineer
5 days ago
Bengaluru, Karnataka, India d416f97b-2589-437a-8e64-3348cfe4008b Full time ₹ 12,00,000 - ₹ 36,00,000 per yearHiring Site Reliability EngineersExp : 2.5 +years [Excluding internship]Location : BangaloreApply Here : The engineer will work in the Reliability and Productivity Engineering team and is responsible for building industry standard large scale platforms to be utilised across FK that helps to significantly improve the reliability of systems and bring...
-
Site Reliability Engineer II
3 days ago
Bengaluru, Karnataka, India CME Group Full timeCME Group is the world's leading and most diverse derivatives marketplace, offering futures and options across a wide range of industries. We are seeking a passionate SRE to join our dynamic team.The Application Site Reliability Engineer II will help ensure the reliability and performance of our Markets trading and real-time post-trade systems; systems where...
-
Site Reliability Engineer II
2 weeks ago
Bengaluru, Karnataka, India UiPath Full time ₹ 15,00,000 - ₹ 25,00,000 per yearLife at UiPath The people at UiPath believe in the transformative power of automation to change how the world works. We're committed to creating category-leading enterprise software that unleashes that power. To make that happen, we need people who are curious, self-propelled, generous, and genuine. People who love being part of a fast-moving,...
-
Site Reliability Engineer II
2 days ago
Bengaluru, Karnataka, India CME Group Full time ₹ 8,00,000 - ₹ 12,00,000 per yearDescription:CME Group is seeking a SRE II to help, build, operate and scale systems in our Markets portfolio. Markets SREs work on products and applications related to CME's Globex trading platform. Our systems deliver an exceptional combination of low-latency performance and rock-solid reliability to seamlessly handle the world's busiest trading days.The...
-
Site Reliability Engineer
7 days ago
Bengaluru, Karnataka, India Ivanti Full time ₹ 8,00,000 - ₹ 24,00,000 per yearAre you ready to help elevate the reliability and performance of cloud services for global enterprise clients? Join Ivanti's growing Site Reliability Engineering (SRE) team and play a vital role in deploying, automating, and securing SaaS solutions trusted by organizations worldwide. If you thrive in a collaborative, fast-paced environment and love solving...
-
Site Reliability Engineer
2 weeks ago
Bengaluru, Karnataka, India Warner Bros. Discovery Full time ₹ 15,00,000 - ₹ 25,00,000 per yearWelcome to Warner Bros. Discovery the stuff dreams are made of.Who We AreWhen we say, the stuff dreams are made of," we're not just referring to the world of wizards, dragons and superheroes, or even to the wonders of Planet Earth. Behind WBD's vast portfolio of iconic content and beloved brands, are the storytellers bringing our characters to life, the...
-
Site Reliability Engineer
2 weeks ago
Bengaluru, Karnataka, India AppHelix Full time ₹ 9,00,000 - ₹ 12,00,000 per yearRole DescriptionThis is a full-time on-site role located in Bengaluru for a Site Reliability Engineer. The Site Reliability Engineer will be responsible for maintaining and improving the reliability of AppHelix's systems. Daily tasks include monitoring system performance, troubleshooting issues, managing infrastructure, and supporting software development....
-
Site Reliability Engineering
4 days ago
Bengaluru, Karnataka, India Thakral One Full time US$ 60,000 - US$ 1,20,000 per yearCompany DescriptionThakral One, headquartered in Singapore, is a technology consulting and services company with a strong presence across Asia. The company specializes in technology-driven consulting, custom solution development, data analytics, and leveraging cloud capabilities to deliver enhanced decision support and practical outcomes. Collaborating...