Site Reliability Engineer
2 days ago
Microsoft is a company where passionate innovators come to collaborate, envision what can be, and take their careers further. This is a world of more possibilities, more innovation, more openness, and sky's-the-limit thinking in a cloud-enabled world.
Microsoft's Azure Data engineering team is leading the transformation of analytics in the world of data with products like databases, data integration, big data analytics, messaging & real-time analytics, and business intelligence. The products in our portfolio include Microsoft Fabric, Azure SQL DB, Azure Cosmos DB, Azure PostgreSQL, Azure Data Factory, Azure Synapse Analytics, Azure Service Bus, Azure Event Grid, and Power BI. Our mission is to build the data platform for the age of AI, powering a new class of data-first applications and driving a data culture.
Within Azure Data, the databases team builds and maintains Microsoft's operational database systems. We store and manage data in a structured way to enable a multitude of applications across various industries. We are on a journey to enable developer-friendly, mission-critical, AI-enabled operational databases across relational, non-relational, and OSS offerings.
We are hiring a Software Engineer 2 to join the Azure Cosmos DB team, where you will be working on a large-scale distributed operational database. In this role, you will work on distributed systems problems and technologies to help determine the future of our planet-scale database.
We do not just value differences or different perspectives. We seek them out and invite them in so we can tap into the collective power of everyone in the company. As a result, our customers are better served.
Responsibilities
- Operational Efficiency: Lead the design of systems and solutions at org scale, streamlining processes and enhancing efficiency.
- AIOps: Use AI tools and agents to improve SLOs/SLAs and reduce toil.
- Monitoring/Observability Architecture: Develop and implement monitoring agents, dashboards, escalations, and alerts to proactively manage and improve service reliability.
- Incident Management: Participate in a distributed on-call rotation, drive root cause analysis during outages, and write and review postmortems to continuously improve our services and practices.
- Team Growth: Advocate for SRE best practices, work independently, and help grow the SRE team by onboarding and mentoring new teammates.
- Embody our culture and values.
Qualifications
Required/Minimum Qualifications
- Hands-On Experience: Demonstrate 3-8 years of practical experience in site reliability engineering within large-scale commercial software organizations.
- Proficiency in coding languages (such as Python, .NET).
- Live Site Troubleshooting: Adept at troubleshooting live site issues and providing guidance to engineering teams to resolve them promptly.
- Cloud Proficiency: Possess a good understanding of public cloud offerings such as Azure, Google Cloud, or AWS.
- Distributed Systems: Experience with distributed systems and microservice-based architectures.
- Performance Analysis: Conduct in-depth analysis of web application performance, identifying bottlenecks and areas for improvement. Utilize various monitoring tools and performance profiling techniques to diagnose and troubleshoot performance issues.
Other Requirements
Ability to meet Microsoft, customer, and/or government security screening requirements is required for this role. These requirements include, but are not limited to, the following specialized security screenings: Microsoft Cloud Background Check:
- This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.
Microsoft is an equal opportunity employer. Consistent with applicable law, all qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.