​​Senior Site Reliability Engineer​

7 months ago


Bengaluru, India Microsoft Full time

Overview

Microsoft is a company where passionate innovators come to collaborate, envision what can be and take their careers further. This is a world of more possibilities, more innovation, more openness, and the sky is the limit thinking in a cloud-enabled world.Microsoft’s Azure Data engineering team is leading the transformation of analytics in the world of data with products like databases, data integration, big data analytics, messaging & real-time analytics, and business intelligence. The products our portfolio include Microsoft Fabric, Azure SQL DB, Azure Cosmos DB, Azure PostgreSQL, Azure Data Factory, Azure Synapse Analytics, Azure Service Bus, Azure Event Grid, and Power BI. Our mission is to build the data platform for the age of AI, powering a new class of data-first applications and driving a data culture.

​​Within Azure Data, the databases team builds and maintains Microsoft's operational Database systems. We store and manage data in a structured way to enable multitude of applications across various industries. We are on a journey to enable developer friendly, mission-critical, AI enabled operational Databases across relational, non-relational and OSS offerings.​

​​Azure Cosmos DB is Microsoft’s next generation globally distributed, massively scalable, multi-model cloud database service. It is designed to enable developers to build planet-scale applications. Azure Cosmos DB is one of the fastest growing Azure services. Joining the Azure Cosmos DB team is a fantastic opportunity to work with highly talented engineers operating like a startup, and to deliver on our next set of big challenges. As a Senior Site Reliability Engineer, you will identify and deliver software improvements using your expertise in software development, complexity analysis, and scalable system design to ensure services/systems are highly stable, performant, and meeting the expectations of our customers. You will work closely with other engineering teams and provide a holistic view of our cloud service.

​​

We do not just value differences or different perspectives. We seek them out and invite them in so we can tap into the collective power of everyone in the company. As a result, our customers are better served.

Qualifications

​Bachelor's degree in computer science/Engineering/related fields or equivalent industry experience. 6+ years of experience with writing tools, automation / scripting (Powershell, Python or similar), programming (C++, C# or equivalent) and making enhancements in subcomponents within and around services/products to deliver and manage software in production. 6+ years of troubleshooting/debugging experience: telemetry-based analysis (KQL or equivalent preferred), troubleshooting skills across network, hardware, and distributed service layers, with demonstrated ability to debug, fix, and optimize code. Good communications skills, both verbal and written.

Other Requirements

Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to the following specialized security screenings: Microsoft Cloud Background Check:

This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.

Preferred/Additional Qualifications

​​Experience aiding understanding of distributed systems and networking is preferred.​

#azdat

#azuredata

​​#cosmosdb #databases​

Responsibilities

​​Identify opportunities and drive the design and implementation of end-to-end telemetry, alerting, self-healing and automation capabilities to improve service health, manageability, and reliability. Participate in on-call rotations and own, triage, investigate and resolve service issues with an emphasis on broad communications, learning & teaching throughout the process. Interact with customers / support representatives and communicate on a deeply technical level with product engineering and product management teams to evolve services. Own availability, performance, and supportability targets for the service. Author functional and technical documentation and remain current on relevant technologies and procedures. ​

Embody our and

Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.Industry leading healthcareEducational resourcesDiscounts on products and servicesSavings and investmentsMaternity and paternity leaveGenerous time awayGiving programsOpportunities to network and connect

  • Bengaluru, Karnataka, India AMEX Full time

    About the Role: We are seeking an experienced Senior Site Reliability Engineer to lead our team in delivering high-quality, reliable technology solutions. The ideal candidate will have a deep understanding of observability tools and methodologies, as well as strong leadership and people management skills. About Us: American Express is a global leader in...


  • Bengaluru, India Barracuda Full time

    Job ID: 25-251Come Join Our Passionate Team! At Barracuda, we make the world a safer place. We believe every business deserves access to cloud-enabled, enterprise-grade security solutions that are easy to buy, deploy, and use. We protect email, networks, data and applications with innovative solutions that grow and adapt with our customers’ journey. More...


  • Bengaluru, India Tech Mahindra (formerly Mahindra Satyam) Full time

    We are looking for Senior SRE who can join with us in Pan India Location.Skill - SRE with GCP platformExp - 6- 13 YrsLocation - Pan IndiaHybrid work locationAs an Off-Shore Site Reliability Engineer, you will have the following preferred skillsets:- Experience with cloud platforms such as Azure, or GCP- Proficiency in scripting languages such as Python,...


  • Bengaluru, India Tech Mahindra Full time

    We are looking for Senior SRE who can join with us in Pan India Location.Skill - SRE with GCP platformExp - 6- 13 YrsLocation - Pan IndiaHybrid work locationAs an Off-Shore Site Reliability Engineer, you will have the following preferred skillsets:- Experience with cloud platforms such as Azure, or GCP- Proficiency in scripting languages such as Python,...


  • Bengaluru, India Ushur Full time

    Location: BangaloreExperience: 6-8 YearsWork Mode: Hybrid/RemoteThe RoleSenior Site Reliability Engineers at Ushur perform a unique blend of customer support engineering, solution engineering, and operational engineering. You will work on our largest customers’ most complex problems and craft intuitive, elegant solutions. You’ll also proactively work...


  • Bengaluru, India Hirextra -World's First Staffing Aggregator Full time

    Role: Site Reliability EngineerExperiences: 6+ yearsLocation: BangaloreMode: HybridBudget: 14 LPAJob Description:Highly skilled Cloud Site Reliability Engineer to ensure high availability, reliability and performance of cloud infrastructure and services.Experience in cloud platforms (AWS, GCP), automation, monitoring, and incident management.Experience in...


  • Bengaluru, India IntraEdge Full time

    Job Title: Site Reliability Engineer (SRE)Location:Gurgaon / BangaloreJob Type:Full-TimeExperience :5+ YearsAbout the Role:We are seeking a skilled Site Reliability Engineer (SRE) to ensure the reliability, scalability, and performance of our applications. You will work with development teams on microservices architecture, cloud platforms (AWS, GCP, Azure),...


  • Bengaluru, India Hirextra -World's First Staffing Aggregator Full time

    Role: Site Reliability Engineer Experiences: 6+ yearsLocation: Bangalore Mode: HybridBudget: 14LPA Job Description: Highly skilled Cloud Site Reliability Engineer to ensure high availability, reliability and performance of cloud infrastructure and services. Experience in cloud platforms (AWS, GCP), automation, monitoring, and incident management. ...


  • Bengaluru, India Hirextra -World's First Staffing Aggregator Full time

    Role: Site Reliability Engineer Experiences: 6+ yearsLocation: Bangalore Mode: HybridBudget: 14LPA Job Description: Highly skilled Cloud Site Reliability Engineer to ensure high availability, reliability and performance of cloud infrastructure and services. Experience in cloud platforms (AWS, GCP), automation, monitoring, and incident management. ...


  • Bengaluru, India Tech Mahindra (formerly Mahindra Satyam) Full time

    We are looking for Senior SRE who can join with us in Pan India Location.Skill - SRE with GCP platformExp - 6- 13 YrsLocation - Pan IndiaHybrid work locationAs an Off-Shore Site Reliability Engineer, you will have the following preferred skillsets: - Experience with cloud platforms such as Azure, or GCP - Proficiency in scripting languages such as Python,...


  • Bengaluru, India Tech Mahindra (formerly Mahindra Satyam) Full time

    We are looking for Senior SRE who can join with us in Pan India Location.Skill - SRE with GCP platformExp - 6- 13 YrsLocation - Pan IndiaHybrid work locationAs an Off-Shore Site Reliability Engineer, you will have the following preferred skillsets: - Experience with cloud platforms such as Azure, or GCP - Proficiency in scripting languages such as Python,...


  • Bengaluru, India Oracle Full time

    Building off our Cloud momentum, Oracle has formed a new organization - Oracle Health Applications & Infrastructure. This team will focus on product development and product strategy for Oracle Health while building out a complete platform supporting modernized, automated healthcare. This is a net new line of business, constructed with an entrepreneurial...


  • Bengaluru, India Tranzeal Incorporated Full time

    Job Title: Site Reliability Engineer (SRE)Location: BangaloreWe're hiring aSite Reliability Engineerto join our team in Bangalore! If you have a strong background in maintaining and scaling cloud services and love automating infrastructure at scale, this is for you.Experience with Ansible and Kubernetes is a MUST-HAVEKey Responsibilities:Manage and scale...


  • Bengaluru, India Tranzeal Incorporated Full time

    Job Title: Site Reliability Engineer (SRE)Location: BangaloreWe're hiring a Site Reliability Engineer to join our team in Bangalore! If you have a strong background in maintaining and scaling cloud services and love automating infrastructure at scale, this is for you.Experience with Ansible and Kubernetes is a MUST-HAVEKey Responsibilities:Manage and scale...


  • Bengaluru, India Tranzeal Incorporated Full time

    Job Title: Site Reliability Engineer (SRE)Location: BangaloreWe're hiring a Site Reliability Engineer to join our team in Bangalore! If you have a strong background in maintaining and scaling cloud services and love automating infrastructure at scale, this is for you.Experience with Ansible and Kubernetes is a MUST-HAVEKey Responsibilities:Manage and scale...


  • Bengaluru, India Tranzeal Incorporated Full time

    Job Title: Site Reliability Engineer (SRE)Location: Bangalore, KAWork Mode: Office (5Days/Week)Position Type: Contract basedWe're hiring aSite Reliability Engineerto join our team in Bangalore! If you have a strong background in maintaining and scaling cloud services and love automating infrastructure at scale, this is for you.Experience with Ansible and...


  • Bengaluru, India Laerdal Bangalore Full time

    As a Senior Site Reliability Engineer, you’ll play a pivotal role in ensuring the reliability and performance of our cloud-based applications and solutions. Collaborating closely with our team, you will cultivate a culture of SRE breaking down silos and managing incidents and problems. Your role will involve developing and implementing innovative solutions...


  • Bengaluru, India Tranzeal Incorporated Full time

    Job Title: Site Reliability Engineer (SRE)Location: Bangalore, KAWork Mode: Office (5 Days/Week)Position Type: Contract basedWe're hiring a Site Reliability Engineer to join our team in Bangalore! If you have a strong background in maintaining and scaling cloud services and love automating infrastructure at scale, this is for you.Experience with Ansible...


  • Bengaluru, India Tranzeal Incorporated Full time

    Job Title: Site Reliability Engineer (SRE)Location: Bangalore, KAWork Mode: Office (5Days/Week)Position Type: Contract basedWe're hiring a Site Reliability Engineer to join our team in Bangalore! If you have a strong background in maintaining and scaling cloud services and love automating infrastructure at scale, this is for you.Experience with Ansible and...


  • Bengaluru, India Laerdal Bangalore Full time

    As a Senior Site Reliability Engineer, you’ll play a pivotal role in ensuring the reliability and performance of our cloud-based applications and solutions. Collaborating closely with our team, you will cultivate a culture of SRE breaking down silos and managing incidents and problems. Your role will involve developing and implementing innovative solutions...