Site Reliability Engineer II

4 weeks ago


hyderabad, India Microsoft Full time

Overview

Are you looking to make a real difference in Microsoft’s mission to empower every person and organization to achieve more, with the power of cloud computing? Are you passionate about driving reliability of the services to make customers’ mission critical workloads run peacefully? Do you want to be part of a team that has a start-up mindset and work together to delight our customers and have lots of fun and learning along the way?

If this excites you, then come join the Azure Specialized team in India. We are responsible for building and offering specialized hardware and bare-metal solutions in Azure. This involves large scale specialized solutions like Azure Large Instances, Azure VMWare Services, super computers like Cray and more.

We are looking for a Service Reliability Engineer who is ready to be a part of a team that moves fast, leverages continuous delivery practices, and is customer focused. This position requires strong collaboration and teamwork across team and organizational boundaries, playing a vital role in reliability of engineering services that delight the customer. Your ability to be the customer advocate, focus on service first, and part of a team that tears down silos to deliver the best customer experience will be critical to your success, along with the teams.

Qualifications

Bachelor of Computer Science or equivalent industry experience 5+ years of professional experience with 3+ years of experience involving service operations, Data centre operations, monitoring, and reliability improvement. Proven ability to collaborate across teams & organizations. Experience in managing distributed systems and/or cloud platforms a plus. Publications and/or certifications related to cloud technologies a plus.

Technical Skills

Sound understanding of software service monitoring & reliability. Sound understanding of Data centre setup, monitoring & reliability. Strong inclination for quality of services aiming towards zero unplanned downtime. Experience in implementing systems to detect early symptoms towards failure is preferred.

Languages/Tools Experience

Proficient in scripting with PowerShell, Shell script, etc. Deep expertise and experience working on VMware and its ecosystem a plus. Experience working on Azure, Kubernetes, Docker, and the containers ecosystem a plus. Operational knowledge and experience in EPIC and/or NetApp is a plus.

Responsibilities

· Make software services reliable, secure, monitor and auto-manage infrastructure for specialized workloads on Azure.

· You will be involved in defining reliability of infrastructure & platform services used by mission critical workloads and implementing this cutting across Azure control and data plane components, networking, and operating systems.

· You will be responsible for a scenario that would require you to collaborate closely across organizations and teams, to collaborate across geographies, and to mentor and guide junior engineers in the team.

· You will get to enable mission critical workloads on Azure. It is a fast-paced environment. Our emphasis is on value to customers and live site excellence.

Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.Industry leading healthcareEducational resourcesDiscounts on products and servicesSavings and investmentsMaternity and paternity leaveGenerous time awayGiving programsOpportunities to network and connect

  • Hyderabad, India Electronic Arts Full time

    ROLE:  Site Reliability Engineer II EXP: 5-8 years TECH STACK: Containerisation & Orchestration : Docker, Kubernetes, Rancher, EKS, ECS, GKE, Elastic Beanstalk, Google App Engine Cloud Platform               : AWS, GCP IaaC                              : Terraform, AWS-CloudFormation / GCP-CloudDeploymentManager, Ansible...


  • Hyderabad, Telangana, India Electronic Arts Full time

    ROLE: Site Reliability Engineer IIEXP: 5-8 yearsTECH STACK:Containerisation & Orchestration : Docker, Kubernetes, Rancher, EKS, ECS, GKE, Elastic Beanstalk, Google App EngineCloud Platform : AWS, GCPIaaC : Terraform, AWS-CloudFormation / GCP-CloudDeploymentManager, AnsibleInfra Monitoring : Prometheus, Datadog, Alert Manager, Thanos, AWS CloudwatchCI/CD :...


  • Hyderabad, Telangana, India Electronic Arts Full time

    ROLE: Site Reliability Engineer IIEXP: 5-8 yearsTECH STACK:Containerisation & Orchestration : Docker, Kubernetes, Rancher, EKS, ECS, GKE, Elastic Beanstalk, Google App EngineCloud Platform : AWS, GCPIaaC : Terraform, AWS-CloudFormation / GCP-CloudDeploymentManager, AnsibleInfra Monitoring : Prometheus, Datadog, Alert Manager, Thanos, AWS CloudwatchCI/CD :...


  • Hyderabad, India Microsoft Full time

    Overview Do you have a passion for high scale services and working with some of Microsoft’s most critical cloud capabilities? We’re looking for a Senior Site Relability Engineer with the right mix of software development, Cloud experience and passion for quality to envision, design, and deliver solutions for Microsoft's cloud Infrastructure. ...


  • hyderabad, India Insight Global Full time

    Required Skills and Experience *- Bachelor's or master's degree in computer science, Software Engineering, or a related field.- Proven experience (7+ years) in SRE, automation testing- Strong skills in developing and implementing automation testing strategies and frameworks.- Solid understanding of site reliability principles and best practices.- Leadership...


  • Hyderabad, India Insight Global Full time

    Required Skills and Experience * - Bachelor's or master's degree in computer science, Software Engineering, or a related field. - Proven experience (7+ years) in SRE, automation testing - Strong skills in developing and implementing automation testing strategies and frameworks. - Solid understanding of site reliability principles and best practices. -...


  • Hyderabad, India Insight Global Full time

    Required Skills and Experience *- Bachelor's or master's degree in computer science, Software Engineering, or a related field.- Proven experience (7+ years) in SRE, automation testing- Strong skills in developing and implementing automation testing strategies and frameworks.- Solid understanding of site reliability principles and best practices.- Leadership...


  • hyderabad, India Virtusa Full time

    Site Reliability engineer - CREQ188641 Description Position : SRE Primary skills: devops CI/CD pipeline Location: Hyderabad Should have proficiency in understanding of application monitoring stack(Logs, Events, Metrics and Alerts) and ability to visualize and setup end-to-end observability.Should have proficiency in industry standard monitoring...


  • Hyderabad, India Virtusa Full time

    Site Reliability engineer - CREQ188641 Description Position : SRE Primary skills: devops CI/CD pipeline Location: Hyderabad Should have proficiency in understanding of application monitoring stack(Logs, Events, Metrics and Alerts) and ability to visualize and setup end-to-end observability. Should have proficiency in industry standard monitoring tools...


  • Hyderabad, India Virtusa Full time

    Site Reliability engineer - CREQ188641 Description Position : SRE Primary skills: devops CI/CD pipeline Location: Hyderabad Should have proficiency in understanding of application monitoring stack(Logs, Events, Metrics and Alerts) and ability to visualize and setup end-to-end observability. Should have proficiency in industry standard monitoring tools...


  • Hyderabad, India Snaphunt Full time

    The OfferWork within a company with a solid track record of successGreat work environmentAttractive salary & benefitsThe Job You will be responsible for : Gathering and evaluating user feedback.Providing code documentation and other inputs to technical documents.Supporting continuous improvement by investigating alternatives and new technologies and...


  • hyderabad, India Snaphunt Full time

    The Offer Work within a company with a solid track record of success Great work environment Attractive salary & benefits The Job You will be responsible for : Gathering and evaluating user feedback. Providing code documentation and other inputs to technical documents. Supporting continuous improvement by investigating alternatives and new technologies...


  • hyderabad, India Microsoft Full time

    Overview Are you interested in working for one of the most exciting products at Microsoft, passionate about exceeding customer expectations and advancing Microsoft's cloud first strategy? Are you interested in a start-up like the environment, passionate about cloud computing technology and driving growth in one of Microsoft's core businesses? If...


  • Hyderabad, India Microsoft Full time

    Overview Are you interested in working for one of the most exciting products at Microsoft, passionate about exceeding customer expectations and advancing Microsoft's cloud first strategy? Are you interested in a start-up like the environment, passionate about cloud computing technology and driving growth in one of Microsoft's core businesses? If so,...


  • Hyderabad, India Microsoft Full time

    Overview Are you interested in working for one of the most exciting products at Microsoft, passionate about exceeding customer expectations and advancing Microsoft's cloud first strategy? Are you interested in a start-up like the environment, passionate about cloud computing technology and driving growth in one of Microsoft's core businesses? If so,...


  • Hyderabad, India Korn Ferry Full time

    Role - Site Reliability EngineerExp - 5+ years RequiredLocation - Hyderabad ( Work from Office-Hybrid)Shift Timings - 5AM -1 PM IST We are looking for a Site Reliability Engineer with strong development background to join our team. In this role, you will be responsible for ensuring the reliability and performance of our systems. You will work closely to our...


  • Hyderabad, India Korn Ferry Full time

    Role - Site Reliability EngineerExp - 5+ years RequiredLocation - Hyderabad ( Work from Office-Hybrid)Shift Timings - 5AM -1 PM IST We are looking for a Site Reliability Engineer with strong development background to join our team. In this role, you will be responsible for ensuring the reliability and performance of our systems. You will work closely to our...


  • Hyderabad, India Korn Ferry Full time

    Role - Site Reliability EngineerExp - 5+ years RequiredLocation - Hyderabad ( Work from Office-Hybrid)Shift Timings - 5AM -1 PM IST We are looking for a Site Reliability Engineer with strong development background to join our team. In this role, you will be responsible for ensuring the reliability and performance of our systems. You will work closely to our...


  • Hyderabad, India FedEx ACC Full time

    Skill Required: Under general supervision, assists in the development and design of deliverables that support the resolution of moderately complex problems and technical design gaps. Supports improvement initiatives that are aligned with overarching global reliability of the company‘s systems, including capacity planning, failover strategies, performance...


  • Hyderabad, India Vistex Full time

    Vistex is currently hiring a Site Reliability Engineer. The Vistex Site Reliability Engineer will be primarily responsible for service availability, performance, monitoring, incident response, and capacity planning. This is a highly technical, hands-on role with a strong focus on automation, accurate monitoring, actionable alerting, resilient design,...