Senior Site Reliability Engineer

2 weeks ago


Pune, Maharashtra, India Red Hat India Private Limited Full time

Job Summary

Red Hat is seeking a Senior Site Reliability Engineer to develop, scale, and operate our OpenShift managed cloud services.

As an SRE, you will contribute to running OpenShift at scale by enabling customer self-service, making our monitoring system more sustainable, and eliminating work through automation.

Responsibilities

The day-to-day responsibilities of an SRE involve working with live systems and coding automation. As an SRE, you will be expected to:

  • Contribute code to increase the scalability and reliability of the service
  • Contribute software tests and participate in peer review to increase the quality of our codebase
  • Help and develop peers' capabilities through knowledge sharing, mentoring, and collaboration
  • Participate in a regular on-call schedule, including occasional paid weekends and holidays
  • Practice sustainable incident response and blameless postmortems
  • Resolve customer issues escalated from the Red Hat Global Support team
  • Work within a small agile team to develop and improve SRE software, support your peers, plan and self-improve

Requirements

A bachelor's degree in Computer Science or a related technical field involving software or systems engineering is required. However, hands-on experience that demonstrates your ability and interest in Site Reliability Engineering are valuable to us, and may be considered in lieu of degree requirements. You must have some experience programming in at least one of these languages: Python, Golang, Java, C, C++, or another object-oriented language. You must have experience working with public clouds such as AWS, GCP, or Azure. You must also have the ability to collaboratively troubleshoot and solve problems in a team setting.

We prefer candidates with experience troubleshooting an as-a-service offering (SaaS, PaaS, etc.) and some experience working with complex distributed systems. Direct experience with Kubernetes or OpenShift is a plus. We like to see a demonstrated ability to debug, optimize code, and automate routine tasks. We are Red Hat, so you need a basic understanding of Unix/Linux operating systems.

Preferred Skills

5+ years of experience managing Linux servers running Red Hat Enterprise Linux (RHEL), CentOS, or Fedora hosted at a cloud provider such as Amazon Web Services (AWS), Google Compute Engine (GCE), or Microsoft Azure

3+ years of experience with enterprise systems monitoring; knowledge of Prometheus is a plus

3+ years of experience with enterprise configuration management software like Ansible by Red Hat, Puppet, or Chef

2+ years of experience programming with at least one object-oriented language; Golang, Java, or Python are preferred

2+ years of experience delivering a hosted service

Demonstrated ability to quickly and accurately troubleshoot system issues

Solid understanding of standard TCP/IP networking and common protocols like DNS and HTTP

Solid communications skills and experience working directly with and presenting to customers

1+ year(s) of experience with Kubernetes is a plus

1+ year(s) of experience with docker-based containers is a plus



  • Pune, Maharashtra, India Global Payments Asia-Pacific India Private Limited Full time

    About This RoleAt Global Payments Asia-Pacific India Private Limited, we're shaping the future of payments technology. As a Senior Site Reliability Engineer, you'll play a key role in ensuring our systems are highly available, resilient, and performant.What You'll DoDesign and implement chaos experiments to test our systems' reliability and resilience.Push...


  • Pune, Maharashtra, India Coupa Software Full time

    About CoupaCoupa is a leading provider of spend management solutions, dedicated to helping businesses unlock their full potential and do well while doing good. Our mission is to empower customers to make informed decisions and drive growth through innovative technology and collaborative partnerships.Job DescriptionWe are seeking a highly skilled Senior Site...


  • Pune, Maharashtra, India Global Payments Asia-Pacific India Private Limited Full time

    At Global Payments Asia-Pacific India Private Limited, we're on a mission to make payments easier and more secure for millions of people around the world. As a Senior Site Reliability Engineer, you'll play a critical role in ensuring the availability, latency, and performance of our payment solutions.Key ResponsibilitiesDesign and implement solutions to...


  • Pune, Maharashtra, India Procore Technologies Full time

    Senior Site Reliability EngineerAbout the RoleWe are seeking a highly skilled Senior Reliability Engineer with strong backend software engineering skills to join our team at Procore Technologies. As a Senior Reliability Engineer, you will be responsible for designing, implementing, and maintaining our cloud infrastructure, ensuring the smooth operation of...


  • Pune, Maharashtra, India RED HAT Full time

    Job DescriptionRed Hat is seeking a highly skilled Senior Site Reliability Engineer to join our team and contribute to the development, scaling, and operation of our OpenShift managed cloud services. As a key member of our SRE team, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud services.Key...


  • Pune, Maharashtra, India Coupa Software Full time

    About the RoleCoupa Software is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a crucial role in ensuring the reliability, availability, and performance of our Enterprise platform.ResponsibilitiesDesign and implement automation solutions to increase reliability, availability, and...


  • Pune, Maharashtra, India People First Consultants Full time

    Job Title: Site Reliability EngineerAbout the Role:We are seeking a highly skilled Site Reliability Engineer to join our team at People First Consultants. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, efficiency, and performance of our applications and systems.Key Responsibilities:Work with development teams to...


  • Pune, Maharashtra, India PubMatic Full time

    Job Title: Site Reliability EngineerPubMatic, a leading technology company, is seeking a skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the seamless operation and optimal performance of our large-scale distributed software applications.Key Responsibilities:Monitor and analyze...


  • Pune, Maharashtra, India Etraveli Group Full time

    Etraveli Group is a leading global flight-centric Online Travel Agency (OTA) with a strong presence in the market. We operate a diverse range of websites and platforms, including gotogate.com, pamediakopes.gr, and flygresor.se.As a Site Reliability Engineer, you will play a critical role in ensuring the stability, uptime, security, and performance of our...


  • Pune, Maharashtra, India Coupa Software Full time

    About the RoleCoupa Software is seeking a skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in building and maintaining the technologies on our Coupa Cloud platform.ResponsibilitiesDesign and develop scalable, reliable, and secure cloud-based systemsWork closely with cross-functional teams to...


  • Pune, Maharashtra, India Roche Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Roche. As a Site Reliability Engineer, you will be responsible for designing, implementing, and maintaining site reliability engineering practices that ensure the reliability and performance of our production systems.Key ResponsibilitiesDesign and implement SRE...


  • Pune, Maharashtra, India Roche Full time

    Job Title: Site Reliability EngineerAbout the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Roche. As a Site Reliability Engineer, you will be responsible for designing, implementing, and maintaining site reliability engineering practices that ensure the reliability and performance of our production systems.Key...


  • Pune, Maharashtra, India People First Consultants Full time

    Job DescriptionWe are seeking a highly skilled Site Reliability Engineer to join our team at People First Consultants. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, efficiency, and performance of our applications and systems.Key Responsibilities:Work with development teams to ensure applications meet customer needs and...


  • Pune, Maharashtra, India F337 Deutsche India Private Limited, Pune Branch Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Deutsche Bank's Corporate Bank division. As a key member of our agile delivery team, you will play a pivotal role in ensuring the reliability, scalability, and performance of our systems.Your Key ResponsibilitiesDesign, build, and maintain robust and efficient...


  • Pune, Maharashtra, India Hansen Technologies Full time

    About The RoleWe are seeking a skilled Site Reliability Engineer to join our team in Pune, India. As a key member of our technical operations team, you will play a crucial role in ensuring the reliability, performance, and scalability of our systems.About YouWe are looking for a highly motivated and experienced Site Reliability Engineer who is passionate...


  • Pune, Maharashtra, India RED HAT Full time

    Role OverviewWe are seeking a skilled Senior Site Reliability Engineer (SRE) to join our team and contribute to the development, scaling, and operation of our OpenShift managed cloud services.Key ResponsibilitiesContribute to the design, implementation, and maintenance of scalable and reliable cloud services.Collaborate with cross-functional teams to...


  • Pune, Maharashtra, India Global Payments Asia-Pacific India Private Limited Full time

    At Global Payments Asia-Pacific India Private Limited, we're on a mission to make payments seamless and secure for millions of people around the world. As a Site Reliability Engineer, you'll play a critical role in ensuring the availability, latency, and performance of our payment solutions.Key ResponsibilitiesDesign and implement chaos engineering...


  • Pune, Maharashtra, India Coupa Software Full time

    **About the Role**We are seeking an experienced Cloud Systems Engineer to join our Site Reliability Engineering team at Coupa Software. As a Cloud Systems Engineer, you will be responsible for designing, implementing, and maintaining our cloud infrastructure to ensure high availability, scalability, and reliability of our services.**Responsibilities:***...


  • Pune, Maharashtra, India Red Hat India Private Limited Full time

    About the RoleWe are seeking a Senior Site Reliability Engineer to join our team at Red Hat India Private Limited. As a key member of our cloud services team, you will play a critical role in developing, scaling, and operating our OpenShift managed cloud services.Key ResponsibilitiesContribute to the design, development, and operation of our OpenShift...


  • Pune, Maharashtra, India Acoustic Full time

    Acoustic is seeking a seasoned Senior Site Reliability Engineer to contribute to the growth and success of our organization. We believe that the ideal candidate will bring innovative ideas and implement preventative measures to minimize downtime.Key ResponsibilitiesLead major incident calls and provide solutions to the team. Collaborate with our SRE teams to...