Red Hat Principal Site Reliability Engineer
1 month ago
Red Hat is seeking a highly skilled Principal Site Reliability Engineer to join our team and contribute to the development, scaling, and operation of our OpenShift managed cloud services. As a key member of our SRE team, you will play a critical role in enabling customer self-service, improving our monitoring system, and eliminating work through automation.
Key Responsibilities:
- Manage, deploy, and operate cloud solutions at scale using Site Reliability Engineering principles
- Participate in the design and development of new features to enable OpenShift 'as-a-service' across multiple public clouds
- Design and write automation software to provision, upgrade, monitor, and heal a large global fleet of OpenShift clusters deployed across multiple public clouds
- Identify single points of failure and other high-risk architecture issues; propose and implement more resilient resolutions
- Interact with multiple teams within Red Hat and with the open source community to contribute to both the upstream and downstream projects to deliver functionality
- Participate in product release cycles, deploying code to integration, staging, and production environments, integrating with CI/CD tooling, monitoring, and change management
- Perform software updates, peer code reviews, testing, and CVE analysis; respond to security threats
- Interact with automated monitoring and healing infrastructure to ensure healthy environments
- Provide engineering support to Red Hat's global technical support team to resolve customer issues
- Help and develop peers through knowledge sharing, mentoring, and collaboration
- Create and maintain standard operating procedures (SOPs) for performing maintenance tasks, applying configuration changes, and remediating problems in our environment
- Participate in a follow-the-sun on-call rotation
Requirements:
- 10+ years of software engineering experience using object-oriented languages; golang is preferred
- 5+ years of experience managing Linux-based systems in a public cloud such as AWS, GCP, or Azure
- 5+ years of experience with enterprise systems monitoring; knowledge of Prometheus is preferred
- 5+ years of experience with enterprise configuration management such as Ansible, Puppet, or Chef
- 3+ years of experience delivering hosted cloud services
- 1+ year of experience with Kubernetes
- 1+ year of experience with containers on Linux
- Superior communications skills and experience working directly with and presenting to customers
- Ability to quickly learn new technologies and follow industry trends
- Demonstrated ability to quickly and accurately troubleshoot systems issues
- Solid understanding of standard TCP/IP networking and common protocols like DNS and HTTP
About Red Hat:
Red Hat is the world's leading provider of enterprise open source software solutions, using a community-powered approach to deliver high-performing Linux, cloud, container, and Kubernetes technologies. We are a global team with a flexible work environment that allows associates to choose the work environment that suits their needs. We encourage creative, passionate people to contribute their ideas, help solve complex problems, and make an impact. Opportunities are open, and we welcome and encourage applicants from all dimensions of diversity.
Diversity, Equity & Inclusion at Red Hat:
Red Hat's culture is built on the open source principles of transparency, collaboration, and inclusion, where the best ideas can come from anywhere and anyone. We aspire to create an environment where everyone experiences equal opportunity and access, and all voices are not only heard but also celebrated. We welcome and encourage applicants from all dimensions of diversity.
Equal Opportunity Policy (EEO):
Red Hat is proud to be an equal opportunity workplace and an affirmative action employer. We review applications for employment without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, ancestry, citizenship, age, veteran status, genetic information, physical or mental disability, medical condition, marital status, or any other basis prohibited by law.
-
Red Hat
2 months ago
Pune, Maharashtra, India RED HAT Full timeJob Description :Red Hat is seeking a Principal Site Reliability Engineer (SRE) to develop, scale, and operate our OpenShift managed cloud services. OpenShift is Red Hats enterprise Kubernetes distribution. As an SRE you will contribute to running OpenShift at scale by enabling customer self-service, making our monitoring system more sustainable, and...
-
Red Hat
2 months ago
Pune, India RED HAT Full timeJob Description :Red Hat is seeking a Principal Site Reliability Engineer (SRE) to develop, scale, and operate our OpenShift managed cloud services. OpenShift is Red Hats enterprise Kubernetes distribution. As an SRE you will contribute to running OpenShift at scale by enabling customer self-service, making our monitoring system more sustainable, and...
-
Red Hat
2 months ago
Pune, India RED HAT Full timeJob Description :Red Hat is seeking a Senior Site Reliability Engineer (SRE) to develop, scale, and operate our OpenShift managed cloud services. OpenShift is Red Hats enterprise Kubernetes distribution. As an SRE you will contribute to running OpenShift at scale by enabling customer self-service, making our monitoring system more sustainable, and...
-
Senior Site Reliability Engineer
1 month ago
Pune, Maharashtra, India RED HAT Full timeJob DescriptionRed Hat is seeking a highly skilled Senior Site Reliability Engineer to join our team and contribute to the development, scaling, and operation of our OpenShift managed cloud services. As a key member of our SRE team, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud services.Key...
-
Principal Software Engineer
4 months ago
Pune, India Red Hat India Private Limited Full timeRed Hat is seeking a Principal Site Reliability Engineer (SRE) to develop, scale, and operate our OpenShift managed cloud services. OpenShift is Red Hat’s enterprise Kubernetes distribution. As an SRE you will contribute to running OpenShift at scale by enabling customer self-service, making our monitoring system more sustainable, and eliminating work...
-
Red Hat
2 months ago
Pune, Maharashtra, India RED HAT Full timeAbout the job : The Red Hat Experience Engineering (XE) Sustaining Engineering team is looking for a Senior Software Engineer to help lead a new team aimed at improving the long-term product experience of our Red Hat Enterprise Linux (RHEL) customers. In this role, you will work closely with Product Engineering to deliver on extended product maintenance...
-
Red Hat
2 months ago
Pune, India RED HAT Full timeAbout the job : The Red Hat Experience Engineering (XE) Sustaining Engineering team is looking for a Senior Software Engineer to help lead a new team aimed at improving the long-term product experience of our Red Hat Enterprise Linux (RHEL) customers. In this role, you will work closely with Product Engineering to deliver on extended product maintenance...
-
Senior Software Engineer
1 month ago
Pune, India RED HAT Full timeAbout the RoleThe Red Hat Enterprise Linux Sustaining Engineering team is seeking a Senior Software Engineer to lead the development and delivery of fixes for extended product maintenance work. In this role, you will work closely with Product Engineering to handle defects, bugs, and CVEs in any extended life streams of Red Hat Enterprise Linux.Key...
-
Red Hat Technical Support Associate Manager
1 month ago
Pune, India RED HAT Full timeAbout the JobWe are seeking an experienced Associate Manager for Technical Support to join our team at Red Hat in Pune, India. As a key member of our Customer Experience and Engagement (CEE) team, you will be responsible for managing a team of highly technical associates who provide exceptional service to our enterprise customers.Key Responsibilities:Manage...
-
Principal Cloud Reliability Engineer
4 weeks ago
Pune, Maharashtra, India Red Hat India Private Limited Full timeRed Hat is seeking an experienced Principal Cloud Reliability Engineer to develop, scale, and operate our OpenShift managed cloud services. As a key member of our SRE team, you will contribute to running OpenShift at scale by enabling customer self-service, making our monitoring system more sustainable, and eliminating work through automation.On the SRE...
-
Associate Manager for Technical Support
1 month ago
Pune, India RED HAT Full timeAbout the JobThe Red Hat Customer Experience and Engagement team is seeking a skilled Associate Manager for Technical Support to join us in India. In this role, you will lead a team of technical associates, providing an exceptional service experience for our enterprise customers. You will work with colleagues worldwide to drive initiatives and develop...
-
Red Hat
2 months ago
Pune, Maharashtra, India RED HAT Full timeAbout the Job : The Red Hat Customer Experience and Engagement (CEE) team is looking for an Associate Manager for Technical Support to join us in Pune, India. In this role, you will manage a team of highly technical associates who are responsible for providing an excellent service for our enterprise customers. You'll work with your peers around the world...
-
Red Hat
2 months ago
Pune, India RED HAT Full timeAbout the Job : The Red Hat Customer Experience and Engagement (CEE) team is looking for an Associate Manager for Technical Support to join us in Pune, India. In this role, you will manage a team of highly technical associates who are responsible for providing an excellent service for our enterprise customers. You'll work with your peers around the...
-
Senior Site Reliability Engineer
3 weeks ago
Pune, Maharashtra, India Red Hat India Private Limited Full timeJob Summary Red Hat is seeking a Senior Site Reliability Engineer to develop, scale, and operate our OpenShift managed cloud services. As an SRE, you will contribute to running OpenShift at scale by enabling customer self-service, making our monitoring system more sustainable, and eliminating work through automation. Responsibilities The day-to-day...
-
Senior Site Reliability Engineer
1 month ago
Pune, Maharashtra, India Red Hat India Private Limited Full timeAbout the RoleWe are seeking a Senior Site Reliability Engineer to join our team at Red Hat India Private Limited. As a key member of our cloud services team, you will play a critical role in developing, scaling, and operating our OpenShift managed cloud services.Key ResponsibilitiesContribute to the design, development, and operation of our OpenShift...
-
Senior Site Reliability Engineer for OpenShift
3 weeks ago
Pune, Maharashtra, India RED HAT Full timeRole OverviewWe are seeking a skilled Senior Site Reliability Engineer (SRE) to join our team and contribute to the development, scaling, and operation of our OpenShift managed cloud services.Key ResponsibilitiesContribute to the design, implementation, and maintenance of scalable and reliable cloud services.Collaborate with cross-functional teams to...
-
Site Reliability Engineer
5 months ago
Pune, India Red Hat India Private Limited Full timeAbout the job: The Red Hat IT OpenShift team is looking for a Site Reliability Engineer (SRE) based in India (Pune or Bangalore) to join our team. In this role, you will develop, scale, and operate our Red Hat OpenShift Managed Cloud platform. Red Hat OpenShift is our enterprise kubernetes distribution. As an SRE, you will contribute to running Red Hat...
-
Senior Software Engineer
4 months ago
Pune, India Red Hat India Private Limited Full timeRed Hat is seeking a Senior Site Reliability Engineer (SRE) to develop, scale, and operate our OpenShift managed cloud services. OpenShift is Red Hat’s enterprise Kubernetes distribution. As an SRE you will contribute to running OpenShift at scale by enabling customer self-service, making our monitoring system more sustainable, and eliminating work...
-
Senior Cloud Architect
1 month ago
Pune, India RED HAT Full timeJob Title: Principal Site Reliability EngineerJob OverviewRed Hat is seeking an experienced Principal Site Reliability Engineer to develop and operate our OpenShift managed cloud services. As an SRE, you will contribute to running OpenShift at scale by enabling customer self-service, making our monitoring system more sustainable, and eliminating work through...
-
Principal Software Engineer
1 month ago
Pune, Maharashtra, India Red Hat India Private Limited Full timeRed Hat is seeking a highly skilled Principal Software Engineer to develop, scale, and operate our OpenShift managed cloud services. As a key member of our team, you will contribute to running OpenShift at scale by enabling customer self-service, making our monitoring system more sustainable, and eliminating work through automation.As a Principal Software...