Site Reliability Engineer
1 day ago
At Amgen, if you feel like you're part of something bigger, it's because you are. Our shared mission—to serve patients living with serious illnesses—drives all that we do.
Since 1980, we've helped pioneer the world of biotech in our fight against the world's toughest diseases. With our focus on four therapeutic areas –Oncology, Inflammation, General Medicine, and Rare Disease– we reach millions of patients each year. As a member of the Amgen team, you'll help make a lasting impact on the lives of patients as we research, manufacture, and deliver innovative medicines to help people live longer, fuller happier lives.
Our award-winning culture is collaborative, innovative, and science based. If you have a passion for challenges and the opportunities that lay within them, you'll thrive as part of the Amgen team. Join us and transform the lives of patients while transforming your career.
Site Reliability Engineer
What you will doLet's do this. Let's change the world. In this vital role you will responsible for the reliability, stability, performance, scalability, and security of platforms that support Amgen's digital products and engineering teams. This hands-on role focuses on supporting cloud-based infrastructure, automating operations, maintaining observability, and improving platform reliability through code.
You'll work closely with senior engineers and cross-functional teams to support CI/CD workflows, container platforms, incident response, and enterprise tooling—all while adopting modern SRE principles and practices.
This role is ideal for engineers who have foundational site reliability experience and are looking to expand their skills in a cloud-native, enterprise-scale environment.
Roles & Responsibilities:
Infrastructure & Platform Support- Provision and manage cloud infrastructure using Infrastructure as Code (IaC)
- Support container orchestration platforms, ensuring availability, access control, and resource management
- Assist in configuring and maintaining CI/CD pipelines and environments
Monitoring & Incident Response
- Set up and maintain observability tools to track system health and performance
- Participate in alert tuning, incident resolution, and root cause analysis
- Support integration of observability platforms with incident response workflows
Automation & Platform Operations
- Automate routine platform tasks such as provisioning, patching, and configuration
- Write scripts to improve platform reliability, reduce manual work, and enforce compliance
- Participate in platform upgrades, maintenance windows, and service validation efforts
- Support the adoption of AI-assisted operational tools for log analysis, anomaly detection, and predictive alerts
- Collaborate with senior engineers to evaluate AI/ML-based observability and automation platforms
- Assist in integrating AI-driven insights into dashboards, alerts, or incident workflows
- Stay current with emerging AI trends in infrastructure and site reliability, and contribute to tool evaluations and pilots
- Work with development, QA, and security teams to ensure reliable and secure deployments
- Document operational procedures, playbooks, and system runbooks
- Learn and support enterprise collaboration platforms and internal tooling
- Participate in Agile and SAFe delivery processes—including sprint planning, stand-ups, retrospectives, and PI planning—to ensure security and platform reliability are embedded across development cycles.
We are all different, yet we all use our unique contributions to serve patients. The [vital attribute] professional we seek is a [type of person] with these qualifications.
Basic Qualifications:
- Master's degree / Bachelor's degree and 5 to 9 years in Computer Science, IT or related field
- 4 years of hands-on related experience in site reliability, DevOps, or platform engineering roles
- Hands-on experience with cloud platforms preferably AWS
- Familiarity with Kubernetes or container orchestration technologies
- Exposure to CI/CD practices and pipeline automation
- Experience troubleshooting Linux systems, processes, and services
Preferred Qualifications:
Must-Have Skills:- Practical experience with cloud platforms (e.g., AWS, Azure, or GCP), including compute, networking, IAM, and storage services
- Familiarity with container orchestration platforms (e.g., Kubernetes, Docker), including basic workload deployment and troubleshooting
- Experience using Infrastructure as Code (IaC) tools such as Terraform or CloudFormation
- Working knowledge of Linux administration, including system services, package management, and file system structures
- Hands-on exposure to CI/CD platforms (e.g., GitLab CI, Jenkins, GitHub Actions) and pipeline troubleshooting
- Proficiency in scripting or automation languages like Python, Bash, or Go
- Exposure to observability tooling (e.g., Dynatrace, Prometheus, or Grafana) for monitoring and alerting
- Familiarity with incident management practices and tools (e.g., runbooks, escalation workflows, basic alert tuning)
- Version control skills using Git and understanding of branching strategies
- Experience supporting or integrating enterprise collaboration platforms (e.g., Jira, Confluence, ServiceNow)
- Interest and basic understanding of AI/ML tools used in infrastructure and operations (e.g., anomaly detection, intelligent alerting, log analysis)
- Experience using Infrastructure as Code (IaC) tools like Terraform or CloudFormation
- Familiarity with IT incident response workflows and ticketing platforms
- Knowledge of secrets management, configuration management tools (e.g., Ansible), or logging frameworks
- Exposure to AI-assisted tooling (e.g., AIOps platforms, AI-enhanced alerting, anomaly detection)
Professional Certifications (Preferred)
- Cloud DevOps Certification (AWS/Azure/GCP)
- Certified Kubernetes Administrator (CKA) or Security Specialist (CKS)
- CI/CD Platform Certification
- ITIL Foundation or equivalent service management certification
- Strong analytical and troubleshooting skills
- Collaborative and proactive mindset
- Effective communication and documentation practices
- Curiosity and willingness to adopt new tools and methods, including AI integrations
- Ability to manage time and prioritize tasks in dynamic environments
Shift Information: This position is an onsite role and may require working during later hours to align with business hours. Candidates must be willing and able to work outside of standard hours as required to meet business needs.
What you can expect of usAs we work to develop treatments that take care of others, we also work to care for your professional and personal growth and well-being. From our competitive benefits to our collaborative culture, we'll support your journey every step of the way.
In addition to the base salary, Amgen offers competitive and comprehensive Total Rewards Plans that are aligned with local industry standards.
Apply now and make a lasting impact with the Amgen team.As an organization dedicated to improving the quality of life for people around the world, Amgen fosters an inclusive environment of diverse, ethical, committed and highly accomplished people who respect each other and live the Amgen values to continue advancing science to serve patients. Together, we compete in the fight against serious disease.
Amgen is an Equal Opportunity employer and will consider all qualified applicants for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, protected veteran status, disability status, or any other basis protected by applicable law.
We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment. Please contact us to request accommodation.
-
Site Reliability Engineer
6 days ago
Hyderabad, Telangana, India Apple Full time ₹ 15,00,000 - ₹ 25,00,000 per yearImagine what you could do here. Apple is a place where extraordinary people gather to do their best work. Together we craft products and experiences people once couldn't have imagined — and now can't imagine living without. If you're motivated by the idea of making a real impact, and joining a team where we pride ourselves in being one of the most diverse...
-
SRE(Site Reliability Engineer)
4 days ago
Hyderabad, Telangana, India Talent Worx Full time ₹ 20,00,000 - ₹ 25,00,000 per yearSRE (Site Reliability Engineer)Talent Worx is seeking a talented SRE (Site Reliability Engineer) to enhance our technology team. In this role, you will be pivotal in ensuring the reliability, performance, and availability of our applications and services. Your work will involve both software engineering and systems operations as you strive to improve...
-
Site Reliability Engineer
2 weeks ago
Hyderabad, Telangana, India TurboHire Full time ₹ 15,00,000 - ₹ 28,00,000 per yearSite Reliability Engineer (SRE)Location: Hyderabad (Hybrid)Experience: 3–5 yearsAbout the RoleWe are looking for an SRE Engineer to own reliability, deployment, and monitoringof TurboHire's cloud infrastructure. You will ensure our platform is scalable, secure,and highly available. The role balances hands-on coding, automation, and infraoperations, freeing...
-
Site Reliability Engineer
4 days ago
Hyderabad, Telangana, India LivePerson Full time ₹ 8,00,000 - ₹ 15,00,000 per yearLivePerson (NASDAQ: LPSN) is a leading customer engagement company, creating digital experiences powered by Curiously Human AI. Every person is unique, and our technology makes it possible for companies, including leading brands like HSBC, Orange, and GM Financial, to treat their audiences that way at scale. Nearly a billion conversational interactions are...
-
Site Reliability Engineer III
3 days ago
Hyderabad, Telangana, India Chase- Candidate Experience page Full time ₹ 1,04,000 - ₹ 1,30,878 per yearThere's nothing more exciting than being at the center of a rapidly growing field in technology and applying your skillsets to drive innovation and modernize the world's most complex and mission-critical systems.As a Site Reliability Engineer III at JPMorgan Chase within the Chief Technology Office team, you will solve complex and broad business problems...
-
Site Reliability Engineering
22 hours ago
Hyderabad, Telangana, India Acesoft Labs Full time ₹ 20,00,000 - ₹ 25,00,000 per yearHi ,Kindly find the below JD :Job Title: Site Reliability Engineering (SRE) ManagerLocation: HyderabadEmployment Type: Full-TimeWork Model - 3 Days from office (Hybrid)Summary:The SRE Manager at TechBlocks India will lead the reliability engineering function, ensuring infrastructure resiliency and optimal operational performance. This hybrid role blends...
-
Lead Site Reliability Engineer
1 week ago
Hyderabad, Telangana, India EPAM Systems Full time ₹ 15,00,000 - ₹ 25,00,000 per yearWe are seeking a skilledLead Site Reliability Engineerto drive the stability, scalability, and reliability of our systems while improving efficiency through automation and best practices.This role calls for deep expertise in DevOps methodologies, Infrastructure as Code (IaC), and collaboration across teams to ensure optimal system...
-
Site Reliability Support Engineer
18 hours ago
Hyderabad, Telangana, India Innovatz Global Full time ₹ 9,00,000 - ₹ 12,00,000 per yearCompany DescriptionInnovatz Global is a leading Management Consulting, Technology Services, and Business Process Outsourcing company headquartered in Kuala Lumpur, Malaysia. With over 500 skilled professionals, we have a significant presence across America, China, India, Australia, and several other countries. We have a proven track record of delivering...
-
Principal Site Reliability Engineer
2 weeks ago
Hyderabad, Telangana, India Amgen Inc Full time ₹ 8,00,000 - ₹ 12,00,000 per yearWe are looking for a Site Reliability Engineer/Cloud Engineer (SRE) to work on the performance optimization, standardization, and automation of Amgens critical infrastructure and systems. This role is crucial to ensuring the reliability, scalability, and cost-effectiveness of our production systems. The ideal candidate will work on operational excellence...
-
Lead Site Reliability Engineer
4 days ago
Hyderabad, Telangana, India JPMorgan Chase Full time ₹ 2,00,00,000 - ₹ 2,50,00,000 per yearAssume a critical role in defining the future of a globally recognized firm and have a direct and significant effect in a realm tailored for top achievers in site reliability. As a Lead Site Reliability Engineer at JPMorgan Chase within the Consumer & Community Banking, you hold a leadership role in your team, demonstrate strong knowledge across...