(31/10/2025) Sr. Site Reliability Engineer

3 weeks ago

Hyderabad Telangana, India Amgen Full time

Career Category Information Systems Join Amgen s Mission of Serving Patients At Amgen if you feel like you re part of something bigger it s because you are Our shared mission to serve patients living with serious illnesses drives all that we do Since 1980 we ve helped pioneer the world of biotech in our fight against the world s toughest diseases With our focus on four therapeutic areas -Oncology Inflammation General Medicine and Rare Disease- we reach millions of patients each year As a member of the Amgen team you ll help make a lasting impact on the lives of patients as we research manufacture and deliver innovative medicines to help people live longer fuller happier lives Our award-winning culture is collaborative innovative and science based If you have a passion for challenges and the opportunities that lay within them you ll thrive as part of the Amgen team Join us and transform the lives of patients while transforming your career Sr Site Reliability Engineer What you will do Let s do this Let s change the world In this vital role you will play a key role in building scaling and securing the platforms that underpin Amgen s global digital initiatives This role focuses on ensuring the reliability performance and efficiency of cloud-native platforms while enabling development velocity and operational excellence You will be responsible for designing and operating infrastructure and shared platforms used across the enterprise including CI CD observability incident management and collaboration systems You will work extensively with containerized environments handle multi-tenant Kubernetes platforms and automate processes to improve resilience and reduce operational burden This role requires deep technical depth leadership skills and the ability to drive initiatives across cross-functional teams and global stakeholders Roles Responsibilities Platform Reliability Engineering Design operate and scale secure highly available cloud-based infrastructure using Infrastructure as Code IaC Handle multi-tenant container orchestration environments with advanced access controls workload isolation and governance policies Ensure enterprise CI CD platforms are performant secure and optimized for high-throughput engineering teams Monitoring Observability Incident Management Build and handle observability platforms for full-stack visibility leveraging metrics logs and traces Define implement and continuously refine SLIs SLOs and error budgets for platform health and service performance Automate incident response workflows integrate with incident management platforms and lead post-incident reviews and root cause analysis Enterprise Platform Administration Operate and improve core engineering platforms e g CI CD collaboration knowledge sharing to ensure availability security and ease of use Automate platform provisioning upgrades access controls and integration pipelines to reduce manual effort and improve consistency Implement compliance audit logging and policy enforcement through code-driven governance models AI Adoption Enablement Drive the adoption of AI ML-based tools to enhance observability incident prediction remediation and intelligent alerting Evaluate and integrate AI-assisted automation platforms to reduce toil and improve operational efficiency Partner with platform security and development teams to embed predictive analytics into dashboards workflows and root cause tooling Champion a data-driven SRE practice by enabling thoughtful insights and anomaly detection across systems and platforms Leadership Collaboration Serve as a technical thought leader and mentor within the SRE organization Promote SRE principles and reliability culture across engineering teams Collaborate with cross-functional stakeholders to influence architecture roadmaps and platform investment Lead operational reviews and service health retrospectives with a focus on continuous improvement Participate in Agile and SAFe delivery processes including sprint planning stand-ups retrospectives and PI planning to ensure security and platform reliability are embedded across development cycles What we expect of you We are all different yet we all use our unique contributions to serve patients The vital attribute professional we seek is a type of person with these qualifications Basic Qualifications Doctorate degree Master s degree Bachelor s degree and 8 to 13 years in Computer Science Information Technology or a related technical field Demonstrated success operating cloud-native infrastructure in production environments Practical experience handling Kubernetes clusters and CI CD environments at enterprise scale Exposure to global on-call or incident support rotations Excellent collaboration and communication skills across technical and non-technical teams Preferred Qualifications Must-Have Skills Deep experience with cloud platforms AWS Azure or GCP including services such as compute networking IAM and VPC design Proven proficiency in Infrastructure as Code IaC using tools such as Terraform or CloudFormation Advanced skills in managing container orchestration platforms e g Kubernetes including workload isolation resource quotas and role-based access control Strong understanding of Linux system administration process management and system performance tuning Hands-on experience with CI CD platforms and pipelines build automation artifact storage environment provisioning rollback strategies Strong background in observability tooling including Prometheus Grafana Dynatrace and distributed tracing frameworks like OpenTelemetry or Jaeger Strong practical experience with incident management platforms and practices e g alert routing runbooks escalation paths Automation and scripting proficiency in languages such as Python Go or Bash Experience with configuration management tools like Ansible Chef or SaltStack Strong grasp of networking fundamentals such as routing DNS OSI layers load balancing firewalls TLS and security groups Version control and collaboration workflows using Git and GitOps principles Experience with enterprise collaboration platforms including provisioning integration and permission control Good-to-Have Skills Exposure to service mesh technologies e g Istio Linkerd and zero-trust network concepts Familiarity with secrets management platforms e g HashiCorp Vault AWS Secrets Manager Experience using incident response and chaos engineering tools e g Gremlin Chaos Mesh Background in cost optimization budgeting and resource tracking FinOps Awareness of policy-as-code frameworks e g OPA Kyverno Familiarity with feature flagging and progressive delivery tools e g LaunchDarkly Argo Rollouts Integration experience with ticketing and change management platforms e g ServiceNow Jira Understanding of compliance standards e g HIPAA GDPR SOC 2 and how they apply to infrastructure operations Understanding of security and encryption technologies and authentication protocols such as OpenID OIDC OAuth SAML and LDAP Professional Certifications Preferred Cloud DevOps Certification AWS Azure GCP Certified Kubernetes Administrator CKA or Security Specialist CKS CI CD Platform Certification ITIL Foundation or equivalent service management certification Soft Skills High level of ownership and accountability for platform reliability Strong diagnostic and analytical capabilities with a bias for action Clear and confident communicator with an ability to influence without authority Passion for automation operational excellence and team mentorship Shift Information This position is an onsite role and may require working during later hours to align with business hours Candidates must be willing and able to work outside of standard hours as required to meet business needs What you can expect of us As we work to develop treatments that take care of others we also work to care for your professional and personal growth and well-being From our competitive benefits to our collaborative culture we ll support your journey every step of the way In addition to the base salary Amgen offers competitive and comprehensive Total Rewards Plans that are aligned with local industry standards Apply now and make a lasting impact with the Amgen team careers amgen com As an organization dedicated to improving the quality of life for people around the world Amgen fosters an inclusive environment of diverse ethical committed and highly accomplished people who respect each other and live the Amgen values to continue advancing science to serve patients Together we compete in the fight against serious disease Amgen is an Equal Opportunity employer and will consider all qualified applicants for employment without regard to race color religion sex sexual orientation gender identity national origin protected veteran status disability status or any other basis protected by applicable law We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process to perform essential job functions and to receive other benefits and privileges of employment Please contact us to request accommodation

Java Software Engineer

4 weeks ago

Hyderabad, India Tata Consultancy Services Full time

TCS Hiring for Java Backend Development Role!!TCS presents an excellent opportunity for Java Backend DeveloperRole: Java Backend DeveloperDesired Experience Range: 5-10 yearsLocation: Chennai & HyderabadMode of Interview : VirtualDate : 29-10-2025 & 31-10-2025Desired Competencies : (Technical/Behavioural Competency)Must Have- ***Java / Springboot /...
Site Reliability Engineer

5 days ago

Hyderabad, India inTune Systems Inc Full time

Tittle : Sr. SRE/App Support Engineer Location Hyderabad Job Summary: We are looking for a Senior Site Reliability Engineer (SRE) to join our growing Engineering team. As an SRE, you will play a key role in ensuring the reliability, scalability, and performance of our production systems across a multi-cloud environment (GCP & AWS). You’ll be responsible...
Site Reliability Engineer

3 days ago

Hyderabad, India inTune Systems Inc Full time

Tittle : Sr. SRE/App Support Engineer Location Hyderabad Job Summary: We are looking for a Senior Site Reliability Engineer (SRE) to join our growing Engineering team. As an SRE, you will play a key role in ensuring the reliability, scalability, and performance of our production systems across a multi-cloud environment (GCP & AWS). You’ll be responsible...
Site Reliability Engineer

5 days ago

Hyderabad, India inTune Systems Inc Full time

Tittle : Sr. SRE/App Support EngineerLocation Hyderabad Job Summary: We are looking for a Senior Site Reliability Engineer (SRE) to join our growing Engineering team. As an SRE, you will play a key role in ensuring the reliability, scalability, and performance of our production systems across a multi-cloud environment (GCP & AWS). You’ll be responsible...
Site Reliability Engineer

5 days ago

hyderabad, India inTune Systems Inc Full time

Tittle : Sr. SRE/App Support EngineerLocation Hyderabad Job Summary: We are looking for a Senior Site Reliability Engineer (SRE) to join our growing Engineering team. As an SRE, you will play a key role in ensuring the reliability, scalability, and performance of our production systems across a multi-cloud environment (GCP & AWS). You’ll be responsible...
Site Reliability Engineer

3 days ago

Hyderabad, India inTune Systems Inc Full time

Tittle : Sr. SRE/App Support EngineerLocation HyderabadJob Summary: We are looking for a Senior Site Reliability Engineer (SRE) to join our growing Engineering team. As an SRE, you will play a key role in ensuring the reliability, scalability, and performance of our production systems across a multi-cloud environment (GCP & AWS). You’ll be responsible for...
▷ (29/10/2025) Site Reliability Engineer

4 weeks ago

Hyderabad, India Sonata Software Full time

Role:Site Reliability EngineerLocation:HyderabadNotice Period: Immediate to 20 DaysEmployment Type:Full TimeExperience- 7–12 years in site reliability, cloud-based data infrastructure, data pipeline observability, automation, and high-availability engineering within EdTech platforms (2U) - Primary Skills (Must-Have) - AWS, CI/CD, Jenkins, IAAC, Terraform,...
Senior Site Reliability Engineer

3 weeks ago

Hyderabad, India Insight Global, LLC Full time

Job Title : Sr. SREAbout the Company : Insight Globals ClientType : Ongoing EOR, depending on experience levelLocation : ONSITE 4X/WEEK in HITEC City, Hyderabad, INPriority scheduling for candidates who : - Submit resume promptly- Are available for immediate interviews- Connect via LinkedIn with resume and CTC rateRequirements : - Ability to be onsite...
Sr Engineer, Site Reliability

5 days ago

Hyderabad, Telangana, India TMUS Global Solutions Full time

About TMUS Global Solutions T-Mobile is Americas supercharged Un-carrier challenging conventions and setting new standards in wireless With the nations largest and fastest 5G network T-Mobile delivers advanced connectivity and unmatched value to millions across the U S Were unwaveringly obsessed with providing the best possible service experience driven by a...
▷ [31/10/2025] Director Security

3 weeks ago

Hyderabad, Telangana, India Amgen Full time

Career Category Safety HOW MIGHT YOU DEFY IMAGINATION If you feel like you re part of something bigger it s because you are At Amgen our shared mission to serve patients drives all that we do It is key to our becoming one of the world s leading biotechnology companies We are global collaborators who achieve together researching manufacturing and delivering...

Americas

Europe

Asia / Oceania

Africa

(31/10/2025) Sr. Site Reliability Engineer