▷ (Urgent) Senior Site Reliability Engineer

4 weeks ago


Bengaluru India GE HealthCare Full time

Job Description

Job Description Summary

The Senior Site Reliability Engineer will be responsible for performance and availability of Compute and Network infrastructure consumed by all business segments. The Site Reliability teams are composed of highly talented individuals obsessively focused with availability through operational excellence. The ideal individual is relentlessly technical, passionate for automating everything and totally committed to delivering amazing customer experiences.

GE HealthCare is a leading global medical technology and digital solutions innovator. Our purpose is to create a world where healthcare has no limits. Unlock your ambition, turn ideas into world-changing realities, and join an organization where every voice makes a difference, and every difference builds a healthier world.

Job Description

Roles and Responsibilities

In This Role, You Will

- Establish performance baseline, capacity thresholds, correlate events, and define monitoring/alerting criteria
- Develop automated solutions to address potential problems before they result in a service interruption
- Provide impact assessment and mitigation plan for changes going into the production environment
- Investigate root cause of severe and systemic outages, identify corrective actions and apply across the enterprise
- Develop availability measures that align with consumer experience to accurately assess the usability of crucial services
- Build capacity models to baseline transactional load compared to resource performance and leverage data to predict overall system capacity while automating load placement to avoid outages
- Identify thresholds for all critical links in the data path to quickly isolate where imbalances may result in potential outages
- Analyze failure points in services to model risk level and resolution steps if failure occurs.
- Assist in driving architecture enhancements into system to mitigate potential failure points.
- Programmatically monitor for and remediate configuration drift of critical devices
- Develop response plans to potential failure points and evaluate effectiveness during planned tests
- Perform comprehensive operational health checks of the entire services to identify areas of concern and track activities to drive improvements at all levels of the architecture
- Able to deliver well written Infrastructure as a code (TF or CF) and do code reviews for the peers for consistency across the board, able to write test case for the code and review the automation to enhance the capabilities as well as think holistically to design testing solutions.
- Have an experience working with tools like ALM tools would be a plus.
- Provide technical coaching and direction to more junior teammates

Required Qualifications

Bachelor's Degree in Computer Science or STEM Majors (Science, Technology, Engineering and Math) with at least years of experience 5-7 years

Desired Qualifications

- Excellent knowledge of common operating systems (Unix/Linux, Windows)
- Excellent knowledge of TCP/IP networking, and inter-networking technologies (routing/switching, proxy, firewall, load balancing etc.)
- Demonstrated experience scripting or developing software and services for the cloud Ruby, Python, Go, Java, Node.js, .NET, etc.
- Extensive Experience with Infrastructure Automation
- Experience using an automated configuration management system (Terraform, Chef, Puppet, Ansible, Salt, etc.)
- Experience deploying and managing infrastructure on public clouds such as AWS or Azure
- Experience with configuring, customizing, and extending monitoring tools (Datadog, Sensu, Grafana, Splunk, etc.)

We expect all employees to live and breathe our behaviours: to act with humility and build trust; lead with transparency; deliver with focus, and drive ownership always with unyielding integrity.

Our total rewards are designed to unlock your ambition by giving you the boost and flexibility you need to turn your ideas into world-changing realities. Our salary and benefits are everything you'd expect from an organization with global strength and scale, and you'll be surrounded by career opportunities in a culture that fosters care, collaboration and support.

Inclusion and Diversity

GE Healthcare is an Equal Opportunity Employer where inclusion matters. Employment decisions are made without regard to race, color, religion, national or ethnic origin, sex, sexual orientation, gender identity or expression, age, disability, protected veteran status or other characteristics protected by law.

We expect all employees to live and breathe our behaviors: to act with humility and build trust; lead with transparency; deliver with focus, and drive ownership always with unyielding integrity.

Our total rewards are designed to unlock your ambition by giving you the boost and flexibility you need to turn your ideas into world-changing realities. Our salary and benefits are everything you'd expect from an organization with global strength and scale, and you'll be surrounded by career opportunities in a culture that fosters care, collaboration and support.

Additional Information

Relocation Assistance Provided: Yes



  • , India, IN Sonata Software Full time

    We're Hiring: Senior Site Reliability Engineer Location: Onsite (Office: Hyderabad – Mandatory from Day 1) Employment Type: Full-time Notice Period: Immediate to 15 Days Only Experience: 8+ Years About the RoleWe’re looking for a Senior Site Reliability Engineer (SRE) to lead reliability initiatives across our production systems. This is a high-impact...


  • Bengaluru, India Chevron Full time

    Job Description About The Position The Global Capability Center (GCC) - IT Foundation Platform (ITFP) Network Product Line (NPL) responsible for supporting the Business Network ensuring cost competitive, reliable, and secure operations of Chevron's Network environment globally while also enabling digital capabilities. Products managed include all Business...


  • Bengaluru, Karnataka, India, Karnataka WhiteLotus Talent Partners Full time

    We are looking for a L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes. In this role, you will focus on monitoring, basic troubleshooting, and incident response, helping to maintain high system...


  • Bengaluru, India WhiteLotus Talent Partners Full time

    We are looking for a L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes. In this role, you will focus on monitoring, basic troubleshooting, and incident response, helping to maintain high system...

  • Senior/expert site

    1 week ago


    India IVedha Inc. Full time

    Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering Practice Location: India (Remote) -Must be available to work in the EST (US/Canada) Time Zone. Role Summary:Are you a Senior Site Reliability Engineer (SRE) with deep ELK expertise, ready to take ownership of large-scale observability infrastructure?We're looking for an SRE with 7+...


  • Bengaluru, Karnataka, India Josys Full time

    Senior Site Reliability Engineer (SRE)About JOSYSJosys, a dynamic B2B SaaS platform startup, has embarked on a mission to revolutionize IT operations globally, following an exceptional launch in Japan and securing $125 million in Series A and B funding. Our platform enables businesses to conquer the complexities of work-from-anywhere setups, rapid digital...


  • India Akamai Technologies Full time

    Job Description Job Description Do you have the passion to architect and lead the next generation of public cloud infrastructure Would you like to lead modernization initiatives while building a public cloud platform from scratch Join our IaaS Site Reliability Engineering (SRE) team. We design, develop, and operate infrastructure and services that power...


  • India Akamai Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Would you enjoy improving stability and safety of one of the largest global networks?Would you enjoy hands-on network operations work on a global scale to improve our operational efficiency?Join the Platform Cloud Services Engineering TeamThe Platform Cloud Services SRE team supports globally distributed hosting and database systems for Akamai. These systems...


  • Pune, India Barclays Full time

    Job Description Step into the role of Senior Site Reliability Engineer. At Barclays, we are more than a bank we are a force for progress. You will be the part of the central SRE (Site Reliability Engineer) core team within our wider Infrastructure team. You will act as a centre of excellence providing hands on consultancy to our different infrastructure...


  • India Sapaad Full time

    WHO WE ARE Sapaad is a global leader in unified commerce platforms, delivering world-class software solutions for the food and beverage industry. Our flagship product, also named Sapaad, has achieved remarkable success over the past decade, empowering thousands of F& B businesses across 40+ countries —with many more coming onboard each day. Driven by a...