▷ 15h Left Lead Site Reliability Engineer

4 weeks ago


Bengaluru India Optum Full time

Job Description

Optum is a global organization that delivers care, aided by technology to help millions of people live healthier lives. The work you do with our team will directly improve health outcomes by connecting people with the care, pharmacy benefits, data and resources they need to feel their best. Here, you will find a culture guided by inclusion, talented peers, comprehensive benefits and career development opportunities. Come make an impact on the communities we serve as you help us advance health optimization on a global scale. Join us to start Caring. Connecting. Growing together.

What makes this a one of kind opportunity We have more than 12,000 technology colleagues serving the IT needs of our clients across the globe and our own Fortune 6 IT needs. At Optum, you'll be encouraged to combine your passion and technical expertise to help us shape the health care system for years to come. You'll help change the way our businesses and consumers engage with technology across a wide platform of health services and delivery systems by setting team goals, forecasting resource needs, and guiding solutions developed to solve business and operational challenges. If you're out to make a difference, apply today.

Medicare & Retirement (M&R) | Community and State | Individual and Family Plan - Technology Operations needs an experienced Senior Site Reliability Engineer (SRE) acting as a bridge between software engineering and IT operations. The primary goal of this role is to maintain software applications/Infrastructure that are reliable, scalable, resilient and to improve performance and operational efficiency along with ensuring all business-critical products having implemented right tools and executed exercise to validate system availability, latency, performance, efficiency, monitoring, incident priority, and capacity planning. This role will enable Government Programs (M&R, C&S and IFP) Technology Operations to meet our business segment's needs as an IT partner and advocate.

Primary Responsibilities:

- Defining and setting up best industry alert and monitoring practices across line of business and design/architect efficient monitoring dashboards on Splunk/Dynatrace /Grafana common for all applications/products across line of business
- Participating in 5-9 program and other peak season readiness initiatives and collaboration with application teams evaluating applications from resiliency, availability, and reliability perspective
- Act as a gatekeeper for changes rolling into production
- Embrace continuous learning of engineering practices to ensure industry best practices and technology adoption, including DevOps, Cloud and Agile thinking
- Tech debt reduction/Tech transformation including opensource/inner source adoption, Cloud adoption, HCP assessment and adoption
- Improve processes/runbooks and lead automation efforts of any manual items around support cutting down manual toil
- Participate in on-call rotation
- Improve operational tooling, frameworks, perform chaos engineering activities
- Respond to platform emergencies, alerts, and escalations from Customer Support
- Comply with the terms and conditions of the employment contract, company policies and procedures, and any and all directives (such as, but not limited to, transfer and/or re-assignment to different work locations, change in teams and/or work shifts, policies in regards to flexibility of work benefits and/or work environment, alternative work arrangements, and other decisions that may arise due to the changing business environment). The Company may adopt, vary or rescind these policies and directives in its absolute discretion and without any limitation (implied or otherwise) on its ability to do so

Required Qualifications:

- Undergraduate degree or equivalent experience
- 10+ years of experience in IT industry across entire SDLC
- 5+ years of experience in integrating monitoring and alerting into cloud software solutions
- 3+ years of coding experience with one or more of the follow languages Java, C#, C/C++, Go, Python, Perl, PowerShell or JavaScript with a willingness and ability to learn new ones
- 3+ years of experience in Splunk / Dynatrace / DataDog/Grafana/ Telemetry or similar for monitoring tools
- 2+ years of experience building and programmatically consuming REST APIs
- ServiceNow experience
- Work experience as a Site Reliability Engineer or similar role
- Experience with any database
- Experience in operations support for any application
- Experience with programmatic interaction with a relational database SQL Server/MySQL/PostgreSQL
- Experience planning and supporting 99.999% availability against critical applications in production
- Knowledge of any scripting or programming language
- Solid understanding of engineering fundamentals: unit testing, performance testing, code reviews, telemetry, agile and DevOps
- Solid understanding of: continuous integration / continuous delivery tools, serverless architecture, containerization, public / private cloud, application observability and/or messaging / stream architecture
- Technical writing skills (creating flow diagrams, end user documentation, etc)
- Proven ability to communicate effectively to both technical and non-technical, globally distributed audiences

At UnitedHealth Group, our mission is to help people live healthier lives and make the health system work better for everyone. We believe everyone-of every race, gender, sexuality, age, location and income-deserves the opportunity to live their healthiest life. Today, however, there are still far too many barriers to good health which are disproportionately experienced by people of color, historically marginalized groups and those with lower incomes. We are committed to mitigating our impact on the environment and enabling and delivering equitable care that addresses health disparities and improves health outcomes - an enterprise priority reflected in our mission.



  • India Sapaad Full time

    WHO WE ARE Sapaad is a global leader in unified commerce platforms, delivering world-class software solutions for the food and beverage industry. Our flagship product, also named Sapaad, has achieved remarkable success over the past decade, empowering thousands of F&B businesses across 40+ countries—with many more coming onboard each day. Driven by a...


  • Bengaluru, India Landmark Group Full time

    Job Description Job Title: SRE Lead (Engineering & Reliability) Job Summary: We are seeking an experienced and dynamic Site Reliability Engineering (SRE) Lead to oversee the reliability, scalability, and performance of our critical systems. As an SRE Lead, you will play a pivotal role in establishing and implementing SRE practices, leading a team of...


  • Noida, India Thales Full time

    Job Description Location: Noida, India Thales people architect identity management and data protection solutions at the heart of digital security. Business and governments rely on us to bring trust to the billons of digital interactions they have with people. Our technologies and services help banks exchange funds, people cross borders, energy become...


  • Bengaluru, India Groww Full time

    Job Description About Groww We are a passionate group of people focused on making financial services accessible to every Indian through a multi-product platform. Each day, we help millions of customers take charge of their financial journey. Customer obsession is in our DNA. Every product, every design, every algorithm down to the tiniest detail is...


  • Hyderabad, India Chase Bank Full time

    Job Description Assume a critical role in defining the future of a globally recognized firm and have a direct and significant effect in a realm tailored for top achievers in site reliability. As a Lead Site Reliability Engineer at JPMorgan Chase within the Consumer & Community Banking, youhold a leadership role in your team, demonstrate strong knowledge...


  • Bengaluru, India Freshworks Full time

    Job DescriptionOverview At Freshworks, uptime is sacred. As a Lead Site Reliability Engineer (SRE), you'll be the engineer behind the curtain—designing for resilience, automating recovery, and ensuring our systems stay fast, stable, and observable at scale. You’ll partner closely with engineering, platform, and product teams to shift reliability left and...


  • Bengaluru, India Freshworks Full time

    Job DescriptionOverview At Freshworks, uptime is sacred. As a Lead Site Reliability Engineer (SRE), you'll be the engineer behind the curtain—designing for resilience, automating recovery, and ensuring our systems stay fast, stable, and observable at scale. You’ll partner closely with engineering, platform, and product teams to shift reliability left...

  • 15h Left! Dynamisch

    7 days ago


    Pune, India Dynamisch Full time

    Job Description Job Title : DevOps & Site Reliability Engineer Experience : 4+ Yrs Qualification : B.E./ B.Tech/ M.E./M.SC IT / MCA Duties And Responsibilities - Engage, Improve, develop, measure, and implement processes and tools for Continues Integrations and Delivery, Site Reliability Engineering, and automation of deployment and support of products into...


  • Bengaluru, India Booking Holdings Full time

    Job Description Key Job Responsibilities and Duties: The core premise for the Booking SRE lies in treating operational and reliability problems of software systems as a software engineering problem. We code our way out of problems where operations are concerned addressing availability, scalability, latency, and efficiency challenges within the vast...

  • Minfy Technologies

    2 weeks ago


    Bengaluru, India Minfy Technologies Private Limited Full time

    Job Summary We are seeking a strategic and technically proficient Head of Site Reliability Engineering (SRE) to lead the design, implementation, and scaling of our reliability, observability, and operational practices. As the Head of SRE, you will play a critical role in ensuring our systems are highly available, scalable, and performant while maintaining a...