Lead Engineer – Site Reliability Engineer

4 weeks ago


Bengaluru, Karnataka, India Elanco Full time

Job Description:

Role Title: Lead Engineer – Site Reliability Engineer

Team: Software Engineering and Platforms

Supervisor: Software Engineering Director

Career Progression: Engineering, Architecture

Position Description:

Historically, the role of IT has been to provide a reliable ecosystem to run the business, drive efficiencies and reduce costs. These areas remain integral, however, driven by the quickening pace

of innovation, IT must evolve, proactively partnering with the business to enable new digital

business models that power new types of customer engagement.

At Elanco, our engineer roles bring adaptive set of skills covering Software-as-a-Service (SaaS),

Commercial-of-the-Shelf (CotS) and/or Custom Developed applications. The role is part of our

software engineering team established to deliver Engineering expertise to business facing products

and services. As an Engineer you will be deployed into a multi-disciplined product team applying

your software engineering talent to Elanco's biggest opportunities.

To be successful in an engineering role in Elanco requires a highly motivated individual, with an

innovative mindset and a willingness to drive tangible outcomes. The individual must be able to

articulate complex technical topics and collaborate with the internal engineering organisation to

improve engineering across the enterprise.

The Role

We are seeking a skilled and motivated engineer, passionate about improving application reliability

across our enterprise. As part of our Platform Engineering organization, you will join a product team focused on a suite of capabilities designed to enhance all aspects of our engineering portfolio. In this role, you will be primarily accountable for configuring and operating our observability toolset. You will also lead the charge across the enterprise, driving the transition from reactive to proactive application support.

This is a fantastic opportunity to join a growing engineering team with the scope to partner across

our entire enterprise of products. Your contributions will help ensure that everything we deliver to

our customers come with top-notch reliability as standard.

Typical responsibilities:


• Help define Elanco's approach to reliability of applications partnering with our product manager for our portfolio health products.


• Collaborate with stakeholders such as product and platform owners, to define service level objectives (SLOs), and service-level indicators (SLIs) for system operations focused on the critical features of the customers journey and experience.


• Assist and coach product teams implementation of telemetry against SLIs/SLOs to ensure

adequate traceability is in place.


• Track and manage reliability performance against agreed SLOs, in partnership with product

teams or other stakeholders, and ensure systems continue to meet SLOs over time.


• Ensure key stakeholders, product owners, and platform owners are informed of reliability

concerns and their potential impact to the customers experience.


• Provide expert knowledge on reliability approaches, to ensure our organization achieves its

goals and roadmap for reliability.


• Champion reliability being treated as a feature in products and platforms and promote the concept across all phases of the software development life cycle.


• Create dashboards and reports to communicate key metrics, to product teams and key stakeholders.


• Beyond observability engage in initiatives across the product line including cost, security, and adoption helping the team drive to a health portfolio throughout an applications lifecycle.


• Participate problem management activities, including post-mortem incident analysis, and provision of technical insight, documented findings, outcomes and recommendations as part of a root cause analysis to troubleshoot priority incidents.


• Implement automation to reduce probability and/or impact of problems recurring and

target 'self-healing' through automation of reoccurring incidents.


• For critical applications, utilize practices such as chaos engineering and performance engineering to test in preproduction environments. This includes disaster recovery (DR)

testing, performance testing, and tabletop planning exercises.


• Participate and exert influence in organizational learning initiatives such as communities of practice to share knowledge and foster a continuous learning and improvement mindset.


• Support architects working on new solutions, including analyzing requirements, supporting

technical architecture activities, prototyping, designing and developing reusable

infrastructure artifacts, testing, implementing, and preparing for ongoing support.


• Train and mentor junior and engineers to ensure SRE best practices evolve and scale successfully in the organization


• Partner with the product manager of portfolio health to build out golden paths, education

and services to 'package' the capability in a consumable way on our developer portal.


• Be a product team champion extending into product teams helping to deliver foundational

platform engineering capabilities where applicable.


• Partner with compliance teams to ensure the data we bring into observability platforms meets privacy and compliance standards


• Maintain consistent standards and set out a taxonomy of telemetry to enable future opportunities including leveraging of AI capability.

Basic Qualifications:


• Experience in some of the following areas essential.


• 10-15 years of hands-on engineering experience.


• 5 years' experience in Platform Engineering, SRE or similar role


• 5-10 years of experience working with modern application architecture methodologies (Service Orientated Architecture, API-Centric Design, Twelve-Factor App, FAIR, etc.).


• 5 + years of experience working with Cloud Native design patterns, with a

preference towards Microsoft Azure / Google Cloud.


• 5 + years of experience designing and delivering digital solutions following a product-mindset and a variety of delivery methodologies (e.g. Agile, CCPM, etc.).


• 5 + years of experience working within a "DevSecOps" culture, including modern software development practices, covering Continuous Integration and Continuous Delivery (CI/CD), Test-Driven Development (TDD), etc.


• Experience with enterprise observability platforms. E.g Datadog, New Relic


• Experience with monitoring 3rd party and SaaS applications.


• Experience establishing standards around MELT (Metrics, Events, Logging and Tracing and implementing at an enterprise level.


• Experience with Open Telemetry advantageous.


• Experience supporting digital platforms, including Integrations, Release Management, Regression Testing, Integrations, Data Obfuscation, etc.


• Experience scaling an "API-Ecosystem", designing, and implementing "API-First" integration patterns.


• Experience working with authentication and authorisation protocols/patterns.


• Experience defining and implementing large-scale, transformative digital solutions.


• Demonstrated influence and communication skills across all levels of IT and third parties.


• Experience working in complex, diverse landscapes (business, technology, regulatory, partners, providers, geographies, etc.).


• Strong organizational and communications skills with multiple examples of being able to convey complex technical topics, that resulted in a definitive direction.

Education Requirements: Bachelor's degree in information technology.

Other Information: Occasional travel may be required.

Elanco is an EEO/Affirmative Action Employer and does not discriminate on the basis of age, race, color, religion, gender, sexual orientation, gender identity, gender expression, national origin, protected veteran status, disability or any other legally protected status



  • Bengaluru, Karnataka, India Cisco Full time

    About the RoleCisco's Site Reliability Engineering team is seeking a skilled engineer to lead efforts in optimizing cloud expenditures, streamlining infrastructure management, and ensuring efficient resource utilization. As a Senior Site Reliability Engineer, you will be responsible for designing and implementing scalable solutions to drive efficiency and...


  • Bengaluru, Karnataka, India Wipro Full time

    We are looking for a skilled Senior Site Reliability Engineer to join our team at Wipro.The successful candidate will have a minimum of 10 years of experience in site reliability engineering principles, including SLO, SLI, SLA, error budgets, and eliminating toil via automation.You will work closely with our development teams to design and implement...


  • Bengaluru, Karnataka, India Applexus Technologies Full time

    Company OverviewApplexus Technologies is a leading technology company that specializes in delivering innovative solutions to clients across various industries.Why Join Us?As a Site Reliability Engineer at Applexus Technologies, you will have the opportunity to work with a talented team of engineers and contribute to the development of scalable and reliable...


  • Bengaluru, Karnataka, India CORTEX Consultants Full time

    Site Reliability Engineer Experience Required : 5 Years Work Timing : 2 : 00 pm to 10 : 00 pm (Bangalore based candidates only) About the Role : Highly skilled Site Reliability Engineer with 5 years of experience. We are seeking a Senior Site Reliability Engineer to collaborate with software engineering teams, focusing on building and securing cloud...


  • Bengaluru, Karnataka, India NatWest Group Full time

    Company Overview:The NatWest Group is a leading financial services provider dedicated to delivering innovative solutions. Our team is passionate about creating a seamless customer experience.As a Site Reliability Engineer, you will play a key role in supporting the improvement of non-functional and operational characteristics. You will work closely with...


  • Bengaluru, Karnataka, India Pocket FM Full time

    Job Title: Lead Site Reliability Engineer (SRE)Location: BangaloreExperience: 6+ years (with at least 2 years in a leadership/SRE-specific role)Employment Type: Full-timeAbout Pocket FMPocket FM is India's largest audio OTT platform delivering high-quality, engaging, and vernacular audio content. With millions of users and a growing global footprint, we're...


  • Bengaluru, Karnataka, India Abha Engineer Full time

    We are looking for a Senior Mechanical Engineer Roles are described below. 1. Manpower Planning. 2. Preparing of Project Cost. 3. Schedule wise work execution. 4. As Drawing & quality work execution. 5. Client & Third Party Manage. 6. Working Team Manage & Review. 7. Reporting to Management. 8. ROB & FOB Fabrication & Erection Work Knowledge.


  • Bengaluru, Karnataka, India Athenahealth Technology Private Limited Full time

    Athenahealth Technology Private Limited is a leading healthcare technology company that aims to create a thriving ecosystem that delivers accessible, high-quality, and sustainable healthcare for all. We are seeking a skilled Site Reliability Engineer to join our Cloud Infrastructure Engineering division.About UsWe are a vibrant and talented team of engineers...


  • Bengaluru, Karnataka, India Intuition IT – Intuitive Technology Recruitment Full time

    Job SummaryWe are seeking an experienced Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and performance of our products and services. This includes tracking and reducing toil, defining SLIs, SLOs, and defining error budgets that support finding the right balance between risk...


  • Bengaluru, Karnataka, India Chevron Full time

    Total Number of Openings 1 About the position: We are seeking a T-shaped dynamic Lead Site Reliability Engineer to lead and provide end-to-end solution support for our digital platforms and help us achieve higher returns and lower carbon goals. Key responsibilities: Drive reliability and systemic improvements across the subsurface tool set to support a...


  • Bengaluru, Karnataka, India LTIMindtree Limited Full time

    We are seeking a dynamic and versatile Lead Site Reliability Engineer to lead and provide end-to-end solution support for our digital platforms. This role involves conceiving and developing presentations that are logical, well-written, and concise.Key ResponsibilitiesConceive and develop presentations to a peer groupLead and provide end-to-end solution...


  • Bengaluru, Karnataka, India LTIMindtree Limited Full time

    We are seeking a T-shaped dynamic Lead Site Reliability Engineer to lead end-to-end solution support for our digital platforms. The successful candidate will be responsible for providing critical thinking, self-motivation, and excellent communication skills.Job DescriptionThe role of a Site Reliability Engineer is to ensure the stability, scalability, and...


  • Bengaluru, Karnataka, India Applexus Technologies Full time

    About the RoleAs a Site Reliability Engineering (SRE) Manager at Applexus Technologies, you will be responsible for building and leading a high-performing team of software engineers to drive reliability, scalability, and performance in our distributed systems. Your primary role will involve managing employees, developing technical knowledge on distributed...


  • Bengaluru, Karnataka, India Outcomes® Full time

    About the Role:As a Site Reliability Engineer at Outcomes®, you will serve as a vital link between our software development and Dev Ops operations teams, applying a software engineering mindset to bridge these two worlds.This role combines daily operations and script development to improve site reliability and performance. These scripts or utilities should...


  • Bengaluru, Karnataka, India Reuters Full time

    About the RoleWe are seeking a skilled Site Reliability Engineer to join our team at Reuters. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and performance of our cloud-based systems.Key ResponsibilitiesInvestigate and resolve technical issues related to system reliability and performance.Collaborate with...


  • Bengaluru, Karnataka, India CORTEX Consultants Full time

    Site Reliability EngineerExperience Required : 5+ YearsWork Timing : 2 : 00 pm to 10 : 00 pm (Bangalore based candidates only)About the Role : Highly skilled Site Reliability Engineer with 5+ years of experience. We are seeking a Senior Site Reliability Engineer to collaborate with software engineering teams, focusing on building and securing cloud...


  • Bengaluru, Karnataka, India CORTEX Consultants Full time

    Site Reliability EngineerExperience Required : 5+ YearsWork Timing : 2 : 00 pm to 10 : 00 pm (Bangalore based candidates only)About the Role : Highly skilled Site Reliability Engineer with 5+ years of experience. We are seeking a Senior Site Reliability Engineer to collaborate with software engineering teams, focusing on building and securing cloud...


  • Bengaluru, Karnataka, India Dexian Full time

    About the job : Job Role : Site Reliability Engineer Location : Bangalore- Hybrid Notice period : Immediate or currently serving less than 30 days Experience : 5 years of relevance Primary Responsibilities : - Work with other Site Reliability Engineers to implement and maintain scalable, reliable, performant, and efficient systems. - Help the team to...


  • Bengaluru, Karnataka, India Outcomes® Full time

    About the Role As a Site Reliability Engineer, you will bridge the gap between software development and DevOps operations by applying a software engineering mindset. Key Responsibilities Split time between daily operations and developing scripts to improve site reliability and performance. Develop self-service tools for the Site Reliability Team. ...


  • Bengaluru, Karnataka, India Whitefield Careers Full time

    Overview : The Site Reliability Engineer (SRE) plays a vital role in bridging the gap between development and operations, utilizing a software engineering mindset to automate and enhance the reliability, scalability, and performance of the organization's infrastructure and applications. As a key contributor, the SRE ensures that services are available,...