Lead Engineer – Site Reliability Engineer
4 weeks ago
Job Description:
Role Title: Lead Engineer – Site Reliability Engineer
Team: Software Engineering and Platforms
Supervisor: Software Engineering Director
Career Progression: Engineering, Architecture
Position Description:
Historically, the role of IT has been to provide a reliable ecosystem to run the business, drive efficiencies and reduce costs. These areas remain integral, however, driven by the quickening pace
of innovation, IT must evolve, proactively partnering with the business to enable new digital
business models that power new types of customer engagement.
At Elanco, our engineer roles bring adaptive set of skills covering Software-as-a-Service (SaaS),
Commercial-of-the-Shelf (CotS) and/or Custom Developed applications. The role is part of our
software engineering team established to deliver Engineering expertise to business facing products
and services. As an Engineer you will be deployed into a multi-disciplined product team applying
your software engineering talent to Elanco's biggest opportunities.
To be successful in an engineering role in Elanco requires a highly motivated individual, with an
innovative mindset and a willingness to drive tangible outcomes. The individual must be able to
articulate complex technical topics and collaborate with the internal engineering organisation to
improve engineering across the enterprise.
The Role
We are seeking a skilled and motivated engineer, passionate about improving application reliability
across our enterprise. As part of our Platform Engineering organization, you will join a product team focused on a suite of capabilities designed to enhance all aspects of our engineering portfolio. In this role, you will be primarily accountable for configuring and operating our observability toolset. You will also lead the charge across the enterprise, driving the transition from reactive to proactive application support.
This is a fantastic opportunity to join a growing engineering team with the scope to partner across
our entire enterprise of products. Your contributions will help ensure that everything we deliver to
our customers come with top-notch reliability as standard.
Typical responsibilities:
• Help define Elanco's approach to reliability of applications partnering with our product manager for our portfolio health products.
• Collaborate with stakeholders such as product and platform owners, to define service level objectives (SLOs), and service-level indicators (SLIs) for system operations focused on the critical features of the customers journey and experience.
• Assist and coach product teams implementation of telemetry against SLIs/SLOs to ensure
adequate traceability is in place.
• Track and manage reliability performance against agreed SLOs, in partnership with product
teams or other stakeholders, and ensure systems continue to meet SLOs over time.
• Ensure key stakeholders, product owners, and platform owners are informed of reliability
concerns and their potential impact to the customers experience.
• Provide expert knowledge on reliability approaches, to ensure our organization achieves its
goals and roadmap for reliability.
• Champion reliability being treated as a feature in products and platforms and promote the concept across all phases of the software development life cycle.
• Create dashboards and reports to communicate key metrics, to product teams and key stakeholders.
• Beyond observability engage in initiatives across the product line including cost, security, and adoption helping the team drive to a health portfolio throughout an applications lifecycle.
• Participate problem management activities, including post-mortem incident analysis, and provision of technical insight, documented findings, outcomes and recommendations as part of a root cause analysis to troubleshoot priority incidents.
• Implement automation to reduce probability and/or impact of problems recurring and
target 'self-healing' through automation of reoccurring incidents.
• For critical applications, utilize practices such as chaos engineering and performance engineering to test in preproduction environments. This includes disaster recovery (DR)
testing, performance testing, and tabletop planning exercises.
• Participate and exert influence in organizational learning initiatives such as communities of practice to share knowledge and foster a continuous learning and improvement mindset.
• Support architects working on new solutions, including analyzing requirements, supporting
technical architecture activities, prototyping, designing and developing reusable
infrastructure artifacts, testing, implementing, and preparing for ongoing support.
• Train and mentor junior and engineers to ensure SRE best practices evolve and scale successfully in the organization
• Partner with the product manager of portfolio health to build out golden paths, education
and services to 'package' the capability in a consumable way on our developer portal.
• Be a product team champion extending into product teams helping to deliver foundational
platform engineering capabilities where applicable.
• Partner with compliance teams to ensure the data we bring into observability platforms meets privacy and compliance standards
• Maintain consistent standards and set out a taxonomy of telemetry to enable future opportunities including leveraging of AI capability.
Basic Qualifications:
• Experience in some of the following areas essential.
• 10-15 years of hands-on engineering experience.
• 5 years' experience in Platform Engineering, SRE or similar role
• 5-10 years of experience working with modern application architecture methodologies (Service Orientated Architecture, API-Centric Design, Twelve-Factor App, FAIR, etc.).
• 5 + years of experience working with Cloud Native design patterns, with a
preference towards Microsoft Azure / Google Cloud.
• 5 + years of experience designing and delivering digital solutions following a product-mindset and a variety of delivery methodologies (e.g. Agile, CCPM, etc.).
• 5 + years of experience working within a "DevSecOps" culture, including modern software development practices, covering Continuous Integration and Continuous Delivery (CI/CD), Test-Driven Development (TDD), etc.
• Experience with enterprise observability platforms. E.g Datadog, New Relic
• Experience with monitoring 3rd party and SaaS applications.
• Experience establishing standards around MELT (Metrics, Events, Logging and Tracing and implementing at an enterprise level.
• Experience with Open Telemetry advantageous.
• Experience supporting digital platforms, including Integrations, Release Management, Regression Testing, Integrations, Data Obfuscation, etc.
• Experience scaling an "API-Ecosystem", designing, and implementing "API-First" integration patterns.
• Experience working with authentication and authorisation protocols/patterns.
• Experience defining and implementing large-scale, transformative digital solutions.
• Demonstrated influence and communication skills across all levels of IT and third parties.
• Experience working in complex, diverse landscapes (business, technology, regulatory, partners, providers, geographies, etc.).
• Strong organizational and communications skills with multiple examples of being able to convey complex technical topics, that resulted in a definitive direction.
Education Requirements: Bachelor's degree in information technology.
Other Information: Occasional travel may be required.
Elanco is an EEO/Affirmative Action Employer and does not discriminate on the basis of age, race, color, religion, gender, sexual orientation, gender identity, gender expression, national origin, protected veteran status, disability or any other legally protected status
-
Site Reliability Engineering Lead
2 days ago
Bengaluru, Karnataka, India Cisco Full timeAbout the RoleCisco's Site Reliability Engineering team is seeking a skilled engineer to lead efforts in optimizing cloud expenditures, streamlining infrastructure management, and ensuring efficient resource utilization. As a Senior Site Reliability Engineer, you will be responsible for designing and implementing scalable solutions to drive efficiency and...
-
Site Reliability Engineering Lead
3 days ago
Bengaluru, Karnataka, India Wipro Full timeWe are looking for a skilled Senior Site Reliability Engineer to join our team at Wipro.The successful candidate will have a minimum of 10 years of experience in site reliability engineering principles, including SLO, SLI, SLA, error budgets, and eliminating toil via automation.You will work closely with our development teams to design and implement...
-
Site Reliability Engineering Lead
2 days ago
Bengaluru, Karnataka, India Applexus Technologies Full timeCompany OverviewApplexus Technologies is a leading technology company that specializes in delivering innovative solutions to clients across various industries.Why Join Us?As a Site Reliability Engineer at Applexus Technologies, you will have the opportunity to work with a talented team of engineers and contribute to the development of scalable and reliable...
-
Site Reliability Engineer
3 weeks ago
Bengaluru, Karnataka, India CORTEX Consultants Full timeSite Reliability Engineer Experience Required : 5 Years Work Timing : 2 : 00 pm to 10 : 00 pm (Bangalore based candidates only) About the Role : Highly skilled Site Reliability Engineer with 5 years of experience. We are seeking a Senior Site Reliability Engineer to collaborate with software engineering teams, focusing on building and securing cloud...
-
Site Reliability Engineer
6 days ago
Bengaluru, Karnataka, India NatWest Group Full timeCompany Overview:The NatWest Group is a leading financial services provider dedicated to delivering innovative solutions. Our team is passionate about creating a seamless customer experience.As a Site Reliability Engineer, you will play a key role in supporting the improvement of non-functional and operational characteristics. You will work closely with...
-
Lead Site Reliability Engineer
6 days ago
Bengaluru, Karnataka, India Pocket FM Full timeJob Title: Lead Site Reliability Engineer (SRE)Location: BangaloreExperience: 6+ years (with at least 2 years in a leadership/SRE-specific role)Employment Type: Full-timeAbout Pocket FMPocket FM is India's largest audio OTT platform delivering high-quality, engaging, and vernacular audio content. With millions of users and a growing global footprint, we're...
-
Senior Mechanical Engineer
7 days ago
Bengaluru, Karnataka, India Abha Engineer Full timeWe are looking for a Senior Mechanical Engineer Roles are described below. 1. Manpower Planning. 2. Preparing of Project Cost. 3. Schedule wise work execution. 4. As Drawing & quality work execution. 5. Client & Third Party Manage. 6. Working Team Manage & Review. 7. Reporting to Management. 8. ROB & FOB Fabrication & Erection Work Knowledge.
-
Site Reliability Engineer Lead
1 week ago
Bengaluru, Karnataka, India Athenahealth Technology Private Limited Full timeAthenahealth Technology Private Limited is a leading healthcare technology company that aims to create a thriving ecosystem that delivers accessible, high-quality, and sustainable healthcare for all. We are seeking a skilled Site Reliability Engineer to join our Cloud Infrastructure Engineering division.About UsWe are a vibrant and talented team of engineers...
-
Site Reliability Engineering Lead
1 week ago
Bengaluru, Karnataka, India Intuition IT – Intuitive Technology Recruitment Full timeJob SummaryWe are seeking an experienced Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and performance of our products and services. This includes tracking and reducing toil, defining SLIs, SLOs, and defining error budgets that support finding the right balance between risk...
-
Lead Site Reliability Engineer
3 days ago
Bengaluru, Karnataka, India Chevron Full timeTotal Number of Openings 1 About the position: We are seeking a T-shaped dynamic Lead Site Reliability Engineer to lead and provide end-to-end solution support for our digital platforms and help us achieve higher returns and lower carbon goals. Key responsibilities: Drive reliability and systemic improvements across the subsurface tool set to support a...
-
Lead Site Reliability Engineer
6 days ago
Bengaluru, Karnataka, India LTIMindtree Limited Full timeWe are seeking a dynamic and versatile Lead Site Reliability Engineer to lead and provide end-to-end solution support for our digital platforms. This role involves conceiving and developing presentations that are logical, well-written, and concise.Key ResponsibilitiesConceive and develop presentations to a peer groupLead and provide end-to-end solution...
-
Site Reliability Engineer Leader
2 weeks ago
Bengaluru, Karnataka, India LTIMindtree Limited Full timeWe are seeking a T-shaped dynamic Lead Site Reliability Engineer to lead end-to-end solution support for our digital platforms. The successful candidate will be responsible for providing critical thinking, self-motivation, and excellent communication skills.Job DescriptionThe role of a Site Reliability Engineer is to ensure the stability, scalability, and...
-
Site Reliability Engineering Team Lead
2 days ago
Bengaluru, Karnataka, India Applexus Technologies Full timeAbout the RoleAs a Site Reliability Engineering (SRE) Manager at Applexus Technologies, you will be responsible for building and leading a high-performing team of software engineers to drive reliability, scalability, and performance in our distributed systems. Your primary role will involve managing employees, developing technical knowledge on distributed...
-
Site Reliability Engineer Lead
2 weeks ago
Bengaluru, Karnataka, India Outcomes® Full timeAbout the Role:As a Site Reliability Engineer at Outcomes®, you will serve as a vital link between our software development and Dev Ops operations teams, applying a software engineering mindset to bridge these two worlds.This role combines daily operations and script development to improve site reliability and performance. These scripts or utilities should...
-
Site Reliability Engineer Lead
3 days ago
Bengaluru, Karnataka, India Reuters Full timeAbout the RoleWe are seeking a skilled Site Reliability Engineer to join our team at Reuters. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and performance of our cloud-based systems.Key ResponsibilitiesInvestigate and resolve technical issues related to system reliability and performance.Collaborate with...
-
Site Reliability Engineer
7 days ago
Bengaluru, Karnataka, India CORTEX Consultants Full timeSite Reliability EngineerExperience Required : 5+ YearsWork Timing : 2 : 00 pm to 10 : 00 pm (Bangalore based candidates only)About the Role : Highly skilled Site Reliability Engineer with 5+ years of experience. We are seeking a Senior Site Reliability Engineer to collaborate with software engineering teams, focusing on building and securing cloud...
-
Site Reliability Engineer
3 days ago
Bengaluru, Karnataka, India CORTEX Consultants Full timeSite Reliability EngineerExperience Required : 5+ YearsWork Timing : 2 : 00 pm to 10 : 00 pm (Bangalore based candidates only)About the Role : Highly skilled Site Reliability Engineer with 5+ years of experience. We are seeking a Senior Site Reliability Engineer to collaborate with software engineering teams, focusing on building and securing cloud...
-
Site Reliability Engineer
3 weeks ago
Bengaluru, Karnataka, India Dexian Full timeAbout the job : Job Role : Site Reliability Engineer Location : Bangalore- Hybrid Notice period : Immediate or currently serving less than 30 days Experience : 5 years of relevance Primary Responsibilities : - Work with other Site Reliability Engineers to implement and maintain scalable, reliable, performant, and efficient systems. - Help the team to...
-
Site Reliability Engineer
2 weeks ago
Bengaluru, Karnataka, India Outcomes® Full timeAbout the Role As a Site Reliability Engineer, you will bridge the gap between software development and DevOps operations by applying a software engineering mindset. Key Responsibilities Split time between daily operations and developing scripts to improve site reliability and performance. Develop self-service tools for the Site Reliability Team. ...
-
Site Reliability Engineer
3 weeks ago
Bengaluru, Karnataka, India Whitefield Careers Full timeOverview : The Site Reliability Engineer (SRE) plays a vital role in bridging the gap between development and operations, utilizing a software engineering mindset to automate and enhance the reliability, scalability, and performance of the organization's infrastructure and applications. As a key contributor, the SRE ensures that services are available,...