Lead Engineer – Site Reliability Engineer
4 weeks ago
Job Description:
Role Title: Lead Engineer Site Reliability Engineer
Team: Software Engineering and Platforms
Supervisor: Software Engineering Director
Career Progression: Engineering, Architecture
Position Description:
Historically, the role of IT has been to provide a reliable ecosystem to run the business, drive efficiencies and reduce costs. These areas remain integral, however, driven by the quickening pace
of innovation, IT must evolve, proactively partnering with the business to enable new digital
business models that power new types of customer engagement.
At Elanco, our engineer roles bring adaptive set of skills covering Software-as-a-Service (SaaS),
Commercial-of-the-Shelf (CotS) and/or Custom Developed applications. The role is part of our
software engineering team established to deliver Engineering expertise to business facing products
and services. As an Engineer you will be deployed into a multi-disciplined product team applying
your software engineering talent to Elancos biggest opportunities.
To be successful in an engineering role in Elanco requires a highly motivated individual, with an
innovative mindset and a willingness to drive tangible outcomes. The individual must be able to
articulate complex technical topics and collaborate with the internal engineering organisation to
improve engineering across the enterprise.
The Role
We are seeking a skilled and motivated engineer, passionate about improving application reliability
across our enterprise. As part of our Platform Engineering organization, you will join a product team focused on a suite of capabilities designed to enhance all aspects of our engineering portfolio. In this role, you will be primarily accountable for configuring and operating our observability toolset. You will also lead the charge across the enterprise, driving the transition from reactive to proactive application support.
This is a fantastic opportunity to join a growing engineering team with the scope to partner across
our entire enterprise of products. Your contributions will help ensure that everything we deliver to
our customers come with top-notch reliability as standard.
Typical responsibilities:
Help define Elancos approach to reliability of applications partnering with our product manager for our portfolio health products.
Collaborate with stakeholders such as product and platform owners, to define service level objectives (SLOs), and service-level indicators (SLIs) for system operations focused on the critical features of the customers journey and experience.
Assist and coach product teams implementation of telemetry against SLIs/SLOs to ensure
adequate traceability is in place.
Track and manage reliability performance against agreed SLOs, in partnership with product
teams or other stakeholders, and ensure systems continue to meet SLOs over time.
Ensure key stakeholders, product owners, and platform owners are informed of reliability
concerns and their potential impact to the customers experience.
Provide expert knowledge on reliability approaches, to ensure our organization achieves its
goals and roadmap for reliability.
Champion reliability being treated as a feature in products and platforms and promote the concept across all phases of the software development life cycle.
Create dashboards and reports to communicate key metrics, to product teams and key stakeholders.
Beyond observability engage in initiatives across the product line including cost, security, and adoption helping the team drive to a health portfolio throughout an applications lifecycle.
Participate problem management activities, including post-mortem incident analysis, and provision of technical insight, documented findings, outcomes and recommendations as part of a root cause analysis to troubleshoot priority incidents.
Implement automation to reduce probability and/or impact of problems recurring and
target self-healing through automation of reoccurring incidents.
For critical applications, utilize practices such as chaos engineering and performance engineering to test in preproduction environments. This includes disaster recovery (DR)
testing, performance testing, and tabletop planning exercises.
Participate and exert influence in organizational learning initiatives such as communities of practice to share knowledge and foster a continuous learning and improvement mindset.
Support architects working on new solutions, including analyzing requirements, supporting
technical architecture activities, prototyping, designing and developing reusable
infrastructure artifacts, testing, implementing, and preparing for ongoing support.
Train and mentor junior and engineers to ensure SRE best practices evolve and scale successfully in the organization
Partner with the product manager of portfolio health to build out golden paths, education
and services to package the capability in a consumable way on our developer portal.
Be a product team champion extending into product teams helping to deliver foundational
platform engineering capabilities where applicable.
Partner with compliance teams to ensure the data we bring into observability platforms meets privacy and compliance standards
Maintain consistent standards and set out a taxonomy of telemetry to enable future opportunities including leveraging of AI capability.
Basic Qualifications:
Experience in some of the following areas essential.
10-15 years of hands-on engineering experience.
5 years experience in Platform Engineering, SRE or similar role
5-10 years of experience working with modern application architecture methodologies (Service Orientated Architecture, API-Centric Design, Twelve-Factor App, FAIR, etc.).
5 + years of experience working with Cloud Native design patterns, with a
preference towards Microsoft Azure / Google Cloud.
5 + years of experience designing and delivering digital solutions following a product-mindset and a variety of delivery methodologies (e.g. Agile, CCPM, etc.).
5 + years of experience working within a DevSecOps culture, including modern software development practices, covering Continuous Integration and Continuous Delivery (CI/CD), Test-Driven Development (TDD), etc.
Experience with enterprise observability platforms. E.g Datadog, New Relic
Experience with monitoring 3rd party and SaaS applications.
Experience establishing standards around MELT (Metrics, Events, Logging and Tracing and implementing at an enterprise level.
Experience with Open Telemetry advantageous.
Experience supporting digital platforms, including Integrations, Release Management, Regression Testing, Integrations, Data Obfuscation, etc.
Experience scaling an API-Ecosystem, designing, and implementing API-First integration patterns.
Experience working with authentication and authorisation protocols/patterns.
Experience defining and implementing large-scale, transformative digital solutions.
Demonstrated influence and communication skills across all levels of IT and third parties.
Experience working in complex, diverse landscapes (business, technology, regulatory, partners, providers, geographies, etc.).
Strong organizational and communications skills with multiple examples of being able to convey complex technical topics, that resulted in a definitive direction.
Education Requirements: Bachelors degree in information technology.
Other Information: Occasional travel may be required.
Elanco is an EEO/Affirmative Action Employer and does not discriminate on the basis of age, race, color, religion, gender, sexual orientation, gender identity, gender expression, national origin, protected veteran status, disability or any other legally protected status
-
Site Reliability Engineer
3 days ago
Bengaluru, Karnataka, India NatWest Group Full timeCompany Overview:The NatWest Group is a leading financial services provider dedicated to delivering innovative solutions. Our team is passionate about creating a seamless customer experience.As a Site Reliability Engineer, you will play a key role in supporting the improvement of non-functional and operational characteristics. You will work closely with...
-
Site Reliability Engineer
2 weeks ago
Bengaluru, Karnataka, India CORTEX Consultants Full timeSite Reliability Engineer Experience Required : 5 Years Work Timing : 2 : 00 pm to 10 : 00 pm (Bangalore based candidates only) About the Role : Highly skilled Site Reliability Engineer with 5 years of experience. We are seeking a Senior Site Reliability Engineer to collaborate with software engineering teams, focusing on building and securing cloud...
-
Lead Site Reliability Engineer
2 days ago
Bengaluru, Karnataka, India Pocket FM Full timeJob Title: Lead Site Reliability Engineer (SRE)Location: BangaloreExperience: 6+ years (with at least 2 years in a leadership/SRE-specific role)Employment Type: Full-timeAbout Pocket FMPocket FM is India's largest audio OTT platform delivering high-quality, engaging, and vernacular audio content. With millions of users and a growing global footprint, we're...
-
Site Reliability Engineer Lead
4 days ago
Bengaluru, Karnataka, India Athenahealth Technology Private Limited Full timeAthenahealth Technology Private Limited is a leading healthcare technology company that aims to create a thriving ecosystem that delivers accessible, high-quality, and sustainable healthcare for all. We are seeking a skilled Site Reliability Engineer to join our Cloud Infrastructure Engineering division.About UsWe are a vibrant and talented team of engineers...
-
Lead Site Reliability Engineer
3 days ago
Bengaluru, Karnataka, India LTIMindtree Limited Full timeWe are seeking a dynamic and versatile Lead Site Reliability Engineer to lead and provide end-to-end solution support for our digital platforms. This role involves conceiving and developing presentations that are logical, well-written, and concise.Key ResponsibilitiesConceive and develop presentations to a peer groupLead and provide end-to-end solution...
-
Site Reliability Engineering Lead
7 days ago
Bengaluru, Karnataka, India Intuition IT – Intuitive Technology Recruitment Full timeJob SummaryWe are seeking an experienced Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and performance of our products and services. This includes tracking and reducing toil, defining SLIs, SLOs, and defining error budgets that support finding the right balance between risk...
-
Site Reliability Engineer Leader
1 week ago
Bengaluru, Karnataka, India LTIMindtree Limited Full timeWe are seeking a T-shaped dynamic Lead Site Reliability Engineer to lead end-to-end solution support for our digital platforms. The successful candidate will be responsible for providing critical thinking, self-motivation, and excellent communication skills.Job DescriptionThe role of a Site Reliability Engineer is to ensure the stability, scalability, and...
-
Senior Mechanical Engineer
3 days ago
Bengaluru, Karnataka, India Abha Engineer Full timeWe are looking for a Senior Mechanical Engineer Roles are described below. 1. Manpower Planning. 2. Preparing of Project Cost. 3. Schedule wise work execution. 4. As Drawing & quality work execution. 5. Client & Third Party Manage. 6. Working Team Manage & Review. 7. Reporting to Management. 8. ROB & FOB Fabrication & Erection Work Knowledge.
-
Site Reliability Engineer Lead
1 week ago
Bengaluru, Karnataka, India Outcomes® Full timeAbout the Role:As a Site Reliability Engineer at Outcomes®, you will serve as a vital link between our software development and Dev Ops operations teams, applying a software engineering mindset to bridge these two worlds.This role combines daily operations and script development to improve site reliability and performance. These scripts or utilities should...
-
Site Reliability Engineer
3 days ago
Bengaluru, Karnataka, India CORTEX Consultants Full timeSite Reliability EngineerExperience Required : 5+ YearsWork Timing : 2 : 00 pm to 10 : 00 pm (Bangalore based candidates only)About the Role : Highly skilled Site Reliability Engineer with 5+ years of experience. We are seeking a Senior Site Reliability Engineer to collaborate with software engineering teams, focusing on building and securing cloud...
-
Site Reliability Engineer
3 weeks ago
Bengaluru, Karnataka, India Dexian Full timeAbout the job : Job Role : Site Reliability Engineer Location : Bangalore- Hybrid Notice period : Immediate or currently serving less than 30 days Experience : 5 years of relevance Primary Responsibilities : - Work with other Site Reliability Engineers to implement and maintain scalable, reliable, performant, and efficient systems. - Help the team to...
-
Site Reliability Engineer
2 weeks ago
Bengaluru, Karnataka, India Outcomes® Full timeAbout the Role As a Site Reliability Engineer, you will bridge the gap between software development and DevOps operations by applying a software engineering mindset. Key Responsibilities Split time between daily operations and developing scripts to improve site reliability and performance. Develop self-service tools for the Site Reliability Team. ...
-
Site Reliability Engineer
3 weeks ago
Bengaluru, Karnataka, India Whitefield Careers Full timeOverview : The Site Reliability Engineer (SRE) plays a vital role in bridging the gap between development and operations, utilizing a software engineering mindset to automate and enhance the reliability, scalability, and performance of the organization's infrastructure and applications. As a key contributor, the SRE ensures that services are available,...
-
Site Reliability Engineer
2 weeks ago
Bengaluru, Karnataka, India Deltaclass Technology Solutions Pvt. Ltd. Full timeRole : SRE Senior Engineer Experience : 6-8 years Responsibilities : - Define, track, and report on SLOs and SLIs for critical service - Setup Monitoring and observability for the system - Take lead on complex incidents and provide deep technical expertise to resolve issues quickly. - Perform RCA in-depth for incident management and suggest permanent fix -...
-
Site Reliability Engineer Leader
2 days ago
Bengaluru, Karnataka, India Acara Solutions, Inc. Full timeAbout the RoleWe are looking for a skilled Lead - Cloud Site Reliability Engineer to lead our cloud operations team at Acara Solutions, Inc.The successful candidate will have extensive experience with Kubernetes workloads on AWS EKS, Docker images, and Helm charts.As a leader, you will be responsible for mentoring and guiding junior engineers, ensuring that...
-
Site Reliability Engineer
3 weeks ago
Bengaluru, Karnataka, India COGNITUD ADVISORY SERVICES PRIVATE LIMITED Full timeAbout the job. Domain : IT Services & Consulting. Position : Site Reliability Engineer. Experience : 4-7 Years. Location : Chennai, Kolkata, Hyderabad & Bangalore. Your Team - You are invited to work with a top-tier organization that's been in the game for 50 years, partnering with some of the world's biggest businesses. As India's largest multinational...
-
Lead Site Reliability Engineer
3 weeks ago
Bengaluru, Karnataka, India Delta Air Lines Full timeKey Responsibilities Building and supporting a reliable customer facing application suite for the environment to meet the development and maintenance requirements of systems/platforms. Working with development teams to evaluate the health, stability, and reliability of customer-facing applications. Utilizing monitoring, alerts, dashboards, and management...
-
Site Reliability Engineer
4 weeks ago
Bengaluru, Karnataka, India Dexian Full timeAbout the job :Job Role : Site Reliability EngineerLocation : Bangalore- HybridNotice period : Immediate or currently serving less than 30 daysExperience : 5+ years of relevancePrimary Responsibilities :- Work with other Site Reliability Engineers to implement and maintain scalable, reliable, performant, and efficient systems.- Help the team to continuously...
-
Site Reliability Engineer
3 days ago
Bengaluru, Karnataka, India Dexian Full timeAbout the job :Job Role : Site Reliability EngineerLocation : Bangalore- HybridNotice period : Immediate or currently serving less than 30 daysExperience : 5+ years of relevancePrimary Responsibilities :- Work with other Site Reliability Engineers to implement and maintain scalable, reliable, performant, and efficient systems.- Help the team to continuously...
-
Site Reliability Engineer Lead
2 weeks ago
Bengaluru, Karnataka, India Thomson Reuters Full timeJob DescriptionIn this role, you will be responsible for implementing site reliability engineering and DevOps best practices. You will feed non-functional requirements into the product backlog, including high availability, scalability, self-healing, observability, continuous delivery, and security. Additionally, you will build and maintain monitoring for all...