Site Reliability Engineer
1 week ago
At Lilly, we unite caring with discovery to make life better for people around the world. We are a global healthcare leader headquartered in Indianapolis, Indiana. Our employees around the world work to discover and bring life-changing medicines to those who need them, improve the understanding and management of disease, and give back to our communities through philanthropy and volunteerism. We give our best effort to our work, and we put people first. We're looking for people who are determined to make life better for people around the world.
About the Role
We are seeking a highly experienced Site Reliability Engineer (SRE) who will play a key role in designing, building, and scaling reliable, automated, and self-healing infrastructure and applications. This role requires someone who is not only strong in system operations but also in engineering mindset, coding, and automation enabling us to move faster while maintaining system resilience and performance.
You will work closely with product and development teams, but you won't just "take tickets and complete requests." Instead, you will challenge, automate, and optimize ensuring that systems are robust, scalable, and efficient with minimal manual intervention.
What You'll Do
* Automation First: Identify repetitive manual work and design automation frameworks, self-service tooling, and auto-healing systems.
* Observability & Monitoring: Build end-to-end monitoring, logging, and alerting systems to ensure visibility and proactive issue resolution.
* Incident Response: Lead complex incident troubleshooting, root cause analysis, and drive blameless postmortems.
* CI/CD & Infrastructure: Enhance CI/CD pipelines and use Infrastructure as Code (IaC) to provision, configure, and manage cloud resources.
* Collaboration: Partner with dev teams to embed reliability into design and development not just after deployment.
* Innovation: Continuously evaluate emerging tools and technologies, keeping the stack modern and efficient.
* Participate in on-call rotation and improve processes to minimize human intervention.
What We're Looking For
* 6–9 years of hands-on experience as an SRE Engineer.
* Strong expertise in at least one major cloud platform (AWS, Azure, or GCP).
* Deep knowledge of Linux/Unix systems, networking, and distributed systems.
* Proficiency in programming/scripting (Python, Go, or similar).
* Advanced skills with containers and orchestration (Docker, Kubernetes at scale). * Proven experience with CI/CD pipelines and Infrastructure as Code (Terraform, Ansible, Helm, etc.).
* Expertise with observability platforms (Prometheus, Grafana, ELK, Datadog, Splunk).
* Strong background in incident management, disaster recovery, and capacity planning.
* Familiarity with SRE practices (SLIs, SLOs, error budgets, blameless postmortems).
* Excellent problem-solving, debugging, and performance optimization skills.
Desirable Qualifications
* Experience with AI/ML in operations (AIOps) for anomaly detection, predictive scaling, or automated incident triage.
* Hands-on with security engineering - IAM, secrets management, vulnerability scanning.
* Exposure to FinOps / cloud cost optimization strategies.
* Contribution to open-source projects or thought leadership in SRE/DevOps communities.
Soft Skills
* Ownership mindset - drives initiatives, not just tasks.
* Excellent communication and collaboration with dev/product leadership.
* Strategic thinking with ability to balance speed, reliability, and cost.
* Mentor and guide junior engineers on SRE best practices.
Lilly is dedicated to helping individuals with disabilities to actively engage in the workforce, ensuring equal opportunities when vying for positions. If you require accommodation to submit a resume for a position at Lilly, please complete the accommodation request form ) for further assistance. Please note this is for individuals to request an accommodation as part of the application process and any other correspondence will not receive a response.
Lilly does not discriminate on the basis of age, race, color, religion, gender, sexual orientation, gender identity, gender expression, national origin, protected veteran status, disability or any other legally protected status.
#WeAreLilly
-
Site Reliability Engineer
1 week ago
Hyderabad, Telangana, India Oracle Financial Services Software Ltd Full time ₹ 12,00,000 - ₹ 36,00,000 per yearPrincipal Site Reliability Engineer Oracle is seeking motivated Principal Site Reliability Engineer who thrives in a fast-paced rapidly evolving technology environment. This position requires wide and overall knowledge in Linux administration, AI technologies, software development, cloud computing, networking, cloud security, performance analysis and...
-
Site Reliability Engineer
3 days ago
Hyderabad, Telangana, India Oracle Financial Services Software Ltd Full time ₹ 12,00,000 - ₹ 36,00,000 per yearPrincipal Site Reliability Engineer Oracle is seeking motivated Principal Site Reliability Engineer who thrives in a fast-paced rapidly evolving technology environment. This position requires wide and overall knowledge in Mainframe zLinux, DB2, zVM, AIX. Site Reliability Engineer expected to work with multiple service and product development teams,...
-
Site Reliability Engineer
1 day ago
Hyderabad, Telangana, India Talent Worx Full time ₹ 12,00,000 - ₹ 36,00,000 per yearSite Reliability Engineer (SRE)At Talent Worx, we are looking for a dedicated Site Reliability Engineer (SRE) to join our team. This role involves maintaining high availability and reliability of our services through the application of software engineering practices and systems administration skills. The ideal candidate will bridge the gap between...
-
Site Reliability Engineer
3 days ago
Hyderabad, Telangana, India Technology Next Full time ₹ 20,00,000 - ₹ 30,00,000 per yearUrgently hiring for Site Reliability Engineer (SRE) / Chaos EngineerLocation: HyderabadJob Type: Full-time, PermanentJob Description:We are looking for an experienced Site Reliability Engineer (SRE) with strong Python automation skills (Boto3 required) and hands-on experience in chaos engineering to improve system reliability and resilience. The ideal...
-
Site Reliability Engineer
1 week ago
Hyderabad, Telangana, India SMARTWORK IT SERVICES Full time ₹ 12,00,000 - ₹ 24,00,000 per yearDescription : Role : Site Reliability Engineer (SRE). Location : Hyderabad. Experience : 10 to 15 Years. Job Summary : The Site Reliability Engineer (SRE) will play a critical role in ensuring the reliability, scalability, and performance of Citizens Banks enterprise systems and cloud environments. The ideal candidate brings deep technical...
-
Site Reliability Engineer
3 days ago
Hyderabad, Telangana, India Apple Full time ₹ 15,00,000 - ₹ 25,00,000 per yearImagine what you could do here. Apple is a place where extraordinary people gather to do their best work. Together we craft products and experiences people once couldn't have imagined — and now can't imagine living without. If you're motivated by the idea of making a real impact, and joining a team where we pride ourselves in being one of the most diverse...
-
Site Reliability Engineer
23 hours ago
Hyderabad, Telangana, India BYLD Group Full time ₹ 12,00,000 - ₹ 36,00,000 per yearDescriptionJob Title :Site Reliability Engineer (SRE) - DataDog / AWS Lambda / DynamoDB / ServerlessLocation :Bangalore / Pune / HyderabadExperience :5- 10 YearsAbout The RoleWe are seeking an experienced Site Reliability Engineer (SRE) with strong expertise in DataDog integration, AWS Lambda, DynamoDB, and Serverless architectures. The ideal candidate will...
-
Principal Site Reliability Engineer
5 days ago
Hyderabad, Telangana, India Oracle Full time ₹ 12,00,000 - ₹ 36,00,000 per yearOracle is seeking motivated Principal Site Reliability Engineer who thrives in a fast-paced rapidly evolving technology environment. This position requires wide and overall knowledge in Mainframe zLinux, DB2, zVM, AIX. Site Reliability Engineer expected to work with multiple service and product development teams, identifying cross-team issues that...
-
Principal Site Reliability Engineer
5 days ago
Hyderabad, Telangana, India Oracle Full time ₹ 12,00,000 - ₹ 36,00,000 per yearOracle is seeking motivated Principal Site Reliability Engineer who thrives in a fast-paced rapidly evolving technology environment. This position requires wide and overall knowledge in Linux administration, AI technologies, software development, cloud computing, networking, cloud security, performance analysis and monitoring to provide the stability,...
-
Senior Site Reliability Engineer
3 days ago
Hyderabad, Telangana, India Jade Global Software Pvt Ltd Full time ₹ 12,00,000 - ₹ 24,00,000 per yearSenior Site Reliability Engineer (SRE) – Datadog ObservabilitySenior Site Reliability Engineer (SRE) – Datadog Observability1 Job Title: Senior Site Reliability Engineer (SRE) – Datadog ObservabilityExperience Required: 8+ years overall in SRE and Infrastructure Operations with minimum 3+ years hands-on experience in DatadogLocation: Hyderabad...