
Site Reliability Engineer
5 days ago
Job Description

About the Role
We are seeking a highly experienced Site Reliability Engineer (SRE) who will play a key role in designing, building, and scaling reliable, automated, and self-healing infrastructure and applications. This role requires someone who is strong not only in system operations but also in engineering mindset, coding, and automation, enabling us to move faster while maintaining system resilience and performance. You will work closely with product and development teams, but you won’t just “take tickets and complete requests.” Instead, you will challenge, automate, and optimize, ensuring that systems are robust, scalable, and efficient with minimal manual intervention.

What You’ll Do
• Automation First: Identify repetitive manual work and design automation frameworks, self-service tooling, and auto-healing systems.
• Observability & Monitoring: Build end-to-end monitoring, logging, and alerting systems to ensure visibility and proactive issue resolution.
• Incident Response: Lead complex incident troubleshooting and root cause analysis, and drive blameless postmortems.
• CI/CD & Infrastructure: Enhance CI/CD pipelines and use Infrastructure as Code (IaC) to provision, configure, and manage cloud resources.
• Collaboration: Partner with dev teams to embed reliability into design and development, not just after deployment.
• Innovation: Continuously evaluate emerging tools and technologies, keeping the stack modern and efficient.
• Participate in the on-call rotation and improve processes to minimize human intervention.

What We’re Looking For
• 6–9 years of hands-on experience as an SRE.
• Strong expertise in at least one major cloud platform (AWS, Azure, or GCP).
• Deep knowledge of Linux/Unix systems, networking, and distributed systems.
• Proficiency in programming/scripting (Python, Go, or similar).
• Advanced skills with containers and orchestration (Docker, Kubernetes at scale).
• Proven experience with CI/CD pipelines and Infrastructure as Code (Terraform, Ansible, Helm, etc.).
• Expertise with observability platforms (Prometheus, Grafana, ELK, Datadog, Splunk).
• Strong background in incident management, disaster recovery, and capacity planning.
• Familiarity with SRE practices (SLIs, SLOs, error budgets, blameless postmortems).
• Excellent problem-solving, debugging, and performance optimization skills.

Desirable Qualifications
• Experience with AI/ML in operations (AIOps) for anomaly detection, predictive scaling, or automated incident triage.
• Hands-on experience with security engineering: IAM, secrets management, vulnerability scanning.
• Exposure to FinOps / cloud cost optimization strategies.
• Contribution to open-source projects or thought leadership in SRE/DevOps communities.

Soft Skills
• Ownership mindset: drives initiatives, not just tasks.
• Excellent communication and collaboration with dev/product leadership.
• Strategic thinking with the ability to balance speed, reliability, and cost.
• Mentors and guides junior engineers on SRE best practices.

Lilly is dedicated to helping individuals with disabilities to actively engage in the workforce, ensuring equal opportunities when vying for positions. If you require accommodation to submit a resume for a position at Lilly, please complete the accommodation request form for further assistance. Please note this is for individuals to request an accommodation as part of the application process and any other correspondence will not receive a response. Lilly does not discriminate on the basis of age, race, color, religion, gender, sexual orientation, gender identity, gender expression, national origin, protected veteran status, disability or any other legally protected status. #WeAreLilly
-
Site Reliability Engineer III
2 weeks ago
Hyderabad, India Chase Bank Full time
Job Description: There's nothing more exciting than being at the center of a rapidly growing field in technology and applying your skill sets to drive innovation and modernize the world's most complex and mission-critical systems. As a Site Reliability Engineer III at JPMorgan Chase within the Consumer and Community Banking, you will solve complex and broad...
-
Site reliability engineer
3 weeks ago
India Employ Full time
Role: Site Reliability Engineer (SRE), Platform Engineering, or DevOps Engineering roles
Location: Fully Remote
Type: 6-month contract
Work experience: 5+ years
We’re working with an AI product company that’s building the next generation of Gen AI-powered developer platforms. We’re looking for an experienced Site Reliability Engineer to join...
-
Site Reliability Engineer
3 weeks ago
India Elgebra Full time
Hiring: Site Reliability Engineer – 7+ Years
Location: Bangalore / Chennai
Payroll: Elgebra
-
Lead Site Reliability Engineer
3 weeks ago
Hyderabad, India Chase Bank Full time
Job Description: Assume a critical role in defining the future of a globally recognized firm and have a direct and significant effect in a realm tailored for top achievers in site reliability. As a Lead Site Reliability Engineer at JPMorgan Chase within the Consumer & Community Banking, you hold a leadership role in your team, demonstrate strong knowledge...
-
Site Reliability Engineer
4 weeks ago
India Concord Full time
SRE Sr. Engineers (Individual Contributors)
Key Attributes:
- Strong SRE (Site Reliability Engineering) experience
- DevOps skills – CI/CD, monitoring, automation, infrastructure as code, etc.
- Excellent troubleshooting and debugging skills (infrastructure + application level)
- Perseverance – must push through complex/challenging issues without...
-
Site reliability engineer
3 weeks ago
India Concord Full time
SRE Sr. Engineers (Individual Contributors)
Key Attributes:
- Strong SRE (Site Reliability Engineering) experience
- DevOps skills – CI/CD, monitoring, automation, infrastructure as code, etc.
- Excellent troubleshooting and debugging skills (infrastructure + application level)
- Perseverance – must push through complex/challenging issues without...
-
Site Reliability Engineer
1 week ago
Hyderabad, India Talent Worx Full time
Site Reliability Engineer (SRE): At Talent Worx, we are looking for a dedicated Site Reliability Engineer (SRE) to join our team. This role involves maintaining high availability and reliability of our services through the application of software engineering practices and systems administration skills. The ideal candidate will bridge the gap between...
-
Site Reliability Engineer III
21 hours ago
Hyderabad, India hackajob Full time
Job Description: hackajob is collaborating with J.P. Morgan to connect them with exceptional tech professionals for this role. As a Site Reliability Engineer III at JPMorgan Chase within the Chief Technology Office, you will collaborate with engineering, support, and operations teams to maintain and improve the reliability of mission-critical applications....
-
Site Reliability Engineer
3 weeks ago
Hyderabad, India Jigya Software Services Full time
Job Title: Senior Site Reliability Engineer (SRE) - AWS/Kubernetes
Location: Hyderabad - Onsite
Job Type: Full-Time
About the Role: We are looking for a highly skilled and motivated Site Reliability Engineer to design, build, and maintain our high-performance, scalable cloud infrastructure. You will play a critical role in ensuring the reliability, performance,...
-
Site Reliability Engineer
2 weeks ago
Hyderabad, Telangana, India Jigya Software Services Full time ₹ 1,50,000 - ₹ 28,00,000 per year
Job Title: Senior Site Reliability Engineer (SRE) - AWS/Kubernetes
Location: Hyderabad - Onsite
Job Type: Full-Time
About the Role: We are looking for a highly skilled and motivated Site Reliability Engineer to design, build, and maintain our high-performance, scalable cloud infrastructure. You will play a critical role in ensuring the reliability, performance, and...