(3 Days Left) Associate Manager SRE

3 weeks ago

Hyderabad, India PepsiCo Full time

Overview
We are seeking a self-driven, inquisitive, and curious Site Reliability Engineer (SRE) to drive reliability, availability, performance, and security across our global digital product ecosystem. This role is central to ensuring a seamless and resilient experience for our users by blending deep engineering expertise with operational excellence and automation. You will be part of a global SRE practice supporting a portfolio of 260+ modern cloud-native applications across consumer, commercial, supply chain, and enablement functions. Your mission: prevent incidents before they occur, ensure rapid recovery when they do, and build scalable systems that evolve with our growing business.

Responsibilities
- Champion reliability, observability, and operational excellence across mission-critical applications.
- Develop and maintain service-level indicators (SLIs), objectives (SLOs), and error budgets to measure and improve system performance.
- Implement automated monitoring, alerting, and recovery mechanisms to reduce manual intervention and improve response times.
- Collaborate closely with software engineering, platform, and operations teams to embed SRE practices across the development lifecycle.
- Lead and participate in incident response, root cause analysis, and postmortem reviews to drive long-term improvements.
- Identify and eliminate sources of toil through automation, tooling, and process refinement.
- Continuously improve resiliency design, capacity planning, and release management in production systems.
- Influence engineering teams with best practices on cloud-native architecture, observability, and deployment strategies.

Qualifications

Required Skills:
- 5+ years of experience in production engineering, DevOps, or SRE roles.
- Strong foundation in Linux systems, networking, and cloud platforms (Azure, AWS, or GCP).
- Hands-on experience with observability tools (e.g., AppDynamics, Prometheus, Grafana, ELK, FullStory).
- Proficiency in scripting or programming (e.g., Python, Bash, Go) and automation frameworks (e.g., Ansible, Terraform).
- Deep understanding of CI/CD pipelines, release strategies, and deployment automation.
- Experience in managing high-scale, distributed systems in cloud-native environments.
- Strong analytical skills and a passion for continuous improvement.

Preferred Skills:
- Familiarity with microservices, Kubernetes, containers, and service mesh architecture.
- Exposure to incident and problem management frameworks (e.g., ITIL, RCA practices).
- Experience working in global teams supporting mission-critical applications.

Manager SRE

5 days ago

Hyderabad, Telangana, India PepsiCo Full time

Overview: Manager SRE for Cloud automation and SRE analyst. Responsibilities: Candidate must have 7-9 years of experience. Engineer should have hands-on development experience. Ansible or Terraform experience is required. Python/PowerShell experience is preferred. Engineer should develop automation scripts for the Cloud team. Maintain existing...
SRE Solution Architect

1 day ago

Hyderabad, India PepsiCo Full time

Overview: PepsiCo’s Digital Transformation requires new business processes, new digital products, and new operations outcomes. The transformation towards modern operations in an SRE construct applies to all programs under Digital Products, Tech Strategy & Enterprise Product, with the main purpose of driving higher-order outcomes for our customers who use our...
▷ [High Salary] SRE Solution Architect

2 weeks ago

Hyderabad, India PepsiCo Full time

Job Description Overview: PepsiCo's Digital Transformation requires new business processes, new digital products, and new operations outcomes. The transformation towards modern operations in an SRE construct applies to all programs under Digital Products, Tech Strategy & Enterprise Product, with the main purpose of driving higher-order outcomes for our customers...
SRE

1 day ago

Hyderabad, India Virtusa Full time

SRE - CREQ Description: BI tools, API & batch monitoring support. Responsibilities: 1. Troubleshoot recurring failures and participate in incident triages 2. Troubleshoot issues from both a production and a performance standpoint 3. Be on call to respond during app failures 4. Monitor critical applications and services to minimize downtime and...
SRE Design

3 weeks ago

Hyderabad, India PepsiCo Full time

Overview: We are looking for a self-driven SRE with a software engineering mindset to: - Drive new shift-left activities critical to applying Site Reliability Engineering (SRE) and quality assurance principles within the application design / project roadmap that enables resilient outcomes - Apply a pre-emptive approach into production, minimizing business...
▷ 3 Days Left: Associate General Manager

3 weeks ago

Hyderabad, India Livspace Full time

As an Associate General Manager - Design, you will own the revenue of a region and manage the critical growth and performance metrics of both the business and its people. You will take complete ownership of business-critical initiatives: product launches, internal process improvements, category expansion, vendor base, etc. - Contribute actively to business...
SRE Lead Design

1 day ago

Hyderabad, India PepsiCo Full time

Overview: We are looking for a self-driven SRE with a software engineering mindset to: - Drive new shift-left activities critical to applying Site Reliability Engineering (SRE) and quality assurance principles within the application design / project roadmap that enables resilient outcomes - Apply a pre-emptive approach into production, minimizing business impact,...
SRE Engineer

1 day ago

Hyderabad, India Virtusa Full time

SRE Engineer - CREQ Description Requirements: Bachelor's degree, preferably in Computer Science, or relevant work experience. 3+ years of experience as an SRE. Wants to automate everything. Familiarity with ITIL or other Information Technology operations foundations. Interest and ability in mentoring staff and improving technical capabilities. Knowledge of...
SRE

1 week ago

Gurugram, Hyderabad, Noida, India Zensar Full time ₹ 15,00,000 - ₹ 25,00,000 per year

Short Description for Internal Candidates: - Bachelor's degree in Computer Science, IT, or equivalent. - 3–6 years in SRE, Observability, Application Monitoring, or Performance Engineering roles. - Hands-on exposure to Glassbox and Sumo Logic strongly preferred. Description for Candidates: We are seeking a Site Reliability Engineer (SRE) with a strong focus on...
Sr. SRE

5 days ago

Hyderabad, Telangana, India Nexure Tech Full time ₹ 12,00,000 - ₹ 36,00,000 per year

Hiring: Sr. SRE / Application Support Engineer (3 Open Roles). Location: Hyderabad (Hybrid) | Night Shift (Remote option available). Experience: 8+ years. Shifts (24x7 coverage): 9:00 AM – 5:00 PM, 5:00 PM – 1:00 AM, 1:00 AM – 9:00 AM. Interview Process: 1 round with Algoscape Systems, 1 round with client. About the Role: We are looking for highly skilled...
