Site Reliability Engineer-Backend Sr Analyst

6 days ago


Gurugram, India PepsiCo Full time
Overview

PepsiCo is one of the world's leading food and beverage companies with more than $79 Billion in Net Revenue and a global portfolio of diverse and beloved brands. We have a complementary food and beverage portfolio that includes 22 brands that each generated more than $1 Billion in annual retail sales. PepsiCo's products are sold in more than 200 countries and territories around the world. PepsiCo's strength is its people. We are over 250,000 game changers, mountain movers and history makers, located around the world, and united by a shared set of values and goals. 

Responsibilities Automation, enhancements, improvements (stability, capacity, resiliency) Application support Monitoring – monitor and resolve system and application alerts Incidents and service requests – review, troubleshoot and resolve issues reported by customers End to end troubleshooting – standard L1/L2 support Deep dive on recurring issues and long-term fixes Alert reduction, rationalization and standardization Job execution, configuration and optimization SLA upkeep and timely acknowledgments of issues Managing runbooks for all major tasks and KEDB (known error database) for known issues Good understanding of the backend technology stacks, hosting and deployment of APIs/Microservices and back end solutions. Qualifications Experience driving and actively monitoring feedback on technical designs for backend systems. Good understanding of Observability (monitoring, logging, tracing, metrics), Chaos engineering concepts. Proficiency in using Application Performance Monitoring (APM) tool like New Relic for monitoring, logging, tracing. Expert level hands on knowledge in public cloud platform AWS and/or Google Cloud Platform. Professional level certificate on one of the public clouds is highly desirable. Experience with DevOps/DevSecOps practices and comfortability in operating backend systems. Must have hands-on experience in using configuration management systems and infrastructure automation tools. Should have experience providing backend system solutions around Service Level Indicators (SLIs) and Service Level Objectives (SLOs) for services. Should have supported Production Incidents (PIs) on critical applications of a company. Troubleshoot, debug, and diagnose operational issues and drive them to closure. Understanding of software delivery life cycles, particularly Agile/Lean & DevOps. Proven experience in handling large scale and growing infrastructure across Data Centres and heterogeneous Cloud platforms. Ability to work with creative – fast growing technical team.

  • gurugram, India PepsiCo Full time

    Overview PepsiCo is one of the world's leading food and beverage companies with more than $79 Billion in Net Revenue and a global portfolio of diverse and beloved brands. We have a complementary food and beverage portfolio that includes 22 brands that each generated more than $1 Billion in annual retail sales. PepsiCo's products are sold in more than 200...


  • Gurugram, India Codersbrain technology pvt ltd Full time

    Key Responsibilities :- Provide expert production support for application teams utilizing our platform, ensuring high availability, reliability, and performance.- Diagnose and resolve complex issues in production environments, collaborating closely with development teams and stakeholders.- Implement and maintain monitoring, alerting, and logging solutions to...


  • Bangalore/Gurgaon/Gurugram, India Codersbrain technology pvt ltd Full time

    Key Responsibilities :- Provide expert production support for application teams utilizing our platform, ensuring high availability, reliability, and performance.- Diagnose and resolve complex issues in production environments, collaborating closely with development teams and stakeholders.- Implement and maintain monitoring, alerting, and logging solutions to...


  • Bangalore/Gurgaon/Gurugram, IN Codersbrain technology pvt ltd Full time

    Key Responsibilities :- Provide expert production support for application teams utilizing our platform, ensuring high availability, reliability, and performance.- Diagnose and resolve complex issues in production environments, collaborating closely with development teams and stakeholders.- Implement and maintain monitoring, alerting, and logging solutions to...


  • Gurugram, India BayOne Solutions Full time

    Technical SkillsProficiency with cloud platforms AWS, Terraform, Kubernetes, Jira, CloudFlare, FreshService, Jira, DataDog, GitHub, Postgres, Java / Kotlin,Proficient in SRE principlesUnderstanding of CI / CD processesExperience6+ years of experience in a Site Reliability Engineering, DevOps or similar roleBachelor's degree in Computer Science, Information...


  • Gurugram, India BayOne Solutions Full time

    Technical SkillsProficiency with cloud platforms AWS, Terraform, Kubernetes, Jira, CloudFlare, FreshService, Jira, DataDog, GitHub, Postgres, Java / Kotlin,Proficient in SRE principlesUnderstanding of CI / CD processesExperience6+ years of experience in a Site Reliability Engineering, DevOps or similar roleBachelor's degree in Computer Science, Information...


  • gurugram, India BayOne Solutions Full time

    Technical Skills Proficiency with cloud platforms AWS, Terraform, Kubernetes, Jira, CloudFlare, FreshService, Jira, DataDog, GitHub, Postgres, Java / Kotlin, Proficient in SRE principles Understanding of CI / CD processes Experience 6+ years of experience in a Site Reliability Engineering, DevOps or similar role Bachelor's degree in Computer Science,...


  • Gurugram, India FX Consulting Full time

    A Site Reliability Engineer (SRE) plays a crucial role in maintaining the reliability and performance of computer systems within an organization. As a bridge between development and IT operations, an SRE takes on operational tasks and responsibilities typically handled by operations teams. Here's a comprehensive job description for an SRE with the...


  • gurugram, India FX Consulting Full time

    A Site Reliability Engineer (SRE) plays a crucial role in maintaining the reliability and performance of computer systems within an organization. As a bridge between development and IT operations, an SRE takes on operational tasks and responsibilities typically handled by operations teams. Here's a comprehensive job description for an SRE with the specified...


  • Gurugram, India NatWest Group Full time

    Join us as a Site Reliability Engineer  You’ll manage the provision of stable, resilient, reliable applications with the end goal of minimising disruption to Customer & Colleague Journeys (CCJ) We’ll look to you to identify and automate manual tasks and implement observability solutions, ensuring a thorough understanding of CCJ across...


  • gurugram, India GEMINI Full time

    Department : Platform Our Platform organization’s purpose is to enable Gemini to scale effectively and empower our engineering teams to focus on building innovative financial products and experiences for individuals around the world. Within Platform, the Site Reliability Engineering team is responsible for partnering with Gemini’s other...


  • Gurugram, India GEMINI Full time

    Department : Platform Our Platform organization’s purpose is to enable Gemini to scale effectively and empower our engineering teams to focus on building innovative financial products and experiences for individuals around the world. Within Platform, the Site Reliability Engineering team is responsible for partnering with Gemini’s other engineering...


  • Gurgaon/Gurugram, India FX Consulting Full time

    A Site Reliability Engineer (SRE) plays a crucial role in maintaining the reliability and performance of computer systems within an organization. As a bridge between development and IT operations, an SRE takes on operational tasks and responsibilities typically handled by operations teams. Here's a comprehensive job description for an SRE with the...


  • Gurgaon/Gurugram, IN FX Consulting Full time

    A Site Reliability Engineer (SRE) plays a crucial role in maintaining the reliability and performance of computer systems within an organization. As a bridge between development and IT operations, an SRE takes on operational tasks and responsibilities typically handled by operations teams. Here's a comprehensive job description for an SRE with the...


  • gurugram, India StatusNeo Full time

    Job Description: We are seeking a highly skilled and experienced Senior Site Reliability Engineer with expertise in Core Tools and DevOps to join our dynamic team. The ideal candidate will have a strong background in Linux administration, cloud infrastructure, Infrastructure as Code (IaC), Python programming, and be a subject matter expert in DevOps tools...


  • gurugram, India American Express Full time

    You Lead the Way. We’ve Got Your Back. With the right backing, people and businesses have the power to progress in incredible ways. When you join Team Amex, you become part of a global and diverse community of colleagues with an unwavering commitment to back our customers, communities and each other. Here, you’ll learn and grow as we help you create a...


  • gurugram, India AMEX Full time

    You Lead the Way. Weve Got Your Back. With the right backing, people and businesses have the power to progress in incredible ways. When you join Team Amex, you become part of a global and diverse community of colleagues with an unwavering commitment to back our customers, communities and each other. Here, youll learn and grow as we help you create a...


  • Gurugram, India AMEX Full time

    You Lead the Way. Weve Got Your Back. With the right backing, people and businesses have the power to progress in incredible ways. When you join Team Amex, you become part of a global and diverse community of colleagues with an unwavering commitment to back our customers, communities and each other. Here, youll learn and grow as we help you create a career...


  • Gurugram, India American Express Full time

    You Lead the Way. We’ve Got Your Back. With the right backing, people and businesses have the power to progress in incredible ways. When you join Team Amex, you become part of a global and diverse community of colleagues with an unwavering commitment to back our customers, communities and each other. Here, you’ll learn and grow as we help you create a...


  • Gurugram, India IndusInd Bank Full time

    About the RoleAs a Site Reliability Engineer you will troubleshoot, debug, evaluate and resolve customer impacting issues with a focus on detecting patterns and working with the engineering development and or product teams to eliminate defects. The position requires a combination of strong troubleshooting, technical, communication and problem solving skills....