Lead Site Reliability Engineer

3 weeks ago


Gurugram, India EPAM Full time

Description

EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that deliver the most creative and cutting-edge solutions, and have an opportunity to continuously learn and grow. No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potential.

We are seeking a talented and motivated Lead Site Reliability Engineer to join our team. As a key member of our multi-disciplined team, you will play a crucial role in ensuring the reliability, performance, and security of our complex distributed systems. If you are passionate about operational risk management, have a deep understanding of Kubernetes and Containers, and possess strong problem-solving skills, this role offers an exciting opportunity to contribute to the success of our operations.

Responsibilities

  • Ability to rapidly and effectively understand requirements and translate them into technical solutions
  • Ability to reason about performance, security, and process interactions in complex distributed systems
  • Passion for managing operational risk
  • Ability to work effectively as part of a diverse, multi-disciplined team
  • Motivated and self-organized, with good time and work management skills

Requirements

  • Should have 8 to 12 years of experience as a Site Reliability Engineer
  • Must have expert/intermediate-level knowledge of Azure (preferred) or AWS/GCP cloud infrastructure, networking, security, and storage (GCP will be decommissioned in the coming days, so Azure alone is also fine)
  • Must have intermediate-level core Python skills
  • Must have expert/intermediate-level Python, cloud, and Windows administration debugging skills
  • Must have intermediate-level knowledge of Windows or Linux administration (Linux alone is also acceptable; two weeks of Windows administration training can be provided)
  • Good to have expert/intermediate-level knowledge of infrastructure and application monitoring and related tools (ELK/Opsbridge/Dynatrace)
  • Good to have observability and centralized logging experience
  • Good to have knowledge of incident management (PagerDuty/Opsgenie/VictorOps)
  • Good to have knowledge of change management
  • Good to have knowledge of SLO, SLI, and SLA
  • Good to have knowledge of Kubernetes and Docker
  • Good to have knowledge of CI/CD (especially Azure DevOps)

We offer

  • Opportunity to work on technical challenges that may have an impact across geographies
  • Vast opportunities for self-development: online university, global knowledge-sharing opportunities, and learning through external certifications
  • Opportunity to share your ideas on international platforms
  • Sponsored Tech Talks & Hackathons
  • Unlimited access to LinkedIn learning solutions
  • Possibility to relocate to any EPAM office for short and long-term projects
  • Focused individual development
  • Benefit package: health benefits, retirement benefits, paid time off, and flexible benefits
  • Forums to explore passions beyond work (CSR, photography, painting, sports, etc.)

  • Gurugram, India NatWest Group Full time

    Join us as a Site Reliability Engineer, VP. In this key role, you’ll improve, drive, and embed non-functional and operational characteristics such as availability, performance, efficiency, change management, monitoring, security, incident response, and capacity planning of our products and services. You’ll enjoy significant stakeholder interaction, working...


  • Gurugram, India American Express Full time

    You Lead the Way. We’ve Got Your Back. With the right backing, people and businesses have the power to progress in incredible ways. When you join Team Amex, you become part of a global and diverse community of colleagues with an unwavering commitment to back our customers, communities and each other. Here, you’ll learn and grow as we help you create a...


  • Gurugram, India GEMINI Full time

    Department: Platform. Our Platform organization’s purpose is to enable Gemini to scale effectively and empower our engineering teams to focus on building innovative financial products and experiences for individuals around the world. Within Platform, the Site Reliability Engineering team is responsible for partnering with Gemini’s other engineering...


  • Gurugram, India FX Consulting Full time

    A Site Reliability Engineer (SRE) plays a crucial role in maintaining the reliability and performance of computer systems within an organization. As a bridge between development and IT operations, an SRE takes on operational tasks and responsibilities typically handled by operations teams. Here's a comprehensive job description for an SRE with the...


  • Gurugram, India IndusInd Bank Full time

    About the Role: As a Site Reliability Engineer, you will troubleshoot, debug, evaluate, and resolve customer-impacting issues, with a focus on detecting patterns and working with the engineering development and/or product teams to eliminate defects. The position requires a combination of strong troubleshooting, technical, communication, and problem-solving skills....


  • Gurugram, India StatusNeo Full time

    Job Description: We are seeking a highly skilled and experienced Senior Site Reliability Engineer with expertise in Core Tools and DevOps to join our dynamic team. The ideal candidate will have a strong background in Linux administration, cloud infrastructure, Infrastructure as Code (IaC), Python programming, and be a subject matter expert in DevOps tools...


  • Gurugram, India Codersbrain technology pvt ltd Full time

    Key Responsibilities: Provide expert production support for application teams utilizing our platform, ensuring high availability, reliability, and performance. Diagnose and resolve complex issues in production environments, collaborating closely with development teams and stakeholders. Implement and maintain monitoring, alerting, and logging solutions to...


  • Gurugram, India Builder.ai - What would you Build? Full time

    About us: We’re on a mission to make app building so easy everyone can do it – regardless of their background, tech knowledge or budget. We’ve already helped thousands of entrepreneurs, small businesses and even global brands, like the BBC, Makro and Pepsi, achieve their software goals, and we’ve only just started. Builder.ai was voted as one of 2023’s...

