Principal Site Reliability Engineer

3 days ago


Hyderabad, Telangana, India JPMorgan Chase Full time ₹ 45,00,000 - ₹ 90,00,000 per year

Join a globally recognized financial organization and take your career to new heights by contributing to groundbreaking projects. You've found the right environment to make a major impact.

As a Principal Site Reliability Engineer at JPMorgan Chase within the Consumer & Community Banking division, you will leverage your advanced expertise to identify new opportunities for influencing critical incident management and enhancing the end-to-end software development lifecycle for the firm. Your role will involve managing, designing, and implementing infrastructure components and essential services to boost reliability and ensure operational efficiency within the Card Site Reliability Engineering function. You will be part of a globally distributed team dedicated to maintaining production stability, automation, reliability, and observability. We seek solution-oriented, commercially minded, and customer-focused team members who excel in an agile environment and are eager to contribute to building innovative solutions from the ground up within a diverse and inclusive team.

Job responsibilities

  • Identifies and solves problems of high complexity.
  • Works with development teams throughout the Software Development Life Cycle to ensure sustainable software releases.
  • Leads medium to large projects by bringing together the proper perspective, identifying roadblocks, and integrating feedback from team members and subject matter experts at the firm.
  • Manages complex business challenges with elegant, efficient solutions, harnessing the power of code and cloud infrastructure to configure, maintain, monitor, and optimize applications, driving continuous improvement and scalability.
  • Participates in support responsibilities for coverage of critical applications and sees problems as opportunities to improve.
  • Architects and implements observability platforms and tools for proactive detection and continuous improvement.
  • Leads the design and development of core observability services, including metrics pipelines and log aggregation.
  • Leverages modern technologies such as OpenTelemetry and AI/ML for anomaly detection and automated insights.
  • Collaborates with engineering and SRE teams to define service-level objectives (SLOs) and error budgets.
  • Provides technical leadership and mentorship to engineering teams, ensuring best practices in system design.
  • Champions observability as a first-class concern in the software development lifecycle.

Required qualifications, capabilities, and skills

  • Formal training or certification in Site Reliability Engineering concepts and 10+ years of applied experience.
  • Fluent in at least one programming language, such as Python or Java (Spring Boot).
  • Experience with cloud-native (AWS) instrumentation and streaming data platforms.
  • Proficient with continuous integration and continuous delivery tools like Jenkins, GitLab, or Terraform.
  • Proficient with containers and container orchestration (Docker, Kubernetes, ECS).
  • Experience with troubleshooting common networking technologies and issues.
  • Ability to determine how systems relate to one another and build automation to improve reliability.
  • Experience with translating research, analysis, and tests into business recommendations.
  • Ability to balance and be accountable for the work of multiple architects and designers.
  • Understands and leads partnerships across job functions to develop efficient systems.
  • Engages team members and expresses complex ideas with an appropriate level of detail, while providing constructive feedback.

Preferred qualifications, capabilities, and skills

  • Influence technology and policy decisions while fostering commitment and confidence in team members.
  • Develop effective solutions and analyze competitive positions by considering market trends.
  • Support the introduction of innovative methods and communicate clearly to persuade audiences.
  • Demonstrate concern for, and meet the needs of, both internal and external customers.
 

  • Hyderabad, Telangana, India Amgen Inc Full time ₹ 8,00,000 - ₹ 12,00,000 per year

    We are looking for a Site Reliability Engineer/Cloud Engineer (SRE) to work on the performance optimization, standardization, and automation of Amgen's critical infrastructure and systems. This role is crucial to ensuring the reliability, scalability, and cost-effectiveness of our production systems. The ideal candidate will work on operational excellence...


  • Hyderabad, Telangana, India Apple Full time ₹ 15,00,000 - ₹ 25,00,000 per year

    Imagine what you could do here. Apple is a place where extraordinary people gather to do their best work. Together we craft products and experiences people once couldn't have imagined — and now can't imagine living without. If you're motivated by the idea of making a real impact, and joining a team where we pride ourselves in being one of the most diverse...


  • Hyderabad, Telangana, India ZORTECH SOLUTIONS PRIVATE LIMITED Full time ₹ 20,00,000 - ₹ 25,00,000 per year

    Job Title: Site Reliability Engineering (SRE) Manager. Location: Hyderabad. Employment Type: Full-Time. Work Model: 3 Days from office (Hybrid). Summary: The SRE Manager will lead the reliability engineering function, ensuring infrastructure resiliency and optimal operational performance. This hybrid role blends technical leadership with team mentorship and...


  • Hyderabad, Telangana, India Zeta Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    About Zeta: Zeta is a Next-Gen Banking Tech company that empowers banks and fintechs to launch banking products for the future. It was founded by Bhavin Turakhia and Ramki Gaddipati in 2015. Our flagship processing platform - Zeta Tachyon - is the industry's first modern, cloud-native, and fully API-enabled stack that brings together issuance, processing,...


  • Hyderabad, Telangana, India Talent Worx Full time ₹ 20,00,000 - ₹ 25,00,000 per year

    SRE (Site Reliability Engineer): Talent Worx is seeking a talented SRE (Site Reliability Engineer) to enhance our technology team. In this role, you will be pivotal in ensuring the reliability, performance, and availability of our applications and services. Your work will involve both software engineering and systems operations as you strive to improve...


  • Hyderabad, Telangana, India TurboHire Full time ₹ 15,00,000 - ₹ 28,00,000 per year

    Site Reliability Engineer (SRE). Location: Hyderabad (Hybrid). Experience: 3–5 years. About the Role: We are looking for an SRE Engineer to own reliability, deployment, and monitoring of TurboHire's cloud infrastructure. You will ensure our platform is scalable, secure, and highly available. The role balances hands-on coding, automation, and infra operations, freeing...


  • Hyderabad, Telangana, India LivePerson Full time ₹ 8,00,000 - ₹ 15,00,000 per year

    LivePerson (NASDAQ: LPSN) is a leading customer engagement company, creating digital experiences powered by Curiously Human AI. Every person is unique, and our technology makes it possible for companies, including leading brands like HSBC, Orange, and GM Financial, to treat their audiences that way at scale. Nearly a billion conversational interactions are...


  • Hyderabad, Telangana, India Chase- Candidate Experience page Full time ₹ 1,04,000 - ₹ 1,30,878 per year

    There's nothing more exciting than being at the center of a rapidly growing field in technology and applying your skillsets to drive innovation and modernize the world's most complex and mission-critical systems. As a Site Reliability Engineer III at JPMorgan Chase within the Chief Technology Office team, you will solve complex and broad business problems...


  • Hyderabad, Telangana, India Microsoft Full time ₹ 20,00,000 - ₹ 25,00,000 per year

    At Azure DevOps, we pride ourselves on building services that make engineering teams productive. This is why Azure DevOps is the solution of choice for millions of engineers, including thousands of Microsoft's largest customers and internal teams. Azure DevOps is a suite of services as part of Microsoft Azure, which provides work planning,...


  • Hyderabad, Telangana, India EPAM Systems Full time ₹ 15,00,000 - ₹ 25,00,000 per year

    We are seeking a skilled Lead Site Reliability Engineer to drive the stability, scalability, and reliability of our systems while improving efficiency through automation and best practices. This role calls for deep expertise in DevOps methodologies, Infrastructure as Code (IaC), and collaboration across teams to ensure optimal system...