Lead Site Reliability Engineer

3 days ago


Hyderabad, Telangana, India JP Morgan Chase & Co. Full time
Job Description

Assume a critical role in defining the future of a globally recognized firm and have a direct and significant effect in a realm tailored for top achievers in site reliability.

As a Lead Site Reliability Engineer at JPMorgan Chase within the Consumer & Community Banking Team, you will take the lead in conducting resiliency design reviews, break down complex problems into manageable tasks for other engineers, act as a technical lead for medium to large-sized products, and provide advice and mentoring to your fellow engineers.

Job responsibilities

- Demonstrates and champions site reliability culture and practices and exerts technical influence throughout your team
- Leads initiatives to improve the reliability and stability of your team's applications and platforms using data-driven analytics to improve service levels
- Collaborates with team members to identify comprehensive service level indicators and stakeholders to establish reasonable service level objectives and error budgets with customers
- Demonstrates a high level of technical expertise within one or more technical domains and proactively identifies and solves technology-related bottlenecks in your areas of expertise
- Acts as the main point of contact during major incidents for your application and demonstrates the skills to identify and solve issues quickly to avoid financial losses
- Documents and shares knowledge within your organization via internal forums and communities of practice

Required qualifications, capabilities, and skills

- Formal training or certification on site reliability Engineering concepts and 5+ years applied experience
- Expertise in application development and support with multiple technologies and design techniques.
- Experience in developing AI/ML solutions using public cloud architecture, specifically Azure and AWS and experience in Python for AI/ML modeling.
- Experience in automation and continuous delivery methods.
- Familiarity with agile methodologies, including CI/CD, application resiliency, and security.
- Experience in implementing GenAI services using Azure OpenAI models and AWS Bedrock service.Deep proficiency in reliability, scalability, performance, security, enterprise system architecture, toil reduction, and other site reliability best practices with the ability to implement these practices within an application or platform
- Proficiency and experience in observability such as white and black box monitoring, SLO alerting, and telemetry collection using tools such as Grafana, Dynatrace, Prometheus, Datadog, Splunk, etc.
- Proficiency in continuous integration and continuous delivery tools (e.g., Jenkins, GitLab, Terraform, etc.)
- Experience with container and container orchestration (e.g., ECS, Kubernetes, Docker, etc.)
- Experience with troubleshooting common networking technologies and issues
- Ability to identify and solve problems related to complex data structures and algorithms

  • Hyderabad, Telangana, India Chase Bank Full time

    Job DescriptionElevate your engineering prowess to unprecedented levels by joining a team of exceptionally gifted professionals and position yourself among the top echelon in site reliability.As a Principal Site Reliability Engineer at JPMorgan Chase within the Consumer & Community Banking, youwork with your fellow stakeholders to define non-functional...


  • Hyderabad, Telangana, India Talent Worx Full time ₹ 9,00,000 - ₹ 12,00,000 per year

    Site Reliability Engineer (SRE)At Talent Worx, we are looking for a dedicated Site Reliability Engineer (SRE) to join our team. This role involves maintaining high availability and reliability of our services through the application of software engineering practices and systems administration skills. The ideal candidate will bridge the gap between...


  • Hyderabad, Telangana, India beBeeSre Full time ₹ 1,50,00,000 - ₹ 2,50,00,000

    Job DescriptionWe are seeking an experienced SRE Engineer to join our team. As a Site Reliability Engineer, you will play a pivotal role in safeguarding our assets, data, and reputation in the industry.Your primary focus will be on ensuring platform and application availability, scalability, and reliability. This involves building, monitoring, and...


  • Hyderabad, Telangana, India Insight Global Full time

    Join a mission-critical SCADA reliability team —now hiring Lead, Senior, and Junior Site Reliability Engineers in HITECH Hyderabad Telangana.Step into a high-impact role with cutting-edge technologies, a flexible hybrid schedule, and a growth-driven culture backed by Evergreen, the professional services division of Insight Global.Key Technologies &...


  • Hyderabad, Telangana, India Talent Worx Full time

    Talent Worx is seeking a talented SRE (Site Reliability Engineer) to enhance our technology team. In this role, you will be pivotal in ensuring the reliability, performance, and availability of our applications and services.Your work will involve both software engineering and systems operations as you strive to improve customer experiences and operational...


  • Hyderabad, Telangana, India IntraEdge Full time

    Site Reliability EngineerExperience: 7+ YearsLocation: HyderabadHybrid 4-day office and 1 Day remoteSkills for Principal:Strong leadership and people management skills.Exceptional technical proficiency in Pearson's technology stack.Advanced project management capabilities.Excellent communication and collaboration skills.Adept at risk assessment and crisis...


  • Hyderabad, Telangana, India IntraEdge Full time

    Position - SRE (Site Reliability Engineer)Experience - 5+ YearsLocation - HyderabadSkills for Principal:Strong leadership and people management skills.Exceptional technical proficiency in Pearson's technology stack.Advanced project management capabilities.Excellent communication and collaboration skills.Adept at risk assessment and crisis...


  • Hyderabad, Telangana, India IntraEdge Full time

    Site Reliability EngineerExperience: 7+ YearsLocation: HyderabadHybrid 4-day office and 1 Day remoteSkills for Principal:- Strong leadership and people management skills.- Exceptional technical proficiency in Pearson's technology stack.- Advanced project management capabilities.- Excellent communication and collaboration skills.- Adept at risk assessment and...


  • Hyderabad, Telangana, India Kshema General Insurance Limited Full time

    About Us: Kshema General Insurace is a leading innovator in Crop Insurance. We are building scalable, reliable, and high-performance cloud-native applications on Microsoft Azure. We are seeking a talented and passionate Site Reliability Engineer (SRE) to join our team, focusing on establishing robust observability with OpenTelemetry and driving operational...


  • Hyderabad, Telangana, India Kshema General Insurance Limited Full time

    About Us: Kshema General Insurace is a leading innovator in Crop Insurance. We are building scalable, reliable, and high-performance cloud-native applications on Microsoft Azure. We are seeking a talented and passionate Site Reliability Engineer (SRE) to join our team, focusing on establishing robust observability with OpenTelemetry and driving operational...