Principal Site Reliability Engineer
24 hours ago
Join a globally recognized financial organization and advance your profession to new heights by contributing to revolutionary projects. You've discovered the perfect environment to have a major impact.
As a Principal Site Reliability Engineer at JPMorgan Chase within the Consumer & Community Banking division, you will leverage your advanced expertise to identify new opportunities for influencing critical incident management and enhancing the end-to-end software development lifecycle for the firm. Your role will involve managing, designing, and implementing infrastructure components and essential services to boost reliability and ensure operational efficiency within the Card Site Reliability Engineering function. You will be part of a globally distributed team dedicated to maintaining production stability, automation, reliability, and observability. We seek solution-oriented, commercially minded, and customer-focused team members who excel in an agile environment and are eager to contribute to building innovative solutions from the ground up within a diverse and inclusive team.
Job responsibilities
- Identifies and solves problems of high complexity.
- Works with development teams throughout the Software Development Life Cycle to ensure sustainable software releases
- Leads medium to large projects by bringing together the proper perspective, identifying roadblocks, and integrating feedback from team members and subject matter experts at the firm.
- Manages complex business challenges with elegant, efficient solutions, harnessing the power of code and cloud infrastructure to configure, maintain, monitor, and optimize applications, driving continuous improvement and scalability.
- Participates in support responsibilities for coverage of critical applications. Sees problems as opportunities to improve
- Architect and implement observability platforms and tools for proactive detection and continuous improvement.
- Lead the design and development of core observability services, including metrics pipelines and log aggregation.
- Leverage modern technologies such as Open Telemetry and AI/ML for anomaly detection and automated insights.
- Collaborate with engineering and SRE teams to define service-level objectives (SLOs) and error budgets.
- Provide technical leadership and mentorship to engineering teams, ensuring best practices in system design.
- Champion observability as a first-class concern in the software development lifecycle.
Required qualifications, capabilities, and skills
- Formal training or certification on Site Reliability Engineering concepts and 10+ years applied experience
- Fluent in at least one programming language such as: Python, Java/Spring Boot.
- Experience with cloud-native (AWS) instrumentation and streaming data platforms.
- Proficient with continuous integration and continuous delivery tools like Jenkins, GitLab, or Terraform
- Proficient with container and container orchestration: (ECS, Kubernetes, Docker).
- Experience with troubleshooting common networking technologies and issues.
- Ability to determine how each system relates to each other and build automation to improve reliability.
- Experience with translating research, analysis, and tests into business recommendations.
- Ability to balance and be accountable for the work of multiple architects and designers.
- Understands and leads partnerships across job functions to develop efficient systems.
- Engages team members and expresses complex ideas with appropriate level of detail, while providing constructive feedback.
Preferred qualifications, capabilities, and skills
- Influence technology and policy decisions while fostering commitment and confidence in team members.
- Develop effective solutions and analyze competitive positions by considering market trends.
- Support the introduction of innovative methods and communicate clearly to persuade audiences.
- Demonstrate concern and meet the needs of both internal and external customers.
-
Principal Site Reliability Engineer
4 days ago
Hyderabad, Telangana, India Oracle Full time ₹ 12,00,000 - ₹ 36,00,000 per yearOracle is seeking motivated Principal Site Reliability Engineer who thrives in a fast-paced rapidly evolving technology environment. This position requires wide and overall knowledge in Mainframe zLinux, DB2, zVM, AIX. Site Reliability Engineer expected to work with multiple service and product development teams, identifying cross-team issues that...
-
Principal Site Reliability Engineer
4 days ago
Hyderabad, Telangana, India Oracle Full time ₹ 12,00,000 - ₹ 36,00,000 per yearOracle is seeking motivated Principal Site Reliability Engineer who thrives in a fast-paced rapidly evolving technology environment. This position requires wide and overall knowledge in Linux administration, AI technologies, software development, cloud computing, networking, cloud security, performance analysis and monitoring to provide the stability,...
-
Site Reliability Engineer
3 days ago
Hyderabad, Telangana, India Oracle Financial Services Software Ltd Full time ₹ 12,00,000 - ₹ 36,00,000 per yearPrincipal Site Reliability Engineer Oracle is seeking motivated Principal Site Reliability Engineer who thrives in a fast-paced rapidly evolving technology environment. This position requires wide and overall knowledge in Mainframe zLinux, DB2, zVM, AIX. Site Reliability Engineer expected to work with multiple service and product development teams,...
-
Site Reliability Engineer
1 week ago
Hyderabad, Telangana, India Oracle Financial Services Software Ltd Full time ₹ 12,00,000 - ₹ 36,00,000 per yearPrincipal Site Reliability Engineer Oracle is seeking motivated Principal Site Reliability Engineer who thrives in a fast-paced rapidly evolving technology environment. This position requires wide and overall knowledge in Linux administration, AI technologies, software development, cloud computing, networking, cloud security, performance analysis and...
-
Principal Site Reliability Engineer
1 week ago
Hyderabad, Telangana, India Oracle Full time ₹ 20,00,000 - ₹ 60,00,000 per yearOracle is seeking motivated Principal Site Reliability Engineer who thrives in a fast-paced rapidly evolving technology environment. This position requires wide and overall knowledge in Linux administration, AI technologies, software development, cloud computing, networking, cloud security, performance analysis and monitoring to provide the stability,...
-
Principal Site Reliability Engineer
13 hours ago
Hyderabad, Telangana, India Oracle Full time ₹ 12,00,000 - ₹ 36,00,000 per yearDescriptionOracle is seeking motivated Principal Site Reliability Engineer who thrives in a fast-paced rapidly evolving technology environment. This position requires wide and overall knowledge in Linux administration, AI technologies, software development, cloud computing, networking, cloud security, performance analysis and monitoring to provide the...
-
Principal Site Reliability Developer
13 hours ago
Hyderabad, Telangana, India Oracle Full time ₹ 12,00,000 - ₹ 36,00,000 per yearOracle is looking for a Principal Site Reliability Developer with world-class experience in developing and supporting large scale cloud deployments across the world. The candidate should have expert level knowledge of Oracle Weblogic Application, Automation, and Running the System Production at Operational Level. The position is part of SaaS Engineering...
-
Principal Site Reliability Engineer
1 week ago
Hyderabad, Telangana, India Amgen Inc Full time ₹ 8,00,000 - ₹ 12,00,000 per yearWe are looking for a Site Reliability Engineer/Cloud Engineer (SRE) to work on the performance optimization, standardization, and automation of Amgens critical infrastructure and systems. This role is crucial to ensuring the reliability, scalability, and cost-effectiveness of our production systems. The ideal candidate will work on operational excellence...
-
Site Reliability Engineer
24 hours ago
Hyderabad, Telangana, India Talent Worx Full time ₹ 12,00,000 - ₹ 36,00,000 per yearSite Reliability Engineer (SRE)At Talent Worx, we are looking for a dedicated Site Reliability Engineer (SRE) to join our team. This role involves maintaining high availability and reliability of our services through the application of software engineering practices and systems administration skills. The ideal candidate will bridge the gap between...
-
Principal Site Reliability Engineer
1 week ago
Hyderabad, Telangana, India Cubic Corporation Full time ₹ 12,00,000 - ₹ 24,00,000 per yearBusiness Unit:Cubic Transportation SystemsCompany Details:When you join Cubic, you become part of a company that creates and delivers technology solutions in transportation to make people's lives easier by simplifying their daily journeys, and defense capabilities to help promote mission success and safety for those who serve their nation. Led by our...