Lead Site Reliability Engineering

1 week ago


Bengaluru, Karnataka, India Visa Full time ₹ 48,00,000 - ₹ 72,00,000 per year
Company Description

Visa is a world leader in payments and technology, with over 259 billion payments transactions flowing safely between consumers, merchants, financial institutions, and government entities in more than 200 countries and territories each year. Our mission is to connect the world through the most innovative, convenient, reliable, and secure payments network, enabling individuals, businesses, and economies to thrive while driven by a common purpose – to uplift everyone, everywhere by being the best way to pay and be paid.

Make an impact with a purpose-driven industry leader. Join us today and experience Life at Visa.

Job Description

Overview:

Join Visa's Technology Organization, a dynamic community of problem solvers and innovators dedicated to redefining the future of commerce. We manage one of the world's most advanced processing networks, handling over 65,000 secure transactions per second across 80 million merchants, 15,000 financial institutions, and billions of individuals. As a Lead Site Reliability Engineer (SRE), you will lead efforts to ensure stability, security, and efficiency of our applications and systems, driving continuous improvement and innovation.

Key Responsibilities:

  • Security and Safety: Ensure the security and safety of application services and platforms. Lead efforts to enhance operational practices focusing on efficiency, security, and excellence.
  • Zero Downtime: Maintain zero downtime by swiftly addressing any issues to ensure the environment is always operational. Conduct rapid root cause analysis and implement remediation in production environments after thorough testing.
  • Environment Management: Oversee all activities within the environment, including deploying new code.
  • Team Leadership: Inspire and lead the team to deliver strategic and innovative approaches that drive Visa's growth. Provide mentorship and foster a culture of collaboration and continuous improvement.
  • Stakeholder Partnerships: Build strong partnerships with key stakeholders, including product management, engineering, design, and operations.
  • Strategic Impact: Impact strategic decisions at all levels by interacting with other leaders on complex issues and applying strong judgment and analysis.
  • Effective Communication: Communicate effectively with both technical and business partners to create frameworks for discussing complex topics.
  • Automation and AI: Regularly analyze the environment and promote the adoption of automation and Generative AI to stay competitive.
  • Cloud Infrastructure: Lead cloud infrastructure adoption and migration, ensuring a seamless transition with minimal downtime.
  • Problem Resolution: Run problem bridges by collaborating with different functional and technical teams, escalating issues as needed for timely resolution.
  • Information Sharing: Proactively share important context and information with relevant stakeholders.
  • Operational Excellence: Spearhead the enhancement of operational practices focusing on efficiency, security, and excellence.

This is a hybrid position. Expectations of days in office will be confirmed by your Hiring Manager.

Qualifications

Basic Qualifications:

  • 14 or more years of work experience with a Bachelor's Degree or at least 12+ years of work experience with an Advanced Degree (e.g. Masters/ MBA/JD/MD) or at least 10+ years of work experience with a PhD

Education and Experience:

  • 14+ years of work experience in Site Reliability Engineering.
  • 10+ years of experience with JAVA, J2EE applications, and a deep understanding of Web Services technologies: REST & SOAP.
  • 5+ years of experience managing applications on Containers (Docker) and Cloud (AWS, GCP, Azure).

Technical Skills:

  • Strong understanding of relational databases and middleware stacks (IIS, .NET, Java, TcServer, JBoss, Containers).
  • Knowledge of Generative AI capabilities and use cases.
  • Advanced level programming and or scripting in 3 or more of the following: Python, Java, Go, PowerShell, JavaScript, Terraform, Ansible, Helm, Chef, Cloud Formation
  • Proficiency in CI CD tooling such as Jenkins, Github, Bitbucket, ArgoCD, Artifactory, Bitbucket, Azure DevOps in a large-scale environment Experience in OO design and design patterns.
  • Proficiency in observability tooling such as Grafana, Prometheus, Splunk, Datadog, New Relic, Dynatrace, Sentry, etc. in a large-scale environment
  • Experience with Docker and Kubernetes.
  • Experience with integrating third-party Web Services.
  • Deep understanding of SRE principles: SLAs, SLOs, SLIs, error budgets, incident response, and postmortems.
  • Capacity planning: Anticipates future growth and ensures systems scale accordingly

Leadership and Communication:

  • 5+ years of leading and building Site Reliability teams.
  • Strong work ethic, self-starter, ability to work in a fast-paced, team-oriented environment, and comfortable working with a global team.
  • Exceptional analytical and problem-solving skills, along with strong oral and written communication abilities.
  • Proven proficiency in troubleshooting, root-cause analysis, application design, and implementing major components for large projects.
  • Experience in creating tools to automate production support activities.
  • •Knowledge of monitoring tools and observability practices
  • Curiosity and Continuous Learning – Keeps up with evolving technologies and best practices in reliability engineering.
  • Partnering with product development, product management, engineering, design, and operations teams is also crucial.
  • Experience in fast-paced 24x7 environments, demonstrating adaptability, empathy, and confident decision-making.
Additional Information

Visa is an EEO Employer. Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability or protected veteran status. Visa will also consider for employment qualified applicants with criminal histories in a manner consistent with EEOC guidelines and applicable local law.



  • Bengaluru, Karnataka, India Landmark Group Full time

    Job Title: SRE Lead (Engineering & Reliability)Job Summary:We are seeking an experienced and dynamic Site Reliability Engineering (SRE) Lead to oversee the reliability, scalability, and performance of our critical systems. As an SRE Lead, you will play a pivotal role in establishing and implementing SRE practices, leading a team of engineers, and driving...


  • Bengaluru, Karnataka, India Landmark Group Full time ₹ 8,00,000 - ₹ 12,00,000 per year

    Job Title:SRE Lead (Engineering & Reliability)Job Summary:We are seeking an experienced and dynamicSite Reliability Engineering (SRE) Leadto oversee the reliability, scalability, and performance of our critical systems. As an SRE Lead, you will play a pivotal role in establishing and implementing SRE practices, leading a team of engineers, and driving...


  • Bengaluru, Karnataka, India Nike Full time ₹ 8,00,000 - ₹ 12,00,000 per year

    Who You'll Work WithSRE hired will work as an Reliability Engineer with the engineering teams. The candidate will belong to a horizontal domain called TechOps: Resilience Engineering. This position will provide a provision for the SRE to shift between multiple engineering platforms as demanded by the work, vision and/or criticality of the projects. Roles and...


  • Bengaluru, Karnataka, India Landmark Group Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    COMPANY- LANDMARK GROUPJob Title: SRE Lead (Engineering & Reliability)Experience: 8-12 yearsJob Summary:We are seeking an experienced and dynamic Site Reliability Engineering (SRE) Lead tooversee the reliability, scalability, and performance of our critical systems. As an SRE Lead,you will play a pivotal role in establishing and implementing SRE practices,...


  • Bengaluru, Karnataka, India AppHelix Full time ₹ 9,00,000 - ₹ 12,00,000 per year

    Role DescriptionThis is a full-time on-site role located in Bengaluru for a Site Reliability Engineer. The Site Reliability Engineer will be responsible for maintaining and improving the reliability of AppHelix's systems. Daily tasks include monitoring system performance, troubleshooting issues, managing infrastructure, and supporting software development....


  • Bengaluru, Karnataka, India NatWest Group Full time ₹ 15,00,000 - ₹ 25,00,000 per year

    Site Reliability Engineer, AVP Join us as a Site Reliability EngineerYou'll manage the provision of stable, resilient, reliable applications with the end goal of minimising disruption to Customer & Colleague Journeys (CCJ) We'll look to you to identify and automate manual tasks and implement observability solutions, ensuring a thorough understanding of...


  • Bengaluru, Karnataka, India Chevron Full time ₹ 20,00,000 - ₹ 25,00,000 per year

    Total Number of Openings2About the position:Come join our Subsurface Digital Platform where we are driving continuous innovations to improve reliability, scalability and sustainability of Chevron business via Chevron's Digital Transformation. We are seeking a T-shaped dynamic Senior Site Reliability Engineer to lead and provide end-to-end solution support...


  • Bengaluru, Karnataka, India Programming Full time ₹ 10,00,000 - ₹ 25,00,000 per year

    Role - Site Reliability Engineering.Location - BengaluruYears of Expereince - 4+ YearsProfessional & Technical Skills:Must To Have Skills: Proficiency in Site Reliability Engineering.Good To Have Skills: Experience with cloud service providers such as AWS, Azure, or Google Cloud.Strong understanding of CI/CD tools and practices.Experience with container...


  • Bengaluru, Karnataka, India Booking Holdings Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Role Description:Engineering Manager - Site Reliability - Private CloudOur mission at is to create transformative, innovative, and personalized travel experiences for millions of customers all across the world. We want customers to have an amazing experience wherever and whenever they choose: mobile, web, and through partners and 3rd parties.About the team...


  • Bengaluru, Karnataka, India FIS Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    About the Role :Site Reliability Engineer (SRE)with deep expertise inMainframe technologies like COBOL, JCL, etc. to support and enhance ourCard Management & Payment processing functions. This role will be responsible for ensuring reliability, high availability, scalability, stability and performance of mission-critical mainframe software applications and...