Sr Site Reliability Engineer
2 days ago
Visa is a world leader in payments and technology, with over 259 billion payments transactions flowing safely between consumers, merchants, financial institutions, and government entities in more than 200 countries and territories each year. Our mission is to connect the world through the most innovative, convenient, reliable, and secure payments network, enabling individuals, businesses, and economies to thrive while driven by a common purpose – to uplift everyone, everywhere by being the best way to pay and be paid.
Make an impact with a purpose-driven industry leader. Join us today and experience Life at Visa.
Job Description
The Opportunity
We are seeking a skilled and innovative Sr. Site Reliability Engineer to join our team and help solve complex challenges on a global scale. The Middleware Product Reliability Engineering (PRE) group is dedicated to ensuring our products and services operate with Always On availability, exceptional reliability, and outstanding performance.
This role is ideal for software engineers with AI & ML backgrounds who want to apply their skills to large-scale reliability engineering. You'll develop intelligent automation systems and integrate machine learning models into our middleware infrastructure, working at the intersection of software development, artificial intelligence, and site reliability.
As a Visa Sr. Site Reliability Engineer, you will be an integral part of a cross-functional team inventing, designing, building, testing, and operating software products that reach a truly global customer base. While building and supporting components of cutting-edge payment technology, you will see your efforts shaping the digital future of monetary transactions.
What You'll Do
Support Middleware software and infrastructure components for all lines of business at Visa
Design and develop software solutions for middleware reliability using AI & ML techniques
Develop and improve Middleware monitoring and observability systems
Build intelligent automation systems leveraging machine learning models
Develop and maintain automation tools and integrations into existing AI & LLM frameworks to handle application support tasks
Collaborate with data science teams to integrate AI-driven insights into reliability engineering
Engage in production issue troubleshooting, provide immediate service restoration, follow up on root cause analysis, and ensure permanent fixes are implemented
Coordinate and execute Middleware releases and production deployments
Optimize performance and tuning for Middleware applications
The Skills You Bring
Collaboration and Communication: You possess strong interpersonal skills and excel at both written and verbal communication. You thrive in team environments and collaborate effectively with globally dispersed virtual teams.
Learning and Growth: You are adaptable and eager to learn new technologies and tools. You enjoy sharing knowledge with others and contributing to collective team growth.
Innovation and Problem-Solving: You are comfortable exploring beyond traditional solutions and embrace new technologies and innovative approaches. You excel at analytical thinking and creative problem-solving.
Decision-Making and Prioritization: You effectively prioritize, multitask, and deliver quality work on time. You can make informed decisions on execution timelines and maintain focus in high-pressure situations.
Professional Development: You take initiative in your work and demonstrate a strong sense of ownership. You are motivated to learn new technologies and business concepts to facilitate both personal and organizational growth.
Professional Ethics: You demonstrate strong business ethics, self-discipline, and trustworthiness, particularly when handling sensitive and confidential data in live production environments.
Technical Troubleshooting: You possess strong analytical and problem-solving skills with the ability to swiftly identify and resolve complex technical issues. You excel at debugging, performance tuning, and root cause analysis. You are proactive in anticipating potential problems and implementing preventative measures to minimize disruptions.
This is a hybrid position. Expectation of days in office will be confirmed by your hiring manager.
Qualifications
Basic Qualifications:
- 3 or more years of work experience with a Bachelor's Degree, or more than 2 years of work experience with an Advanced Degree (e.g., Master's, MBA, JD, MD)
Core Skills:
- 3+ years of experience with modern middleware technologies, such as Tomcat, Apache, Spring Boot, SQS, JBoss, IBM MQ, IBM DataPower, Hazelcast, Flink, Connect Direct, and SSL.
- Understanding of Linux/Unix systems, networking, cloud platforms (AWS, Azure, GCP), containerization (Kubernetes, Docker), and infrastructure-as-code tools (Terraform, Ansible).
- Proficiency with monitoring tools (Prometheus, Grafana, Datadog, etc.), logging systems (ELK stack, Splunk), and tracing tools (Jaeger, Zipkin).
- Proven track record of automating complex tasks and processes to improve efficiency and reliability using Python, Go, Java, or similar.
Technical Areas You'll Grow In:
- Cloud & System Architecture: Design scalable, resilient systems across hybrid cloud platforms (AWS, GCP, Azure)
- AI/ML Operations: Support and optimize ML model deployment pipelines and monitoring systems
- Observability & Performance: Master advanced monitoring, tracing, and performance optimization techniques
- Automation & Intelligence: Build smart alerting systems and automated remediation workflows
- Distributed Systems: Design and maintain globally distributed payment processing systems
What Makes You Thrive:
- You're energized by solving complex problems
- You believe in automation over manual processes
- You enjoy mentoring others and sharing knowledge
- You're comfortable with ambiguity and rapid change
- You value building reliable systems over quick fixes
Visa is an EEO Employer. Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability or protected veteran status. Visa will also consider for employment qualified applicants with criminal histories in a manner consistent with EEOC guidelines and applicable local law.