
Staff Site Reliability Engineer
2 days ago
About The Role :
We are looking for a highly experienced Staff Site Reliability Engineer (SRE) to drive the reliability, performance, and operational excellence of our core production systems.
This is a senior, hands-on role that requires deep expertise in large-scale distributed systems, complex incident management, and building world-class observability platforms.
Key Responsibilities :
Reliability Engineering :
- Define, measure, and enforce Service Level Objectives (SLOs) and Service Level Indicators (SLIs) for critical platform services.
- Drive down toil by promoting self-service and automation.
Observability Platform :
- Lead the design and implementation of our global observability stack, including metric collection (Prometheus/M3DB), distributed tracing (Jaeger/OpenTelemetry), and logging (Loki/Elasticsearch).
Incident Management :
- Act as a technical leader during high-severity incidents, perform in-depth Root Cause Analysis (RCA), and implement long-term preventative measures.
Performance Tuning :
- Conduct performance analysis and capacity planning for the entire platform, identifying and removing infrastructure and application bottlenecks.
Security & Compliance :
- Partner with the security team to enforce security controls and best practices across the infrastructure layer.
Mentorship & Evangelism :
- Mentor SRE and DevOps teams, and evangelize reliability best practices and engineering excellence across all product development teams.
Technical Skills (Must-Have) :
Distributed Systems :
- Proven experience designing, running, and debugging large-scale distributed systems and microservices in a high-traffic environment.
Cloud & Kubernetes :
- Expert proficiency in managing highly available Kubernetes clusters (e.g., K8s on GCP/AWS/Azure) and their underlying cloud resources.
Observability Stack :
- Deep, hands-on experience with modern observability tools (Prometheus, Grafana, Jaeger/OpenTelemetry).
Programming/Scripting :
- Expert in at least one modern programming language (such as Go or Python) for writing operators, automation tooling, and extending monitoring systems.
Infrastructure as Code (IaC) :
- Advanced knowledge of Terraform for managing multi-cloud infrastructure.
Networking :
- Advanced understanding of network concepts in a cloud/container environment (service mesh, network policies, load balancing).
Qualifications :
- Bachelor's or Master's degree in Computer Science or a related technical field.
- Years of professional experience in SRE, DevOps, or Infrastructure Engineering roles.
- A track record of implementing reliability improvements that measurably improve SLO adherence.