Staff Site Reliability Engineer
10 hours ago
About The Role :
We are looking for a highly experienced Staff Site Reliability Engineer (SRE) to drive the reliability, performance, and operational excellence of our core production systems.
This is a senior, hands-on role that requires deep expertise in large-scale distributed systems, complex incident management, and building world-class observability platforms.
Key Responsibilities :
Reliability Engineering :
- Define, measure, and enforce Service Level Objectives (SLOs) and Service Level Indicators (SLIs) for critical platform services.
- Drive down toil by promoting self-service and automation.
Observability Platform :
- Lead the design and implementation of our global observability stack, including metric collection (Prometheus/M3DB), distributed tracing (Jaeger/OpenTelemetry), and logging (Loki/Elasticsearch).
Incident Management :
- Act as a technical leader during high-severity incidents, perform in-depth Root Cause Analysis (RCA), and implement long-term preventative measures.
Performance Tuning :
- Conduct performance analysis and capacity planning for the entire platform, identifying and resolving infrastructure and application bottlenecks.
Security & Compliance :
- Partner with the security team to enforce security controls and best practices across the infrastructure layer.
Mentorship & Evangelism :
- Mentor SRE and DevOps teams, and evangelize reliability best practices and engineering excellence across all product development teams.
Technical Skills (Must-Have) :
Distributed Systems :
- Proven experience designing, running, and debugging large-scale distributed systems and microservices in a high-traffic environment.
Cloud & Kubernetes :
- Expert proficiency in managing highly available Kubernetes clusters (e.g., K8s on GCP/AWS/Azure) and their underlying cloud resources.
Observability Stack :
- Deep, hands-on experience with modern observability tools (Prometheus, Grafana, Jaeger/OpenTelemetry).
Programming/Scripting :
- Expert in at least one modern programming language (Go/Python) for writing operators, automation tooling, and extending monitoring systems.
Infrastructure as Code (IaC) :
- Advanced knowledge of Terraform for managing multi-cloud infrastructure.
Networking :
- Advanced understanding of network concepts in a cloud/container environment (service mesh, network policies, load balancing).
Qualifications :
- Bachelor's or Master's degree in Computer Science or a related technical field.
- Years of professional experience in SRE, DevOps, or Infrastructure Engineering roles.
- History of successfully implementing reliability improvements that result in measurable SLO adherence.