Staff Site Reliability Engineer

22 hours ago

Bengaluru, Karnataka, India Aerospike Full time ₹ 8,00,000 - ₹ 20,00,000 per year

About Aerospike

Aerospike is the real-time database for mission-critical use cases and workloads, including machine learning, generative, and agentic AI. Aerospike powers millions of transactions per second with millisecond latency, at a fraction of the total cost of ownership compared to other databases.

Global leaders, including Adobe, Airtel, Barclays, Criteo, DBS Bank, Experian, Grab, HDFC Bank, PayPal, Sony Interactive Entertainment, The Trade Desk, and Wayfair, rely on Aerospike for customer 360, fraud detection, real-time bidding, profile stores, recommendation engines, and other use cases

Headquartered in Mountain View, California, Aerospike has a global presence with offices in London, Bangalore, and Tel Aviv.

In Bengaluru we follow hybrid models with mandate two days' work from office.

Site Reliability Engineer

As a Staff Site Reliability Engineer (SRE) for Aerospike, you will be instrumental in architecting, building, and optimizing enterprise-scale, highly resilient cloud platform infrastructure and services. You will focus on establishing reliability, performance, and automation standards to ensure seamless delivery and operation across our cloud platform ecosystem. Your responsibilities will include driving robust infrastructure initiatives across multiple teams, implementing organization-wide monitoring and observability practices, and leading strategic improvement initiatives that enhance system efficiency, scalability, and overall platform stability at enterprise scale.

Key Responsibilities

Architecting, deploying, and optimizing enterprise-scale Aerospike cloud platform infrastructure and services across multiple environments
Driving the development and standardization of automation, tooling, and infrastructure solutions across multiple engineering teams to improve efficiency at scale
Building and establishing monitoring, alerting, and observability standards and implementations across the organization with cutting-edge solutions and best practices
Leading complex incident response activities across multiple teams, conducting detailed root cause analysis, and driving systematic improvements
Establishing and implementing security best practices and standards for cloud platform infrastructure and services impacting multiple teams
Collaborating with development teams and engineering leadership to ensure reliable service delivery and alignment with enterprise-scale SRE best practices
Serving as escalation point for critical production incidents, coordinating cross-team mitigation strategies
Establishing documentation standards, runbooks, and knowledge sharing practices for operational excellence
Leading capacity planning and performance optimization efforts at enterprise scale
Mentoring engineers across teams and sharing knowledge to build technical capabilities

Required Experience

8+ years of experience in Site Reliability Engineering (SRE), DevOps, or related fields, with a focus on architecting scalable, resilient, and automated enterprise-scale systems
Experience leading complex infrastructure projects, driving measurable improvements in system reliability and performance
Deep knowledge of multiple public cloud providers (AWS, Google Cloud, Azure), including advanced cloud-native services and architectures
Advanced proficiency in automation, tooling, and infrastructure solutions to enable enterprise-scale automated and reproducible infrastructure
Extensive experience in CI/CD pipeline design and implementation, enabling seamless, automated software delivery and infrastructure updates at scale
Deep understanding of Linux/Unix systems, advanced networking concepts, and distributed system architectures
Comprehensive proficiency in scripting and software development using Python, Bash, Go, or similar languages to build sophisticated automation, tooling, and infrastructure solutions
Extensive experience with containerization and orchestration technologies such as Docker and Kubernetes for enterprise-scale service deployment and scaling
In-depth experience with monitoring, logging, and observability tools and methodologies to drive data-driven system improvements across multiple teams
Advanced problem-solving skills with an engineering-first mindset for improving system reliability, scalability, and performance at enterprise scale
Extensive experience implementing security best practices for cloud infrastructure, access control, and data protection across multiple teams
Excellent communication and influence skills to collaborate effectively across multiple teams and drive technical decisions

Preferred Skills and Qualifications

Extensive experience managing and optimizing database deployments and services in production environments at enterprise scale, ensuring high availability and performance
Deep expertise with Aerospike or other distributed NoSQL databases, including advanced features and enterprise-scale deployment optimization
Comprehensive understanding of security principles and implementation in complex cloud environments across multiple teams
Advanced industry certifications, such as AWS Solutions Architect Professional, Google Professional Cloud Architect, Azure Solutions Architect Expert, or equivalent
Advanced Kubernetes certifications (CKA, CKD, CKS) with extensive experience managing Kubernetes at enterprise scale
Advanced proficiency with configuration management and automation tools in complex, multi-team environments
Experience leading technical initiatives, mentoring, and driving best practices across multiple engineering teams.

Aerospike is an Equal Opportunity Employer. We are committed to providing an environment free from discrimination on the basis of race, religion, color, sex, gender identity, sexual orientation, age, non-disqualifying physical or mental disability, national origin, veteran status, or any other basis covered by appropriate law.

Staff Site Reliability Engineer

2 weeks ago

Bengaluru, Karnataka, India Okta Full time ₹ 8,00,000 - ₹ 24,00,000 per year

Join our team Were building a world where Identity belongs to you.Oktas Workforce Identity Cloud Security Engineering group is looking for a Staff Site Reliability Engineer with a passion for DevSecOps , Infrastructure Security , and SRE . Join a team that is not just building solutions but redefining the standards for cloud security. If you have a proven...
Senior Staff Engineer- Site Reliability

19 hours ago

Bengaluru, Karnataka, India Straatix Technology Labs Full time ₹ 12,00,000 - ₹ 36,00,000 per year

Only applications submitted through the provided link will be taken into consideration.Your Role at a Glance:We are hiring a Senior Staff Backend Engineer Site Reliability for our Code Name: SORIN, a global leader building high-scale observability platforms. In this high-impact leadership role, youll architect, scale, and optimize the systems that drive how...
Staff Site Reliability Engineer

4 days ago

Bengaluru, Karnataka, India Visa Full time ₹ 12,00,000 - ₹ 36,00,000 per year

Company Description Visa is a world leader in payments and technology, with over 259 billion payments transactions flowing safely between consumers, merchants, financial institutions, and government entities in more than 200 countries and territories each year. Our mission is to connect the world through the most innovative, convenient, reliable, and secure...
Site Reliability Engineering

2 weeks ago

Bengaluru, Karnataka, India Thakral One Full time US$ 60,000 - US$ 1,20,000 per year

Company DescriptionThakral One, headquartered in Singapore, is a technology consulting and services company with a strong presence across Asia. The company specializes in technology-driven consulting, custom solution development, data analytics, and leveraging cloud capabilities to deliver enhanced decision support and practical outcomes. Collaborating...
Site Reliability Engineering

2 weeks ago

Bengaluru, Karnataka, India Viraaj HR Solutions Private Limited Full time ₹ 12,00,000 - ₹ 36,00,000 per year

Site Reliability Engineer (SRE)About The OpportunityA fast-growing organization in the Enterprise Cloud Infrastructure & SaaS sector delivering highly available, mission-critical services to enterprise customers. We are hiring an on-site Site Reliability Engineer in India to own reliability, automation, and operational excellence across cloud-native...
Site Reliability Engineer

6 days ago

Bengaluru, Karnataka, India super Full time ₹ 12,00,000 - ₹ 24,00,000 per year

Site Reliability Engineer (SRE) Level 3Overview:A Site Reliability Engineer (SRE) Level 3 is a senior technical leadership role focused on designing, implementing, and maintaining large-scale, complex, and highly reliable systems. This role emphasizes a blend of software and systems engineering to ensure the availability, latency, performance, and capacity...
Site Reliability Engineer

4 days ago

Bengaluru, Karnataka, India eBay Full time ₹ 12,00,000 - ₹ 36,00,000 per year

At eBay, we're more than a global ecommerce leader — we're changing the way the world shops and sells. Our platform empowers millions of buyers and sellers in more than 190 markets around the world. We're committed to pushing boundaries and leaving our mark as we reinvent the future of ecommerce for enthusiasts.Our customers are our compass, authenticity...
Staff Site Reliability Engineer

6 days ago

Bengaluru, Karnataka, India Zinnia Full time ₹ 12,00,000 - ₹ 36,00,000 per year

WHO WE ARE:Zinnia is the leading technology platform for accelerating life and annuities growth. With innovative enterprise solutions and data insights, Zinnia simplifies the experience of buying, selling, and administering insurance products. All of which enables more people to protect their financial futures. Our success is driven by a commitment to three...
Staff Site Reliability Engineer

6 days ago

Bengaluru, Karnataka, India Zinnia Full time ₹ 12,00,000 - ₹ 36,00,000 per year

WHO WE ARE: Zinnia is the leading technology platform for accelerating life and annuities growth. With innovative enterprise solutions and data insights, Zinnia simplifies the experience of buying, selling, and administering insurance products. All of which enables more people to protect their financial futures. Our success is driven by a commitment to three...
Staff Site Reliability Engineer, Auth0

2 days ago

Bengaluru, Karnataka, India Okta Full time ₹ 12,00,000 - ₹ 48,00,000 per year

Get to know OktaOkta is The World's Identity Company. We free everyone to safely use any technology, anywhere, on any device or app. Our flexible and neutral products, Okta Platform and Auth0 Platform, provide secure access, authentication, and automation, placing identity at the core of business security and growth.At Okta, we celebrate a variety of...

Americas

Europe

Asia / Oceania

Africa

Staff Site Reliability Engineer