Staff Site Reliability Engineer

6 days ago


Bengaluru, Karnataka, India Aerospike Full time US$ 1,25,000 - US$ 1,75,000 per year

About Aerospike

Aerospike is the real-time database for mission-critical use cases and workloads, including machine learning, generative, and agentic AI. Aerospike powers millions of transactions per second with millisecond latency, at a fraction of the total cost of ownership compared to other databases.

Global leaders, including Adobe, Airtel, Barclays, Criteo, DBS Bank, Experian, Grab, HDFC Bank, PayPal, Sony Interactive Entertainment, The Trade Desk, and Wayfair, rely on Aerospike for customer 360, fraud detection, real-time bidding, profile stores, recommendation engines, and other use cases

Headquartered in Mountain View, California, Aerospike has a global presence with offices in London, Bangalore, and Tel Aviv.

In Bengaluru we follow hybrid models with mandate two days' work from office.

Site Reliability Engineer

As a Staff Site Reliability Engineer (SRE) for Aerospike, you will be instrumental in architecting, building, and optimizing enterprise-scale, highly resilient cloud platform infrastructure and services. You will focus on establishing reliability, performance, and automation standards to ensure seamless delivery and operation across our cloud platform ecosystem. Your responsibilities will include driving robust infrastructure initiatives across multiple teams, implementing organization-wide monitoring and observability practices, and leading strategic improvement initiatives that enhance system efficiency, scalability, and overall platform stability at enterprise scale.

Key Responsibilities

  • Architecting, deploying, and optimizing enterprise-scale Aerospike cloud platform infrastructure and services across multiple environments
  • Driving the development and standardization of automation, tooling, and infrastructure solutions across multiple engineering teams to improve efficiency at scale
  • Building and establishing monitoring, alerting, and observability standards and implementations across the organization with cutting-edge solutions and best practices
  • Leading complex incident response activities across multiple teams, conducting detailed root cause analysis, and driving systematic improvements
  • Establishing and implementing security best practices and standards for cloud platform infrastructure and services impacting multiple teams
  • Collaborating with development teams and engineering leadership to ensure reliable service delivery and alignment with enterprise-scale SRE best practices
  • Serving as escalation point for critical production incidents, coordinating cross-team mitigation strategies
  • Establishing documentation standards, runbooks, and knowledge sharing practices for operational excellence
  • Leading capacity planning and performance optimization efforts at enterprise scale
  • Mentoring engineers across teams and sharing knowledge to build technical capabilities
Required Experience
  • 8 years of experience in Site Reliability Engineering (SRE), DevOps, or related fields, with a focus on architecting scalable, resilient, and automated enterprise-scale systems
  • Experience leading complex infrastructure projects, driving measurable improvements in system reliability and performance
  • Deep knowledge of multiple public cloud providers (AWS, Google Cloud, Azure), including advanced cloud-native services and architectures
  • Advanced proficiency in automation, tooling, and infrastructure solutions to enable enterprise-scale automated and reproducible infrastructure
  • Extensive experience in CI/CD pipeline design and implementation, enabling seamless, automated software delivery and infrastructure updates at scale
  • Deep understanding of Linux/Unix systems, advanced networking concepts, and distributed system architectures
  • Comprehensive proficiency in scripting and software development using Python, Bash, Go, or similar languages to build sophisticated automation, tooling, and infrastructure solutions
  • Extensive experience with containerization and orchestration technologies such as Docker and Kubernetes for enterprise-scale service deployment and scaling
  • In-depth experience with monitoring, logging, and observability tools and methodologies to drive data-driven system improvements across multiple teams
  • Advanced problem-solving skills with an engineering-first mindset for improving system reliability, scalability, and performance at enterprise scale
  • Extensive experience implementing security best practices for cloud infrastructure, access control, and data protection across multiple teams
  • Excellent communication and influence skills to collaborate effectively across multiple teams and drive technical decisions

Preferred Skills and Qualifications

  • Extensive experience managing and optimizing database deployments and services in production environments at enterprise scale, ensuring high availability and performance
  • Deep expertise with Aerospike or other distributed NoSQL databases, including advanced features and enterprise-scale deployment optimization
  • Comprehensive understanding of security principles and implementation in complex cloud environments across multiple teams
  • Advanced industry certifications, such as AWS Solutions Architect Professional, Google Professional Cloud Architect, Azure Solutions Architect Expert, or equivalent
  • Advanced Kubernetes certifications (CKA, CKD, CKS) with extensive experience managing Kubernetes at enterprise scale
  • Advanced proficiency with configuration management and automation tools in complex, multi-team environments
  • Experience leading technical initiatives, mentoring, and driving best practices across multiple engineering teams.

Aerospike is an Equal Opportunity Employer. We are committed to providing an environment free from discrimination on the basis of race, religion, color, sex, gender identity, sexual orientation, age, non-disqualifying physical or mental disability, national origin, veteran status, or any other basis covered by appropriate law.



  • Bengaluru, Karnataka, India Procore Technologies Full time ₹ 15,00,000 - ₹ 20,00,000 per year

    Job DescriptionWe're looking for aStaff Site Reliability Engineerto join Procore's Infrastructure Platform division to work on our commercial initiatives. In this role, you'll help build Procore's next-generation construction compute platform for others to build upon, including Procore developers, analysts, partners, and customers.Procore software solutions...


  • Bengaluru, Karnataka, India Visa Full time

    Company Description Visa is a world leader in payments and technology with over 259 billion payments transactions flowing safely between consumers merchants financial institutions and government entities in more than 200 countries and territories each year Our mission is to connect the world through the most innovative convenient reliable and secure...


  • Bengaluru, Karnataka, India Visa Full time ₹ 4,00,000 - ₹ 8,00,000 per year

    Company Description Visa is a world leader in payments and technology, with over 259 billion payments transactions flowing safely between consumers, merchants, financial institutions, and government entities in more than 200 countries and territories each year. Our mission is to connect the world through the most innovative, convenient, reliable, and secure...


  • Bengaluru, Karnataka, India Programming Full time ₹ 1,04,000 - ₹ 1,30,878 per year

    Role - Site Reliability Engineering.Location - BengaluruYears of Expereince - 4+ YearsProfessional & Technical Skills:Must To Have Skills: Proficiency in Site Reliability Engineering.Good To Have Skills: Experience with cloud service providers such as AWS, Azure, or Google Cloud.Strong understanding of CI/CD tools and practices.Experience with container...


  • Bengaluru, Karnataka, India Visa Full time

    Company DescriptionVisa is a world leader in payments and technology with over 259 billion payments transactions flowing safely between consumers merchants financial institutions and government entities in more than 200 countries and territories each year Our mission is to connect the world through the most innovative convenient reliable and secure...


  • Bengaluru, Karnataka, India Enterprise Minds, Inc Full time

    We're Hiring | Site Reliability Engineer | 8-10 years


  • Bengaluru, Karnataka, India FOSS United Full time ₹ 1,04,000 - ₹ 1,30,878 per year

    All JobsSite Reliability Engineer at ZEISS IndiaSite Reliability EngineerApplyPosted on September 11, 2025ZEISS IndiaKadubeesanahalli, BengaluruFull TImeJob DescriptionZEISS in IndiaZEISS in India is headquartered in Bengaluru and present in the fields of Industrial Quality Solutions, Research Microscopy Solutions, Medical Technology, Vision Care and Sports...


  • Bengaluru, Karnataka, India WhiteLotus Talent Partners Full time

    We are looking for a L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes . In this role, you will focus on monitoring , basic troubleshooting , and incident response , helping to maintain high...


  • Bengaluru, Karnataka, India H&M Full time ₹ 1,04,000 - ₹ 1,30,878 per year

    Job DescriptionWe are looking for a Site Reliability Engineer within eCommerce with experience of Headless SaaS (e.g., a headless CMS experience) and API based commerce frameworks and managed cloud services (e.g. managed Kubernetes). You will work within our SRE Capability supporting the next generation customer experience by blending fashion and tech. You...


  • Bengaluru, Karnataka, India Randstad Full time

    Role: Site Reliability Engineer SummaryThe Network Engineer 2 provides technical design, planning, operation, maintenance, and advanced troubleshooting of the Bread Financials' network infrastructure. This position ensures continuity and alignment of the network administration/engineering direction. This position supports Bread Financials' strategies and...