
Site Reliability Engineer
1 day ago
About Aerospike
Aerospike is the real-time database for mission-critical use cases and workloads, including machine learning, generative, and agentic AI. Aerospike powers millions of transactions per second with millisecond latency, at a fraction of the total cost of ownership compared to other databases.
Global leaders, including Adobe, Airtel, Barclays, Criteo, DBS Bank, Experian, Grab, HDFC Bank, PayPal, Sony Interactive Entertainment, The Trade Desk, and Wayfair, rely on Aerospike for customer 360, fraud detection, real-time bidding, profile stores, recommendation engines, and other use cases
Headquartered in Mountain View, California, Aerospike has a global presence with offices in London, Bangalore, and Tel Aviv.
In Bengaluru we follow hybrid models with mandate two days' work from office.
Site Reliability EngineerAs a Site Reliability Engineer (SRE) for Aerospike, you will play a crucial role in building and improving the reliability, performance, and scalability of our cloud platform. You will contribute to developing robust infrastructure, implementing monitoring solutions, and ensuring the reliability of our mission-critical cloud infrastructure and services. This role offers excellent opportunities for growth and learning in a fast-paced, innovative environment.
Key Responsibilities
- Deploying, monitoring, and optimizing Aerospike's cloud platform infrastructure and services across multiple environments
- Developing and enhancing automation and infrastructure-as-code solutions to improve operational efficiency
- Building monitoring, alerting, and observability implementations to help detect and resolve system issues proactively
- Participating in incident response activities, learning from post-mortems, and driving continuous improvement initiatives
- Implementing security best practices for cloud infrastructure and access control
- Collaborating with development teams to ensure reliable service delivery
- Participating in on-call rotation, responding to critical incidents and minimizing downtime through proactive mitigation strategies.
- Creating and maintaining documentation, runbooks, and system configurations for team knowledge sharing
- Working on capacity planning and performance optimization efforts
- Enhancing CI/CD pipeline improvements and deployment automation
- 3 years of experience in Site Reliability Engineering, DevOps, Infrastructure Engineering, or related technical fields
- Experience with at least one major public cloud provider (AWS, Google Cloud, or Azure) and basic understanding of cloud services
- Familiarity with infrastructure-as-code tools such as Terraform or CloudFormation
- Basic experience with CI/CD pipelines and automated deployment practices
- Understanding of Linux/Unix systems administration and basic networking concepts
- Experience with scripting languages such as Python, Bash, Go, or similar for automation tasks
- Exposure to containerization technologies such as Docker and basic Kubernetes concepts
- Familiarity with monitoring and logging tools (e.g., Prometheus, Grafana, CloudWatch, or similar)
- Strong problem-solving skills and eagerness to learn new technologies
- Good communication skills and ability to work collaboratively in a team environment
Preferred Skills and Qualifications
- Experience with database systems, preferably NoSQL databases
- Understanding of basic security practices in cloud environments
- Familiarity with Aerospike or other distributed databases
- Industry certifications such as AWS Cloud Practitioner, Google Cloud Associate, or Azure Fundamentals
- Exposure to configuration management tools (Ansible or similar)
- Experience with version control systems (Git) and collaborative development practices
Aerospike is an Equal Opportunity Employer. We are committed to providing an environment free from discrimination on the basis of race, religion, color, sex, gender identity, sexual orientation, age, non-disqualifying physical or mental disability, national origin, veteran status, or any other basis covered by appropriate law.
-
Site Reliability Engineer
6 days ago
Bengaluru, Karnataka, India Enterprise Minds, Inc Full timeWe're Hiring | Site Reliability Engineer | 8-10 years
-
site reliability engineer
4 days ago
Bengaluru, Karnataka, India Randstad Full timeRole: Site Reliability Engineer SummaryThe Network Engineer 2 provides technical design, planning, operation, maintenance, and advanced troubleshooting of the Bread Financials' network infrastructure. This position ensures continuity and alignment of the network administration/engineering direction. This position supports Bread Financials' strategies and...
-
Site Reliability Engineer
1 week ago
Bengaluru, Karnataka, India WhiteLotus Talent Partners Full timeWe are looking for a L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes . In this role, you will focus on monitoring , basic troubleshooting , and incident response , helping to maintain high...
-
Site Reliability Engineer
4 days ago
Bengaluru, Karnataka, India WhiteLotus Talent Partners Full timeWe are looking for a L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes. In this role, you will focus on monitoring, basic troubleshooting, and incident response, helping to maintain high system...
-
Site Reliability Engineer
1 day ago
Bengaluru, Karnataka, India TRUGlobal Full time ₹ 9,00,000 - ₹ 12,00,000 per yearJob Title: Site Reliability Engineer (SRE) with Python Development ExpertisePosition Overview: We are seeking a skilled Site Reliability Engineer (SRE) with strong Python development experience to join our team. The ideal candidate will be responsible for ensuring the reliability, availability, and performance of our services across both on-premises and...
-
Site Reliability Engineer
1 week ago
Bengaluru, Karnataka, India beBeeReliability Full time ₹ 2,00,00,000 - ₹ 2,50,00,000Role OverviewAs a Site Reliability Engineer, you will play a pivotal role in driving innovation and modernizing complex systems by leveraging cutting-edge technologies and collaboration with cross-functional teams.
-
Site Reliability Engineer
1 day ago
Bengaluru, Karnataka, India IDESLABS PRIVATE LIMITED Full time US$ 90,000 - US$ 1,20,000 per yearExperience: 5+ YearsSkill:Site reliability engineerLocation: BangaloreNotice Period:Immediate.Employment Type: ContractWorking Mode: HybridJob DescriptionSite Reliability Engineer Tech StackPrimaryAWSTerraformAnsibleDockerSecondaryPythonBashGithubJenkins
-
Site Reliability Engineer
8 hours ago
Bengaluru, Karnataka, India Success Pact Consulting Pvt Ltd Full timePosition : Site Reliability EngineerExperience : 5 - 9 YearsLocation : Bangalore, IndiaJob Summary : We are seeking an experienced Site Reliability Engineer (SRE) with 5-9 years of experience to join our Platform Engineering team. This role is crucial for ensuring the high availability, performance, and scalability of our AI-powered code review platform....
-
Site Reliability Engineer
6 days ago
Bengaluru, Karnataka, India Coforge Full timeJob Description- Design, implement, and maintain scalable infrastructure to ensure high availability and performance of software applications.- Collaborate with development teams to identify and resolve issues affecting application performance, stability, and reliability.- Develop automated monitoring scripts using tools like Prometheus, Grafana, etc. to...
-
Site Reliability Engineering
6 days ago
Bengaluru, Karnataka, India Infrasoft Technologies Limited Full timeJob DescriptionJob Title: DeveloperWork Location: Bangalore, KarnatakaExperience Range: 68 YearsJob Description:We are looking for a skilled Developer with strong hands-on experience in Site Reliability Engineering (SRE), Java, JavaScript, and Production Support. The ideal candidate should have a solid background in application monitoring and troubleshooting...