Site Reliability Engineer
5 days ago
About Aerospike

Aerospike is the real-time database for mission-critical use cases and workloads, including machine learning, generative, and agentic AI. Aerospike powers millions of transactions per second with millisecond latency, at a fraction of the total cost of ownership compared to other databases.

Global leaders, including Adobe, Airtel, Barclays, Criteo, DBS Bank, Experian, Grab, HDFC Bank, PayPal, Sony Interactive Entertainment, The Trade Desk, and Wayfair, rely on Aerospike for customer 360, fraud detection, real-time bidding, profile stores, recommendation engines, and other use cases.

Headquartered in Mountain View, California, Aerospike has a global presence with offices in London, Bangalore, and Tel Aviv.

In Bengaluru, we follow a hybrid model with a mandate of two days' work from the office.

Site Reliability Engineer

As a Site Reliability Engineer (SRE) for Aerospike, you will play a crucial role in building and improving the reliability, performance, and scalability of our cloud platform. You will contribute to developing robust infrastructure, implementing monitoring solutions, and ensuring the reliability of our mission-critical cloud infrastructure and services. This role offers excellent opportunities for growth and learning in a fast-paced, innovative environment.

Key Responsibilities
* Deploying, monitoring, and optimizing Aerospike's cloud platform infrastructure and services across multiple environments
* Developing and enhancing automation and infrastructure-as-code solutions to improve operational efficiency
* Building monitoring, alerting, and observability implementations to help detect and resolve system issues proactively
* Participating in incident response activities, learning from post-mortems, and driving continuous improvement initiatives
* Implementing security best practices for cloud infrastructure and access control
* Collaborating with development teams to ensure reliable service delivery
* Participating in the on-call rotation, responding to critical incidents, and minimizing downtime through proactive mitigation strategies
* Creating and maintaining documentation, runbooks, and system configurations for team knowledge sharing
* Working on capacity planning and performance optimization efforts
* Improving CI/CD pipelines and deployment automation

Required Experience
* 3+ years of experience in Site Reliability Engineering, DevOps, Infrastructure Engineering, or related technical fields
* Experience with at least one major public cloud provider (AWS, Google Cloud, or Azure) and a basic understanding of cloud services
* Familiarity with infrastructure-as-code tools such as Terraform or CloudFormation
* Basic experience with CI/CD pipelines and automated deployment practices
* Understanding of Linux/Unix systems administration and basic networking concepts
* Experience with scripting languages such as Python, Bash, Go, or similar for automation tasks
* Exposure to containerization technologies such as Docker and basic Kubernetes concepts
* Familiarity with monitoring and logging tools (e.g., Prometheus, Grafana, CloudWatch, or similar)
* Strong problem-solving skills and eagerness to learn new technologies
* Good communication skills and ability to work collaboratively in a team environment

Preferred Skills and Qualifications
* Experience with database systems, preferably NoSQL databases
* Understanding of basic security practices in cloud environments
* Familiarity with Aerospike or other distributed databases
* Industry certifications such as AWS Cloud Practitioner, Google Cloud Associate, or Azure Fundamentals
* Exposure to configuration management tools (Ansible or similar)
* Experience with version control systems (Git) and collaborative development practices

Aerospike is an Equal Opportunity Employer. We are committed to providing an environment free from discrimination on the basis of race, religion, color, sex, gender identity, sexual orientation, age, non-disqualifying physical or mental disability, national origin, veteran status, or any other basis covered by appropriate law.
-
Site Reliability Engineer
1 week ago
Bangalore, India super Full time
Site Reliability Engineer (SRE) Level 3
Overview: A Site Reliability Engineer (SRE) Level 3 is a senior technical leadership role focused on designing, implementing, and maintaining large-scale, complex, and highly reliable systems. This role emphasizes a blend of software and systems engineering to ensure the availability, latency, performance, and capacity...
-
Site Reliability Engineer
1 day ago
Bangalore, India Pagos Consultants Full time
We are looking for experienced site reliability engineers to join a founding team of startup-minded individuals that will lay the groundwork for our new fintech offering. This team will play a pivotal role in spearheading innovation. As such, you will have the opportunity to shape the early architecture and design of the system and set the trajectory for its...
-
Site Reliability Engineer
1 hour ago
Bangalore, India Enterprise Minds, Inc Full time
Senior Site Reliability Engineer (GCP | Terraform | Ansible | SRE | On-Call)
We are looking for a high-impact Site Reliability Engineer (SRE) who will play a key role in ensuring the reliability, availability, and scalability of our production systems on Google Cloud Platform (GCP). If you thrive in fast-paced environments, excel in incident management, and...
-
Site Reliability Engineer
1 week ago
Bangalore, India Datum Technologies Group Full time
Job Title: Site Reliability Engineer (SRE) – AWS
Experience: 8+ years
Location: Chennai / Mumbai
Work Mode: Hybrid
Key Skills: AWS, Terraform, Kubernetes, Docker, Grafana, Prometheus, Datadog
Job Summary: We are looking for a skilled Site Reliability Engineer (SRE) with strong AWS experience and a solid background in DevOps, automation, observability, and...
-
Site Reliability Engineer
2 weeks ago
Bangalore, India CodeKarma Full time
Site Reliability Engineer (Multi-Cloud Deployments)
Location: Bangalore / Remote
Experience: 4–10 years
Type: Full-time (6-month probation)
About CodeKarma: CodeKarma is redefining how engineering teams understand and evolve complex systems, bringing production context directly into the developer’s workflow. Our platform runs both as SaaS and as sub-
-
Site Reliability Engineer
2 weeks ago
Bangalore, India Flipkart Full time
Hiring Site Reliability Engineers
Exp: 2.5+ years (excluding internship)
Location: Bangalore
Apply Here:
The engineer will work in the Reliability and Productivity Engineering team and is responsible for building industry-standard, large-scale platforms to be utilised across FK that help to significantly improve the reliability of systems and bring...
-
Site Reliability Engineer
1 week ago
Bangalore, India Andor Tech Full time
Hiring!!
🏢 About AndorTech
AndorTech is a global IT services and consulting firm founded in 2009, headquartered in Bangalore. The company specializes in software engineering, AI-enabled IT services, application support, analytics, and test automation. With a presence across India, the USA, Europe, and the UAE, AndorTech partners with Global Capability...
-
Site Reliability Engineer
1 week ago
Bangalore, India Andor Tech Full time
Hiring!!
About AndorTech
AndorTech is a global IT services and consulting firm founded in 2009, headquartered in Bangalore. The company specializes in software engineering, AI-enabled IT services, application support, analytics, and test automation. With a presence across India, the USA, Europe, and the UAE, AndorTech partners with Global Capability Centers...
-
Site Reliability Engineer
1 week ago
Bangalore, India Karix Full time
Role: Site Reliability Engineer
Location: Bangalore (WFO)
About the role: We are seeking an experienced professional Site Reliability Engineer who acts as a bridge between development and IT operations, taking operational tasks to ensure the efficient functioning of Service platforms. They are responsible for monitoring, automating, and improving the...