Site Reliability Engineer

4 weeks ago


Bengaluru, Karnataka, India slice Full time
About us

slice the way you bank

slice's purpose is to make the world better at using money and time, with a major focus on building the best consumer experience for your money. We've all felt how slow, confusing, and complicated banking can be. So, we're reimagining it. We're building every product from scratch to be fast, transparent, and feel good, because we believe that the best products transcend demographics, like how great music touches most of us.

Our cornerstone products and services: slice savings account, slice UPI credit card, slice UPI, slice UPI ATMs, slice fixed deposits, slice borrow, and UPI-powered bank branch are designed to be simple, rewarding, and completely in your control. At slice, you'll get to build things you'd use yourself and shape the future of banking in India. We tailor our working experience with the belief that the present moment is the only real thing in life. And we have harmony in the present the most when we feel happy and successful together.

We're backed by some of the world's leading investors, including Tiger Global, Insight Partners, Advent International, Blume Ventures, and Gunosy Capital.

About the role

We are looking for a Site Reliability Engineer with experience in building and implement functional systems that improve customer experience. Site Reliability Engineer responsibilities include deploying product updates, identifying production issues and implementing integrations that meet customer needs. Ultimately, you will execute and automate operational processes fast, accurately and securely.

What you'll do

- Designing and implementation of IT Infra including Networking, Storage, Compute, Backup and Security.
- Design and implement power distribution systems, optimize power usage efficiency and ensure redundancy to minimize downtime risks.
- Architect network infrastructure for data center and cloud environments, including switches, routers, firewalls, VPC security groups, transient gateway etc
- Implement high-speed interconnects and design network topologies to support scalable and resilient connectivity.
- Architect storage solutions (NAS/SAN, blockstore, filestore) tailored to meet performance, capacity, and data protection requirements.
- Optimize compute resources through virtualization/containerization technologies like VMWare ESX, Red Hat Openshift, Microsoft HyperV and Nutanix acropolis.
- Design fault-tolerant architectures to ensure high availability and minimize service disruptions.
- Develop rack layouts and configurations to maximize space utilization.
- Deep diving into Linux server issues and automation of configuration & deployment
- Documentation of systems processes and runbook.
- Manage data center vendor team and cable new servers, decommission old servers and manage system inventory
- Ensure successful execution of IT strategies, architecture guidelines, and standards and guide project teams through the technology selection and architecture/security governance processes
- Manage and maintain the Cloud DevOps pipeline and work with dev teams. Look for opportunities to optimize and enable consistent automated deployments.
- Monitor standards/policy compliance by developing and executing governance processes and tools.
- Provide mentoring and knowledge transfer to others, and promote open culture and DevOps.
- Participate in incident response and post-mortem activities to identify root causes and prevent recurrence.
- Proactively identify and address performance bottlenecks, reliability issues, and security vulnerabilities.

Basic Qualification :

- 5+ years of experience in the field
- Experience in NAS, SAN, Block storage, File storage
- Experience in virtualization platforms like VMWare ESX, Red Hat Openshift, Nutanix acropolis, Microsoft HyperV.
- Working knowledge, networking, switching, routing, firewalls.
- Good understanding of Linux
- Expertise in Go, TypeScript, GIT, Terraform
- Solid understanding of monitoring and logging solutions (e.g., Prometheus, Grafana, ELK stack).
- Experience with CI/CD pipelines and DevOps practices.
- Hands-on experience with Public Cloud AWS/GCP
- Working knowledge of Kubernetes

Life at slice

Life so good, you'd think we're kidding:

- Competitive salaries. Period.
- An extensive medical insurance that looks out for our employees & their dependants. We'll love you and take care of you, our promise.
- Flexible working hours. Just don't call us at 3AM, we like our sleep schedule.
- Tailored vacation & leave policies so that you enjoy every important moment in your life.
- A reward system that celebrates hard work and milestones throughout the year. Expect a gift coming your way anytime you kill it here.
- Learning and upskilling opportunities. Seriously, not kidding.
- Good food, games, and a cool office to make you feel like home. An environment so good, you'll forget the term "colleagues can't be your friends".

  • Bengaluru, Karnataka, India Enterprise Minds, Inc Full time

    We're Hiring | Site Reliability Engineer | 8-10 years


  • Bengaluru, Karnataka, India WhiteLotus Talent Partners Full time

    We are looking for a L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes . In this role, you will focus on monitoring , basic troubleshooting , and incident response , helping to maintain high...


  • Bengaluru, Karnataka, India beBeeReliability Full time ₹ 2,00,00,000 - ₹ 2,50,00,000

    Role OverviewAs a Site Reliability Engineer, you will play a pivotal role in driving innovation and modernizing complex systems by leveraging cutting-edge technologies and collaboration with cross-functional teams.


  • Bengaluru, Karnataka, India Coforge Full time

    Job Description- Design, implement, and maintain scalable infrastructure to ensure high availability and performance of software applications.- Collaborate with development teams to identify and resolve issues affecting application performance, stability, and reliability.- Develop automated monitoring scripts using tools like Prometheus, Grafana, etc. to...


  • Bengaluru, Karnataka, India Infrasoft Technologies Limited Full time

    Job DescriptionJob Title: DeveloperWork Location: Bangalore, KarnatakaExperience Range: 68 YearsJob Description:We are looking for a skilled Developer with strong hands-on experience in Site Reliability Engineering (SRE), Java, JavaScript, and Production Support. The ideal candidate should have a solid background in application monitoring and troubleshooting...


  • Bengaluru, Karnataka, India Collabera Full time

    Job Description As a Principal/Chief Site Reliability Engineer , you will play a critical role in designing, developing, and maintaining scalable and highly reliable systems. You'll work closely with development teams to improve system reliability, monitor critical applications, and design fail-proof infrastructure. Responsibilities Design and implement...


  • Bengaluru, Karnataka, India Xebia Full time

    We are seeking an experienced AWS DevOps Engineer with strong expertise in Observability and Site Reliability Engineering (SRE) to design, build, and manage scalable, reliable, and secure cloud environments. The role requires hands-on experience with AWS services, Infrastructure as Code (IaC), CI/CD, monitoring & observability frameworks, and incident...


  • Bengaluru, Karnataka, India NatWest Group Full time

    Join us as a Site Reliability Engineer In this key role you ll support the improvement of non-functional and operational characteristics such as availability performance efficiency change management monitoring security incident response and capacity planning of our products and services You ll enjoy significant stakeholder interaction working in...


  • Bengaluru, Karnataka, India Tata Technologies Full time

    Job DescriptionSite Reliability EngineerWhat awaits you/ Job ProfileAn SRE is responsible for maintaining reliability. That means facilitating automated, streamlined, and efficient error responses and reducing human error at scale. SREs spend a lot of time removing pain points, configuring internal tools, and setting and testing system benchmarks. They also...


  • Bengaluru, Karnataka, India beBeeReliability Full time

    Pearson is looking for a dynamic and experienced Manager - Site Reliability Engineering (SRE) to join our team. This individual will play a critical role in ensuring the stability, performance, and scalability of our infrastructure. If you possess excellent leadership skills, profound technical expertise, and the ability to thrive in a fast-paced,...