
Site Reliability Engineer
4 weeks ago
slice the way you bank
slice's purpose is to make the world better at using money and time, with a major focus on building the best consumer experience for your money. We've all felt how slow, confusing, and complicated banking can be. So, we're reimagining it. We're building every product from scratch to be fast, transparent, and feel good, because we believe that the best products transcend demographics, like how great music touches most of us.
Our cornerstone products and services: slice savings account, slice UPI credit card, slice UPI, slice UPI ATMs, slice fixed deposits, slice borrow, and UPI-powered bank branch are designed to be simple, rewarding, and completely in your control. At slice, you'll get to build things you'd use yourself and shape the future of banking in India. We tailor our working experience with the belief that the present moment is the only real thing in life. And we have harmony in the present the most when we feel happy and successful together.
We're backed by some of the world's leading investors, including Tiger Global, Insight Partners, Advent International, Blume Ventures, and Gunosy Capital.
About the role
We are looking for a Site Reliability Engineer with experience in building and implement functional systems that improve customer experience. Site Reliability Engineer responsibilities include deploying product updates, identifying production issues and implementing integrations that meet customer needs. Ultimately, you will execute and automate operational processes fast, accurately and securely.
What you'll do
- Designing and implementation of IT Infra including Networking, Storage, Compute, Backup and Security.
- Design and implement power distribution systems, optimize power usage efficiency and ensure redundancy to minimize downtime risks.
- Architect network infrastructure for data center and cloud environments, including switches, routers, firewalls, VPC security groups, transient gateway etc
- Implement high-speed interconnects and design network topologies to support scalable and resilient connectivity.
- Architect storage solutions (NAS/SAN, blockstore, filestore) tailored to meet performance, capacity, and data protection requirements.
- Optimize compute resources through virtualization/containerization technologies like VMWare ESX, Red Hat Openshift, Microsoft HyperV and Nutanix acropolis.
- Design fault-tolerant architectures to ensure high availability and minimize service disruptions.
- Develop rack layouts and configurations to maximize space utilization.
- Deep diving into Linux server issues and automation of configuration & deployment
- Documentation of systems processes and runbook.
- Manage data center vendor team and cable new servers, decommission old servers and manage system inventory
- Ensure successful execution of IT strategies, architecture guidelines, and standards and guide project teams through the technology selection and architecture/security governance processes
- Manage and maintain the Cloud DevOps pipeline and work with dev teams. Look for opportunities to optimize and enable consistent automated deployments.
- Monitor standards/policy compliance by developing and executing governance processes and tools.
- Provide mentoring and knowledge transfer to others, and promote open culture and DevOps.
- Participate in incident response and post-mortem activities to identify root causes and prevent recurrence.
- Proactively identify and address performance bottlenecks, reliability issues, and security vulnerabilities.
Basic Qualification :
- 5+ years of experience in the field
- Experience in NAS, SAN, Block storage, File storage
- Experience in virtualization platforms like VMWare ESX, Red Hat Openshift, Nutanix acropolis, Microsoft HyperV.
- Working knowledge, networking, switching, routing, firewalls.
- Good understanding of Linux
- Expertise in Go, TypeScript, GIT, Terraform
- Solid understanding of monitoring and logging solutions (e.g., Prometheus, Grafana, ELK stack).
- Experience with CI/CD pipelines and DevOps practices.
- Hands-on experience with Public Cloud AWS/GCP
- Working knowledge of Kubernetes
Life at slice
Life so good, you'd think we're kidding:
- Competitive salaries. Period.
- An extensive medical insurance that looks out for our employees & their dependants. We'll love you and take care of you, our promise.
- Flexible working hours. Just don't call us at 3AM, we like our sleep schedule.
- Tailored vacation & leave policies so that you enjoy every important moment in your life.
- A reward system that celebrates hard work and milestones throughout the year. Expect a gift coming your way anytime you kill it here.
- Learning and upskilling opportunities. Seriously, not kidding.
- Good food, games, and a cool office to make you feel like home. An environment so good, you'll forget the term "colleagues can't be your friends".
-
Site Reliability Engineer
1 day ago
Bengaluru, Karnataka, India Enterprise Minds, Inc Full timeWe're Hiring | Site Reliability Engineer | 8-10 years
-
Site Reliability Engineer
4 days ago
Bengaluru, Karnataka, India WhiteLotus Talent Partners Full timeWe are looking for a L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes . In this role, you will focus on monitoring , basic troubleshooting , and incident response , helping to maintain high...
-
Site Reliability Engineer
5 days ago
Bengaluru, Karnataka, India beBeeReliability Full time ₹ 2,00,00,000 - ₹ 2,50,00,000Role OverviewAs a Site Reliability Engineer, you will play a pivotal role in driving innovation and modernizing complex systems by leveraging cutting-edge technologies and collaboration with cross-functional teams.
-
Site Reliability Engineer
1 day ago
Bengaluru, Karnataka, India Coforge Full timeJob Description- Design, implement, and maintain scalable infrastructure to ensure high availability and performance of software applications.- Collaborate with development teams to identify and resolve issues affecting application performance, stability, and reliability.- Develop automated monitoring scripts using tools like Prometheus, Grafana, etc. to...
-
Site Reliability Engineering
1 day ago
Bengaluru, Karnataka, India Infrasoft Technologies Limited Full timeJob DescriptionJob Title: DeveloperWork Location: Bangalore, KarnatakaExperience Range: 68 YearsJob Description:We are looking for a skilled Developer with strong hands-on experience in Site Reliability Engineering (SRE), Java, JavaScript, and Production Support. The ideal candidate should have a solid background in application monitoring and troubleshooting...
-
Site Reliability Engineer
2 weeks ago
Bengaluru, Karnataka, India Collabera Full timeJob Description As a Principal/Chief Site Reliability Engineer , you will play a critical role in designing, developing, and maintaining scalable and highly reliable systems. You'll work closely with development teams to improve system reliability, monitor critical applications, and design fail-proof infrastructure. Responsibilities Design and implement...
-
Site Reliability Engineer
2 days ago
Bengaluru, Karnataka, India Xebia Full timeWe are seeking an experienced AWS DevOps Engineer with strong expertise in Observability and Site Reliability Engineering (SRE) to design, build, and manage scalable, reliable, and secure cloud environments. The role requires hands-on experience with AWS services, Infrastructure as Code (IaC), CI/CD, monitoring & observability frameworks, and incident...
-
Site Reliability Engineer
3 weeks ago
Bengaluru, Karnataka, India NatWest Group Full timeJoin us as a Site Reliability Engineer In this key role you ll support the improvement of non-functional and operational characteristics such as availability performance efficiency change management monitoring security incident response and capacity planning of our products and services You ll enjoy significant stakeholder interaction working in...
-
Site Reliability Engineer
4 weeks ago
Bengaluru, Karnataka, India Tata Technologies Full timeJob DescriptionSite Reliability EngineerWhat awaits you/ Job ProfileAn SRE is responsible for maintaining reliability. That means facilitating automated, streamlined, and efficient error responses and reducing human error at scale. SREs spend a lot of time removing pain points, configuring internal tools, and setting and testing system benchmarks. They also...
-
Site Reliability Engineering Director
4 days ago
Bengaluru, Karnataka, India beBeeReliability Full timePearson is looking for a dynamic and experienced Manager - Site Reliability Engineering (SRE) to join our team. This individual will play a critical role in ensuring the stability, performance, and scalability of our infrastructure. If you possess excellent leadership skills, profound technical expertise, and the ability to thrive in a fast-paced,...