Site Reliability Engineer

7 days ago

Bengaluru India Groww Full time

Job Description

About Groww

We are a passionate group of people focused on making financial services accessible to every Indian through a multi-product platform. Each day, we help millions of customers take charge of their financial journey. Customer obsession is in our DNA. Every product, every design, every algorithm, down to the tiniest detail, is executed keeping the customers' needs and convenience in mind. Our people are our greatest strength. Everyone at Groww is driven by ownership, customer-centricity, integrity and the passion to constantly challenge the status quo.

Are you as passionate about defying conventions and creating something extraordinary as we are? Let's chat.

Our Vision

Every individual deserves the knowledge, tools, and confidence to make informed financial decisions. At Groww, we are making sure every Indian feels empowered to do so through a cutting-edge multi-product platform offering a variety of financial services.

Our long-term vision is to become the trusted financial partner for millions of Indians.

Our Values

Our culture enables us to be what we are: India's fastest-growing financial services company. It fosters an environment where collaboration, transparency, and open communication take center stage and hierarchies fade away. There is space for every individual to be themselves and feel motivated to bring their best to the table, as well as craft a promising career for themselves.

The values that form our foundation are:

- Radical customer centricity
- Ownership-driven culture
- Keeping everything simple
- Long-term thinking
- Complete transparency

Expertise and Qualifications

We are seeking a highly motivated and experienced Senior Site Reliability Engineer to join our engineering team. As an SRE, you will be responsible for ensuring the reliability, availability, scalability, and performance of our applications and infrastructure. You will collaborate closely with software developers, platform engineers, and other team members to design, provision, build, and maintain systems that are scalable, secure, and highly available.

What will make you a great fit for the role:

- 6-9 years of experience in SRE, DevOps, or system architecture roles with large-scale production systems.
- Extensive experience managing and scaling high-traffic, low-latency fintech systems, ensuring reliability, compliance, and secure transaction processing.
- Proven expertise in the networking stack, with hands-on experience in BGP, OSPF, DNS, HTTP(S), TCP/IP, MPLS, and VPN protocols.
- Advanced knowledge of GCP networking (VPC design, Shared VPC, Private Service Connect, Global Load Balancers, Cloud DNS, Cloud NAT, Network Intelligence Center, and Service Mesh).
- Strong background in managing complex multi-cloud environments (AWS, GCP, Azure) with a focus on secure and compliant architectures in regulated industries.
- Hands-on expertise in Terraform and Infrastructure-as-Code (IaC) for repeatable, automated deployments.
- Expertise in Kubernetes, container orchestration, and microservices, with production experience in regulated fintech environments.
- Advanced programming and scripting skills in Python, Go, or Java, applied to automation, risk reduction, and financial system resilience.
- Proficiency with monitoring and logging tools (Prometheus, Mimir, Grafana, Loki) to ensure real-time visibility into trading, payments, and transaction flows.
- Strong understanding of networking, load balancing, and DNS management across multi-cloud and hybrid infrastructures.
- Experience implementing end-to-end observability solutions (metrics, logs, and traces) to monitor and optimize transaction throughput while adhering to latency SLAs.
- Leadership skills with experience mentoring teams, fostering a culture of reliability, and partnering with cross-functional stakeholders in product teams.
- Strong communication, critical thinking, and incident management abilities, especially in high-stakes production incidents involving customer transactions.
- Bachelor's or Master's degree in Computer Science, Engineering, or equivalent experience.

What you'll do:

- Architect and lead the design of scalable, reliable infrastructure solutions.
- Implement strategies for high availability, scalability, and low-latency performance.
- Define service-level objectives (SLOs) and service-level indicators (SLIs) to track performance and reliability.
- Drive incident management by identifying root causes and providing long-term solutions.
- Mentor junior engineers and foster a collaborative, learning-focused environment.
- Design advanced monitoring and alerting systems for proactive system management.
- Architect and optimize network topologies (hybrid cloud, multi-cloud, and on-prem) to support ultra-low-latency trading and compliance-driven workloads.
- Configure and manage cloud and on-prem networking components (VPCs, Shared VPCs, Private Service Connect, Cloud NAT, and Global Load Balancers) for secure and compliant transaction flows.
- Implement secure connectivity solutions (VPNs, Interconnect, Direct Connect, and service meshes) to meet fintech regulatory requirements and standards.
- Develop and maintain DNS, load-balancing, and traffic-routing strategies to ensure millisecond-level latency for real-time transactions.
- Evolve Infrastructure as Code (IaC) practices and principles to automate infrastructure provisioning.
- Collaborate on reliability roadmaps, performance benchmarks, and disaster recovery plans tailored for low-latency and high-throughput workloads.
- Manage Kubernetes clusters at scale, integrating service meshes like Istio or Linkerd.
- Implement chaos engineering principles to strengthen system resilience.
- Influence technical direction, reliability culture, and organizational strategies.

Lead Site Reliability Engineer

2 weeks ago

Bengaluru, India Optum Full time

Job Description Optum is a global organization that delivers care, aided by technology to help millions of people live healthier lives. The work you do with our team will directly improve health outcomes by connecting people with the care, pharmacy benefits, data and resources they need to feel their best. Here, you will find a culture guided by inclusion,...
Senior Site Reliability Engineer

4 weeks ago

India Microsoft Full time

Job Description The Windows Cloud division is looking for a Senior Site Reliability Engineer who will help us take the Windows Cloud platform, as well as the Windows 365 Cloud PC and Azure Virtual Desktop business, to the next level. Windows 365 Cloud PC (W365) and Azure Virtual Desktop (AVD) have recently been recognized as leaders in the Gartner Magic...
Senior Site Reliability Engineer

3 weeks ago

Bengaluru, Karnataka, India Allegion Full time

- Allegion India is seeking a highly motivated Senior Site Reliability Engineer who will play a critical role in ensuring the reliability, scalability, and performance of our organization's systems and infrastructure, who will work with a team of cross-functional product development engineers to design, implement, and maintain highly available and resilient...
Senior Site Reliability Engineer

2 weeks ago

Bengaluru, India Booking Holdings Full time

Job Description Key Job Responsibilities and Duties: The core premise for the Booking SRE lies in treating operational and reliability problems of software systems as a software engineering problem. We code our way out of problems where operations are concerned addressing availability, scalability, latency, and efficiency challenges within the vast...
Site Reliability Engineer

2 weeks ago

Bengaluru, India VidPro Consultancy Services Full time

Job Description Experience: 2.5-5 Years Location: Bangalore (On-site) Work Mode: 5 Days WFO Mandatory Skills: Site Reliability Engineer or SRE, Linux, System architecture, TCP/IP, HTTP, DNS, Grafana, Prometheus and Loki, Troubleshooting, Root cause, complex systems, CI/CD, Docker, Kubernetes Experience: 2-4 years of relevant experience Key Skills...
Site reliability engineer

2 weeks ago

India Employ Full time

Role - Site Reliability Engineer (SRE)/ Platform Engineering/ or DevOps Engineering roles Location – Fully Remote Type - 6 months Contract Work Ex - 5+ Yrs We’re working with an AI product company that’s building the next generation of GenAI powered developer platforms. We’re looking for an experienced Site Reliability Engineer to join...
Site Reliability Engineer

1 week ago

India Elgebra Full time

Hiring: Site Reliability Engineer – 7+ Years Location: Bangalore / Chennai Payroll: Elgebra
Site Reliability Engineer

2 weeks ago

India Concord Full time

SRE Sr. Engineers (Individual Contributors) Key Attributes: - Strong SRE (Site Reliability Engineering) experience - DevOps skills – CI/CD, monitoring, automation, infrastructure as code, etc. - Excellent troubleshooting and debugging skills (infrastructure + application level) - Perseverance – must push through complex/challenging issues without...
Site Reliability Engineer

4 weeks ago

India Employ Full time

Role - Site Reliability Engineer (SRE)/ Platform Engineering/ or DevOps Engineering roles Location – Bangalore/ Remote Type - Contract Work Ex - 4-6 yrs We're working with an AI product company that's building the next generation of GenAI powered developer platforms. We're looking for an experienced Site Reliability Engineer to join their Platform Engineering...
