Site Reliability Engineering Manager

3 weeks ago


bangalore, India Groww Full time
About Groww
We are a passionate group of people focused on making financial services accessible to every Indian through a multi-product platform. Each day, we help millions of customers take charge of their financial journey. Customer obsession is in our DNA. Every product, every design, every algorithm down to the tiniest detail is executed keeping the customers’ needs and convenience in mind. Our people are our greatest strength. Everyone at Groww is driven by ownership, customer-centricity, integrity and the passion to constantly challenge the status quo.
Are you as passionate about defying conventions and creating something extraordinary as we are? Let’s chat.
Our Vision
Every individual deserves the knowledge, tools, and confidence to make informed financial decisions. At Groww, we are making sure every Indian feels empowered to do so through a cutting-edge multi-product platform offering a variety of financial services. Our long-term vision is to become the trusted financial partner for millions of Indians.
Our Values
Our culture enables us to be what we are — India’s fastest-growing financial services company. Everyone at Groww enjoys the autonomy and flexibility to bring their best work to the table, as well as craft a promising career for themselves.
The values that form our foundation are:
Radical customer-centricity
Ownership-driven culture
Keeping everything simple
Long-term thinking
Complete transparency
EXPERTISE AND QUALIFICATIONS
Collaborate with development teams to ensure the architecture and applications are designed with scalability, reliability and cost in mind.
Develop and maintain monitoring, alerting, and logging solutions to proactively identify and address performance issues and outages.
Orchestrate and own on-call rotations, responding to incidents, conducting post-incident reviews, and contributing to incident response improvements.
Analyze system performance data, identify bottlenecks, and recommend solutions to optimize performance and resource utilization.
Contribute to the design and implementation of disaster recovery strategies and backup solutions.
Stay with current industry trends, emerging technologies, and best practices to drive innovation and improvements in system reliability.
Plan and execute patching and upgrades for the PaaS and IaaS components.
Regularly connect with stakeholders to align on their infrastructure requirements
Manage a team of Site Reliability Engineers who work closely with our other Engineering teams to provide consistency in monitoring, process, deliverability.
Plan, prioritize, track, and deliver on internal and external projects, tasks, and goals.
Champion and advocate for your team across the company.
Build trust by communicating transparently and honestly with all members of the organization.
Requirements
Experience with containerization and orchestration technologies (e.g., Docker, Kubernetes).
Bachelor's degree in Computer Science, Engineering, or related field (or equivalent practical experience).
8+ years of experience with at least two years with leadership experience in Site Reliability Engineering or similar role, with a proven track record of managing complex systems in a production environment.
Proficiency in programming/scripting languages such as Java, Python, or similar.
Expertise in Cloud Infrastructure solutions like Microsoft Azure, Google Cloud or AWS
Experience with multiple data stores (MySQL, MongoDB, Cassandra, Elasticsearch).
Experience in designing highly efficient in-house observability platforms using open-source tools like thanos, Prometheus, Datalog and Grafana
Solid knowledge of networking concepts, including load balancing, DNS, routing, and security.
Strong problem-solving skills and the ability to troubleshoot complex issues under pressure.
Excellent communication and collaboration skills to work effectively across teams.
Experience with CI/CD pipelines and version control systems (e.g., Jenkins, GitHub actions).
Possess a passion for reliability, through participation in architectural design.

  • bangalore, India First American (India) Full time

    The Role:A SRE Manager is ultimately responsible for system reliability, developer productivity and reducing time to market by striving to reduce technical debt of the services your SRE team supports. We seek managers who are passionate about site reliability to influence and drive the strategic SRE mission.As a Site Reliability Engineering Manager working...


  • bangalore, India First American (India) Full time

    The Role:A SRE Manager is ultimately responsible for system reliability, developer productivity and reducing time to market by striving to reduce technical debt of the services your SRE team supports. We seek managers who are passionate about site reliability to influence and drive the strategic SRE mission.As a Site Reliability Engineering Manager working...


  • bangalore, India First American (India) Full time

    The Role: A SRE Manager is ultimately responsible for system reliability, developer productivity and reducing time to market by striving to reduce technical debt of the services your SRE team supports. We seek managers who are passionate about site reliability to influence and drive the strategic SRE mission. As a Site Reliability Engineering Manager...


  • bangalore, India Cyitechsearch Full time

    We are hiring for Site Reliability Engineer Skills : - Develop and provide operational support for full-stack software applications.- Relevant industry certifications, such as through the Site Reliability Engineering (SRE) Foundation.- Five years' experience as a site reliability engineer or similar role.- Collaborate with development operations staff to...


  • bangalore, India Cricbuzz.com Full time

    Site Reliability EngineerWe are looking for a highly skilled and motivated Web Server Site Reliability Engineer to join our team. As a Web Server Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our web server infrastructure and CDN services.Experience - 3 - 5 yearsResponsibilities:● Design,...


  • Bangalore, Karnataka, India Cyitechsearch Full time

    We are hiring for Site Reliability Engineer Skills : - Develop and provide operational support for full-stack software applications.- Relevant industry certifications, such as through the Site Reliability Engineering (SRE) Foundation.- Five years' experience as a site reliability engineer or similar role.- Collaborate with development operations staff to...


  • Bangalore, India Cyitechsearch Full time

    We are hiring for Site Reliability Engineer Skills : - Develop and provide operational support for full-stack software applications.- Relevant industry certifications, such as through the Site Reliability Engineering (SRE) Foundation.- Five years' experience as a site reliability engineer or similar role.- Collaborate with development operations staff...


  • Bangalore, Karnataka, India Cyitechsearch Full time

    We are hiring for Site Reliability Engineer Skills : - Develop and provide operational support for full-stack software applications.- Relevant industry certifications, such as through the Site Reliability Engineering (SRE) Foundation.- Five years' experience as a site reliability engineer or similar role.- Collaborate with development operations staff to...


  • bangalore, India Kunato Full time

    Site Reliability Engineer (SRE) - Python/GolangJob Description:We are seeking a highly skilled and passionate Site Reliability Engineer (SRE) to join our technology team. The ideal candidate will possess strong programming skills with expertise in Python, Golang, or both. This role is pivotal in ensuring the high availability, performance, and security of...


  • bangalore, India Kunato Full time

    Site Reliability Engineer (SRE) - Python/GolangJob Description:We are seeking a highly skilled and passionate Site Reliability Engineer (SRE) to join our technology team. The ideal candidate will possess strong programming skills with expertise in Python, Golang, or both. This role is pivotal in ensuring the high availability, performance, and security of...


  • Bangalore, India Cyitechsearch Full time

    About the job :We are hiring for Site Reliability EngineerExperience : 5+ Years Work Model : Remote / Contract 3 years Skills : Develop and provide operational support for fullstack software applications. Relevant industry certifications, such as through the Site Reliability Engineering (SRE) Foundation. Five years' experience as a site reliability engineer...


  • bangalore, India Greenway Health Full time

    Job Summary The Manager is responsible for implementing the development process and site reliability engineering practices to resolve issues and identify opportunity areas. This role will lead development and site reliability engineering teams and establish and implement best practices and standards related to engineering processes through all phases of the...


  • bangalore, India Kunato Full time

    Site Reliability Engineer (SRE) - Python/Golang Job Description: We are seeking a highly skilled and passionate Site Reliability Engineer (SRE) to join our technology team. The ideal candidate will possess strong programming skills with expertise in Python, Golang, or both. This role is pivotal in ensuring the high availability, performance, and security...


  • bangalore, India Ultrabot Innovations Full time

    Position Overview :As a Senior Site Reliability Engineer with 5-8 years of experience, you will play a key role in ensuring the reliability, scalability, and performance of our systems and infrastructure. You will leverage your expertise in Site Reliability Engineering (SRE) to implement best practices and methodologies, effectively troubleshoot complex...


  • bangalore, India CloudBees Full time

    J ob Title - Manager, Site Reliability EngineerLocation - Bangalore and ChennaiYear of Experience - 10+ YearsAbout CloudBeesCloudBees is the leading software delivery platform that enables enterprises to deliver scalable, compliant, and secure software, empowering developers to do their best work.Seamlessly integrating into any hybrid and heterogeneous...


  • Bangalore, India Ultrabot Innovations Full time

    Position Overview :As a Senior Site Reliability Engineer with 5-8 years of experience, you will play a key role in ensuring the reliability, scalability, and performance of our systems and infrastructure. You will leverage your expertise in Site Reliability Engineering (SRE) to implement best practices and methodologies, effectively troubleshoot complex...


  • Bangalore, Karnataka, India Ultrabot Innovations Full time

    Position Overview :As a Senior Site Reliability Engineer with 5-8 years of experience, you will play a key role in ensuring the reliability, scalability, and performance of our systems and infrastructure. You will leverage your expertise in Site Reliability Engineering (SRE) to implement best practices and methodologies, effectively troubleshoot complex...


  • Bangalore, Karnataka, India Ultrabot Innovations Full time

    Position Overview :As a Senior Site Reliability Engineer with 5-8 years of experience, you will play a key role in ensuring the reliability, scalability, and performance of our systems and infrastructure. You will leverage your expertise in Site Reliability Engineering (SRE) to implement best practices and methodologies, effectively troubleshoot complex...


  • Bangalore, India Ultrabot Innovations Full time

    Position Overview :As a Senior Site Reliability Engineer with 5-8 years of experience, you will play a key role in ensuring the reliability, scalability, and performance of our systems and infrastructure. You will leverage your expertise in Site Reliability Engineering (SRE) to implement best practices and methodologies, effectively troubleshoot complex...


  • bangalore, India Integra Connect Full time

    About IntegraConnectIntegra Connect delivers a comprehensive, integrated suite of cloud-based technologies and services that enable specialty groups to optimize clinical and financial performance as reimbursement shifts to value-based models. Connected by the IntegraCloud platform, the company’s core applications span population health including care...