Site Reliability Engineer

3 months ago


bangalore, India Groww Full time

About Groww

We are a passionate group of people focused on making financial services accessible to every Indian through a multi-product platform. Each day, we help millions of customers take charge of their financial journey. Customer obsession is in our DNA. Every product, every design, every algorithm down to the tiniest detail is executed keeping the customers’ needs and convenience in mind. Our people are our greatest strength. Everyone at Groww is driven by ownership, customer-centricity, integrity and the passion to constantly challenge the status quo.


Are you as passionate about defying conventions and creating something extraordinary as we are? Let’s chat.


Our Vision

Every individual deserves the knowledge, tools, and confidence to make informed financial decisions. At Groww, we are making sure every Indian feels empowered to do so through a cutting-edge multi-product platform offering a variety of financial services.

Our long-term vision is to become the trusted financial partner for millions of Indians.


Our Values

Our culture enables us to be what we are — India’s fastest-growing financial services company. It fosters an environment where collaboration, transparency, and open communication take center-stage and hierarchies fade away. There is space for every individual to be themselves and feel motivated to bring their best to the table, as well as craft a promising career for themselves.


The values that form our foundation are:

  • Radical customer centricity
  • Ownership-driven culture
  • Keeping everything simple
  • Long-term thinking
  • Complete transparency


Know your team:

We are a team of very enthusiastic hustlers who like to challenge themselves more with every new experience.


Expertise and Qualifications

We are seeking a highly motivated and experienced Senior Site Reliability Engineer to join our engineering team. As an SRE, you will be responsible for ensuring the reliability, availability, scalability, and performance of our applications and infrastructure. You will collaborate closely with software developers, platform engineers, and other team members to design, provision, build, and maintain systems that are scalable, secure, and highly available.


Responsibilities

  • Monitor and troubleshoot issues related to system performance, reliability, and security.
  • Define and implement Service Level Indicators (SLIs), Service Level Objectives (SLOs), and Error Budgets to measure and improve service reliability.
  • Analyze and report on metrics and trace data using Grafana, prometheus.
  • Participate in an on-call rotation to provide 24/7 support for critical production systems.
  • Evaluate and automate manual and repetitive tasks to reduce toil and improve system efficiency.
  • Design and manage infrastructure using tools like Terraform, Crossplane, or Kubernetes Composite Resource Definitions (XRDs).
  • Implement and manage security measures to protect infrastructure and data.
  • Coordinate between developers and operations to ensure smooth software releases and timely resolution of production issues.
  • Conduct thorough root cause analysis (RCA) of production incidents and implement preventive measures.
  • Review and optimize system performance, identify bottlenecks, and implement capacity planning and recovery strategies.
  • Maintain comprehensive documentation of systems, processes, and incident responses.
  • Continuously seek and implement improvements to infrastructure, processes, and tools to enhance system reliability and performance.


Requirements

  • 4-7 years of relevant work experience
  • Bachelor's or Master's degree in Computer Science or a related field
  • Must have strong understanding of Linux/Unix systems administration and networking
  • Must have experience with Kubernetes, Docker, and other containerization technologies is a plus
  • Must have experience with cloud platforms such as GCP or AWS or Azure
  • Strong programming skills in one or more languages such as Go, Python, or Java
  • Good to have experience with monitoring and alerting tools such as Grafana, Prometheus, Pager Duty or similar technologies
  • Must have experience with infrastructure provisioning tools such as Terraform, Pulumi, Cloud formation, or similar technologies.
  • Strong troubleshooting skills
  • Strong interpersonal and team collaboration skill


  • bangalore, India Integra Connect Full time

    About IntegraConnectIntegra Connect delivers a comprehensive, integrated suite of cloud-based technologies and services that enable specialty groups to optimize clinical and financial performance as reimbursement shifts to value-based models. Connected by the IntegraCloud platform, the company’s core applications span population health including care...


  • bangalore, India Zensar Technologies Full time

    About the Role: Site Reliability EngineerExperience: 5-8YrsLocation: BangaloreRequired Skills:Must have skills: -High level of experience using cloud log management and monitoring data platforms ( Dynatrace, Azure Monitor )Hands on experience in Azure BicepExperience working with Infrastructure as Code and Containerization tools ( Terraform , Docker,...


  • bangalore, India tsworks Full time

    Who We Are tsworks Technologies India Private Limited (subsidiary of The Software Works, Inc, USA) is a technology product and services company. Our mission is to provide domain expertise, innovative solutions and thought leadership to empower businesses to thrive in a digital world. We value our employees, take pride in providing best value in customer...


  • bangalore, India CirrusLabs Full time

    About the CompanyWe are CirrusLabs. Our vision is to become the world's most sought-after niche digital transformation company that helps customers realize value through innovation. Our mission is to co-create success with our customers, partners and community. Our goal is to enable employees to dream, grow and make things happen. We are committed to...


  • Bangalore City, India Zensar Technologies Full time

    About the Role: Site Reliability EngineerExperience: 5-8YrsLocation: BangaloreRequired Skills:Must have skills: High level of experience using cloud log management and monitoring data platforms (Dynatrace, Azure Monitor)Hands on experience in Azure Bicep Experience working with Infrastructure as Code and Containerization tools (Terraform, Docker, Kubernetes,...


  • bangalore, India BayOne Solutions Full time

    ResponsibilitiesTo ensure the reliability, availability and performance of customer’s production systems. Monitor system health, identify issues and implement solutions to prevent and resolve incidentsFor responding to incidents, perform root cause analysis and work with functional teams, 3rd party vendors to implement corrective actionsFor monitoring,...


  • Bangalore Urban, India Integra Connect Full time

    About IntegraConnect Integra Connect delivers a comprehensive, integrated suite of cloud-based technologies and services that enable specialty groups to optimize clinical and financial performance as reimbursement shifts to value-based models. Connected by the IntegraCloud platform, the company’s core applications span population health including care...


  • Bangalore Urban, India Integra Connect Full time

    About IntegraConnect Integra Connect delivers a comprehensive, integrated suite of cloud-based technologies and services that enable specialty groups to optimize clinical and financial performance as reimbursement shifts to value-based models. Connected by the IntegraCloud platform, the company’s core applications span population health including care...


  • Bangalore, India Signify Netherlands B.V. Full time

    Signify, the new company name of Philips Lighting, is the global leader in lighting building on 125+ years of innovations. Our purpose is to unlock the extraordinary potential of light for brighter lives and a better world.We are proud to be ahead of the game in the Internet of Things and being carbon neutral. We learn through disruptive challenges and our...


  • Bangalore City, India BayOne Solutions Full time

    ResponsibilitiesTo ensure the reliability, availability and performance of customer’s production systems. Monitor system health, identify issues and implement solutions to prevent and resolve incidentsFor responding to incidents, perform root cause analysis and work with functional teams, 3rd party vendors to implement corrective actionsFor monitoring,...


  • bangalore, India Saarthee Full time

    Company DescriptionAbout Saarthee:Saarthee is a global data, analytics , technology and consulting firm unlike any other, where our passion for helping others fuels our approach and our products and solutions. We are a one-stop shop for all things data and analytics. Unlike other analytics consulting firms that are technology or platform specific,...


  • Bangalore, India ALIQAN Technologies Full time

    Job Description : We are seeking a Site Reliability Engineer with strong platform development skills and a thorough understanding of securing environments, with a solid grasp of information security and performance optimization. This role focuses on building scalable, secure, and exceptional infrastructure, automating processes wherever possible. Ideal...


  • bangalore, India Groww Full time

    About GrowwWe are a passionate group of people focused on making financial services accessible to every Indian through a multi-product platform. Each day, we help millions of customers take charge of their financial journey. Customer obsession is in our DNA. Every product, every design, every algorithm down to the tiniest detail is executed keeping the...


  • Bangalore, India Protoporos Staffing Services Pvt Ltd Full time

    Opportunity with a leading B2B SaaS product client specializing in cutting-edge data integration solutions. Position Overview: We are seeking a highly skilled and experienced Staff Site Reliability Engineer to join our team. As a Staff SRE, you will play a critical role in ensuring the reliability, scalability, and performance of our data integration...


  • bangalore, India Laerdal Bangalore Full time

    As a Senior Site Reliability Engineer, you’ll play a pivotal role in ensuring the reliability and performance of our cloud-based applications and solutions. Collaborating closely with our team, you will cultivate a culture of SRE breaking down silos and managing incidents and problems. Your role will involve developing and implementing innovative solutions...


  • bangalore, India Tech Mahindra Full time

    Role - Offshore Delivery Lead (SRE with ITSM)Role & JDDetailed JDWe are seeking an experienced Off-Shore Technical Team SRE Lead to join our global team of Site Reliability Engineers. As an SRE Lead, you will be responsible for managing and mentoring a team of offshore SREs and DevOps engineers who work on ensuring the reliability, performance, and...


  • bangalore, India Zscaler Full time

    Our Engineering team built the world's largest cloud security platform from the ground up, and we keep building. With more than 100 patents and big plans for enhancing services and increasing our global footprint, the team has made us and our multitenant architecture today's cloud security leader, with more than 15 million users in 185 countries. Bring your...


  • Bangalore City, India Laerdal Bangalore Full time

    As a Senior Site Reliability Engineer, you’ll play a pivotal role in ensuring the reliability and performance of our cloud-based applications and solutions. Collaborating closely with our team, you will cultivate a culture of SRE breaking down silos and managing incidents and problems. Your role will involve developing and implementing innovative solutions...


  • bangalore, India Groww Full time

    About GrowwWe are a passionate group of people focused on making financial services accessible to every Indian through a multi-product platform. Each day, we help millions of customers take charge of their financial journey. Customer obsession is in our DNA. Every product, every design, every algorithm down to the tiniest detail is executed keeping the...


  • Bangalore, India Protoporos Staffing Services Pvt Ltd Full time

    About : Opportunity for a role of Engineering Manager with a Enterprise B2B SaaS product firm providing Services/products to Fortune 100 organizations.The ideal candidate must be from a B2B SaaS product organization only. Title : Staff Platform Engineer/Site Reliability Engineer. Mandatory Skills : B2B SaaS Product Development, Java, AWS, 2 years of...