Lead Site Reliability Engineer

3 weeks ago


Bengaluru, India | Groww | Full time

About Groww

We are a passionate group of people focused on making financial services accessible to every Indian through a multi-product platform. Each day, we help millions of customers take charge of their financial journey. Customer obsession is in our DNA. Every product, design, and algorithm, down to the tiniest detail, is built with customers’ needs and convenience in mind. Our people are our greatest strength. Everyone at Groww is driven by ownership, customer-centricity, integrity, and the passion to constantly challenge the status quo.


Are you as passionate about defying conventions and creating something extraordinary as we are? Let’s chat.


Our Vision

Every individual deserves the knowledge, tools, and confidence to make informed financial decisions. At Groww, we are making sure every Indian feels empowered to do so through a cutting-edge multi-product platform offering a variety of financial services.

Our long-term vision is to become the trusted financial partner for millions of Indians.


Our Values

Our culture enables us to be what we are: India’s fastest-growing financial services company. It fosters an environment where collaboration, transparency, and open communication take center stage and hierarchies fade away. There is space for every individual to be themselves, feel motivated to bring their best to the table, and craft a promising career.


The values that form our foundation are:

  • Radical customer centricity
  • Ownership-driven culture
  • Keeping everything simple
  • Long-term thinking
  • Complete transparency


Expertise and Qualifications

We are seeking a highly motivated and experienced Senior Site Reliability Engineer to join our engineering team. As an SRE, you will be responsible for ensuring the reliability, availability, scalability, and performance of our applications and infrastructure. You will collaborate closely with software developers, platform engineers, and other team members to design, provision, build, and maintain systems that are scalable, secure, and highly available.


Responsibilities

  • Monitor and troubleshoot issues related to system performance, reliability, and security.
  • Define and implement Service Level Indicators (SLIs), Service Level Objectives (SLOs), and Error Budgets to measure and improve service reliability.
  • Analyze and report on metrics and trace data using Grafana and Prometheus.
  • Participate in an on-call rotation to provide 24/7 support for critical production systems.
  • Evaluate and automate manual and repetitive tasks to reduce toil and improve system efficiency.
  • Design and manage infrastructure using tools like Terraform or Crossplane, including Crossplane Composite Resource Definitions (XRDs).
  • Implement and manage security measures to protect infrastructure and data.
  • Coordinate between developers and operations to ensure smooth software releases and timely resolution of production issues.
  • Conduct thorough root cause analysis (RCA) of production incidents and implement preventive measures.
  • Review and optimize system performance, identify bottlenecks, and implement capacity planning and recovery strategies.
  • Maintain comprehensive documentation of systems, processes, and incident responses.
  • Continuously seek and implement improvements to infrastructure, processes, and tools to enhance system reliability and performance.


Requirements

  • 5+ years of relevant work experience.
  • Bachelor's or Master's degree in Computer Science or a related field.
  • Strong understanding of Linux/Unix systems administration and networking, with troubleshooting skills.
  • Must have experience with Kubernetes, Docker, and other containerization technologies.
  • Experience with cloud platforms such as GCP, AWS, or Azure is required.
  • Strong programming skills in one or more languages such as Go, Python, or Java.
  • Experience with monitoring and alerting tools such as Grafana, Prometheus, PagerDuty, or similar technologies is desirable.
  • Must have experience with infrastructure provisioning tools such as Terraform, Pulumi, CloudFormation, or similar technologies.
  • Strong interpersonal and team collaboration skills.

