Lead Site Reliability Engineer
7 hours ago
About Groww We are a passionate group of people focused on making financial services accessible to every Indian through a multi-product platform. Each day, we help millions of customers take charge of their financial journey. Customer obsession is in our DNA. Every product, every design, every algorithm down to the tiniest detail is executed keeping the customers’ needs and convenience in mind. Our people are our greatest strength. Everyone at Groww is driven by ownership, customer-centricity, integrity and the passion to constantly challenge the status quo. Are you as passionate about defying conventions and creating something extraordinary as we are? Let’s chat. Our Vision Every individual deserves the knowledge, tools, and confidence to make informed financial decisions. At Groww, we are making sure every Indian feels empowered to do so through a cutting-edge multi-product platform offering a variety of financial services. Our long-term vision is to become the trusted financial partner for millions of Indians. Our Values Our culture enables us to be what we are — India’s fastest-growing financial services company. It fosters an environment where collaboration, transparency, and open communication take center-stage and hierarchies fade away. There is space for every individual to be themselves and feel motivated to bring their best to the table, as well as craft a promising career for themselves. The values that form our foundation are: Radical customer centricity Ownership-driven culture Keeping everything simple Long-term thinking Complete transparency Expertise and Qualifications We are seeking a highly motivated and experienced Senior Site Reliability Engineer to join our engineering team. As an SRE, you will be responsible for ensuring the reliability, availability, scalability, and performance of our applications and infrastructure. You will collaborate closely with software developers, platform engineers, and other team members to design, provision, build, and maintain systems that are scalable, secure, and highly available. Responsibilities Monitor and troubleshoot issues related to system performance, reliability, and security. Define and implement Service Level Indicators (SLIs), Service Level Objectives (SLOs), and Error Budgets to measure and improve service reliability. Analyze and report on metrics and trace data using Grafana, prometheus. Participate in an on-call rotation to provide 24/7 support for critical production systems. Evaluate and automate manual and repetitive tasks to reduce toil and improve system efficiency. Design and manage infrastructure using tools like Terraform, Crossplane, or Kubernetes Composite Resource Definitions (XRDs). Implement and manage security measures to protect infrastructure and data. Coordinate between developers and operations to ensure smooth software releases and timely resolution of production issues. Conduct thorough root cause analysis (RCA) of production incidents and implement preventive measures. Review and optimize system performance, identify bottlenecks, and implement capacity planning and recovery strategies. Maintain comprehensive documentation of systems, processes, and incident responses. Continuously seek and implement improvements to infrastructure, processes, and tools to enhance system reliability and performance. Requirements 5+ years of relevant work experience. Bachelor's or Master's degree in Computer Science or a related field. Strong understanding of Linux/Unix systems administration and networking, with troubleshooting skills. Must have experience with Kubernetes, Docker, and other containerization technologies. Experience with cloud platforms such as GCP, AWS, or Azure is required. Strong programming skills in one or more languages such as Go, Python, or Java. Experience with monitoring and alerting tools such as Grafana, Prometheus, PagerDuty, or similar technologies is desirable. Must have experience with infrastructure provisioning tools such as Terraform, Pulumi, CloudFormation, or similar technologies. Strong interpersonal and team collaboration skills.
-
Site reliability engineer
2 days ago
Bangalore, India Tsworks Full timeWho We Are tsworks Technologies India Private Limited (subsidiary of The Software Works, Inc, USA) is a technology product and services company. Our mission is to provide domain expertise, innovative solutions and thought leadership to empower businesses to thrive in a digital world. We value our employees, take pride in providing best value in customer...
-
Site reliability engineer
2 days ago
Bangalore, India Integra Connect Full timeAbout Integra Connect Integra Connect delivers a comprehensive, integrated suite of cloud-based technologies and services that enable specialty groups to optimize clinical and financial performance as reimbursement shifts to value-based models. Connected by the Integra Cloud platform, the company’s core applications span population health including care...
-
Site Reliability Engineering Manager
2 days ago
Bangalore, India RecRoots Full timeAbout the team - Private cloud: The Private Cloud group operates, orchestrates, and optimizes cloud infrastructure. The Private Cloud capabilities are provided on platform instances that are privately owned and centrally managed. These platform instances, and the workloads running on them, are hosted both in datacenters (“on-premises”) and on public...
-
Site reliability engineering manager
2 days ago
Bangalore, India RecRoots Full timeAbout the team - Private cloud: The Private Cloud group operates, orchestrates, and optimizes cloud infrastructure. The Private Cloud capabilities are provided on platform instances that are privately owned and centrally managed. These platform instances, and the workloads running on them, are hosted both in datacenters (“on-premises”) and on public...
-
Site Reliability Engineer
1 day ago
Bangalore, India tsworks Full timeWho We Are tsworks Technologies India Private Limited (subsidiary of The Software Works, Inc, USA) is a technology product and services company. Our mission is to provide domain expertise, innovative solutions and thought leadership to empower businesses to thrive in a digital world. We value our employees, take pride in providing best value in customer...
-
Site Reliability Engineer
2 days ago
Bangalore, India Integra Connect Full timeAbout IntegraConnect Integra Connect delivers a comprehensive, integrated suite of cloud-based technologies and services that enable specialty groups to optimize clinical and financial performance as reimbursement shifts to value-based models. Connected by the IntegraCloud platform, the company’s core applications span population health including care...
-
Site Reliability Engineering Manager
1 day ago
bangalore, India RecRoots Full timeAbout the team - Private cloud: The Private Cloud group operates, orchestrates, and optimizes cloud infrastructure. The Private Cloud capabilities are provided on platform instances that are privately owned and centrally managed. These platform instances, and the workloads running on them, are hosted both in datacenters (“on-premises”) and on public...
-
Senior site reliability engineer
2 days ago
Bangalore, India Ushur Full timeLocation: Bangalore Experience: 6-8 Years Work Mode: Hybrid/Remote The Role Senior Site Reliability Engineers at Ushur perform a unique blend of customer support engineering, solution engineering, and operational engineering. You will work on our largest customers’ most complex problems and craft intuitive, elegant solutions. You’ll also...
-
Lead site reliability engineer
2 days ago
Bangalore, India Groww Full timeAbout Groww We are a passionate group of people focused on making financial services accessible to every Indian through a multi-product platform. Each day, we help millions of customers take charge of their financial journey. Customer obsession is in our DNA. Every product, every design, every algorithm down to the tiniest detail is executed keeping the...
-
Site Reliability Engineer
3 days ago
Bangalore Urban, India Integra Connect Full timeAbout IntegraConnect Integra Connect delivers a comprehensive, integrated suite of cloud-based technologies and services that enable specialty groups to optimize clinical and financial performance as reimbursement shifts to value-based models. Connected by the IntegraCloud platform, the company’s core applications span population health including care...
-
Site Reliability Engineer
1 day ago
Bangalore Urban, India Integra Connect Full timeAbout IntegraConnect Integra Connect delivers a comprehensive, integrated suite of cloud-based technologies and services that enable specialty groups to optimize clinical and financial performance as reimbursement shifts to value-based models. Connected by the IntegraCloud platform, the company’s core applications span population health including care...
-
Site Reliability Engineering Manager
4 days ago
Bangalore Urban, India RecRoots Full timeAbout the team - Private cloud:The Private Cloud group operates, orchestrates, and optimizes cloud infrastructure. The Private Cloud capabilities are provided on platform instances that are privately owned and centrally managed. These platform instances, and theworkloads running on them, are hosted both in datacenters (“on-premises”) and on public cloud...
-
Senior Site Reliability Engineer
2 days ago
Bangalore, India Ushur Full timeLocation: Bangalore Experience: 6-8 Years Work Mode: Hybrid/Remote The Role Senior Site Reliability Engineers at Ushur perform a unique blend of customer support engineering, solution engineering, and operational engineering. You will work on our largest customers’ most complex problems and craft intuitive, elegant solutions. You’ll also...
-
Lead Site Reliability Engineer
2 days ago
Bangalore, India Groww Full timeAbout Groww We are a passionate group of people focused on making financial services accessible to every Indian through a multi-product platform. Each day, we help millions of customers take charge of their financial journey. Customer obsession is in our DNA. Every product, every design, every algorithm down to the tiniest detail is executed keeping the...
-
tsworks | Site Reliability Engineer
3 days ago
bangalore, India tsworks Full timeWho We Aretsworks Technologies India Private Limited (subsidiary of The Software Works, Inc, USA) is a technology product and services company. Our mission is to provide domain expertise, innovative solutions and thought leadership to empower businesses to thrive in a digital world. We value our employees, take pride in providing best value in customer...
-
bangalore, India RecRoots Full timeAbout the team - Private cloud:The Private Cloud group operates, orchestrates, and optimizes cloud infrastructure. The Private Cloud capabilities are provided on platform instances that are privately owned and centrally managed. These platform instances, and theworkloads running on them, are hosted both in datacenters (“on-premises”) and on public cloud...
-
Site Reliability Engineer
2 days ago
Bangalore, India Truelancer.com Full timeJob Title: SRE Lead (Docker + Kubernetes) Experience: 10+ years Mandatory Skills: Modern observability stack - Splunk, Elastic Search, Prometheus, Grafana Cloud-based SRE practices and experiences such as AWS, Azure, or Google Cloud Containerization technologies (e.g., Kubernetes, Docker) and microservices architecture DevOps practices Programming Skills:...
-
Senior Site Reliability Engineer
3 days ago
Bangalore Urban, India Ushur Full timeLocation: BangaloreExperience: 6-8 YearsWork Mode: Hybrid/RemoteThe RoleSenior Site Reliability Engineers at Ushur perform a unique blend of customer support engineering, solution engineering, and operational engineering. You will work on our largest customers’ most complex problems and craft intuitive, elegant solutions. You’ll also proactively work...
-
Senior Site Reliability Engineer
1 day ago
Bangalore Urban, India Ushur Full timeLocation: Bangalore Experience: 6-8 Years Work Mode: Hybrid/Remote The Role Senior Site Reliability Engineers at Ushur perform a unique blend of customer support engineering, solution engineering, and operational engineering. You will work on our largest customers’ most complex problems and craft intuitive, elegant solutions. You’ll also proactively...
-
Site Reliability Engineer
2 days ago
Bangalore, India Wipro Full timeSRE and GCP Cloud (8-10 yrs) Strong background of DevOps practices, Cloud Technologies in ensuring reliability and security of Cloud infrastructure Strong proficiency in Infrastructure as Code (IaC) tools like Terraform and Code Configuration Management Tools like Ansible in addition to expertise in scripting languages like Shell, MS Powershell, Groovy etc....