
Site Reliability Expert
3 days ago
We are seeking a highly skilled Senior Site Reliability Engineer to join our team. As a key member of our engineering organization, you will be responsible for ensuring the reliability and performance of our applications.
The ideal candidate will have strong hands-on experience with Azure and Kubernetes (AKS preferred) and Helm/Kustomize. Solid knowledge of GitHub Actions, GitOps (FluxCD/ArgoCD), and Terraform/Terragrunt is also required.
You will participate in on-call rotation, postmortems, and continual improvement processes. Your strong troubleshooting and debugging skills across the stack will enable you to succeed in this role.
- Main Responsibilities:
- Implement and tune service level objectives (SLOs) and service level indicators (SLIs), build reliability dashboards, and respond to incidents using Grafana IRM, JSM, and escalation workflows.
- Monitor application performance and availability across Kubernetes clusters using Grafana, Prometheus, Loki, Mimir, and Tempo.
- Participate in on-call rotation, postmortems, and continual improvement processes.
- Act as the primary escalation point for production issues – whether internal or client-facing.
- Monitor logs, traces, and alerts to proactively identify and resolve incidents.
- Debug issues across the stack: Kubernetes, Helm releases, application logs, API errors, database bottlenecks.
- Coordinate with development, QA, and client teams to ensure timely and effective resolution of issues.
- Implement GitOps workflows using FluxCD and ArgoCD to manage Kubernetes deployments.
- Manage and maintain infrastructure-as-code using Terraform, Terragrunt, and Azure (Preferred).
- Automate CI/CD pipelines with GitHub Actions for Docker image builds, Helm-based deployments, release tagging, etc.
- Work closely with QA engineers to validate release branches, tag images, and verify integration across services.
- Test application functionality post deployments (sanity and product functional tests).
- Assist in defining performance benchmarks (e.g., pgBench for PostgreSQL clusters) and validate pre-production readiness.
-
Site Reliability Engineer, AVP
3 days ago
Bengaluru, Karnataka, India Deutsche Bank Full timeJob DescriptionSite Reliability Engineer, AVPPosition OverviewJob Title: Site Reliability Engineer, AVPLocation: Bangalore, IndiaCorporate Title: AVPRole DescriptionTechnology/Service is responsible for delivering the business vision and strategy, at a global level, focusing on achieving consistent operational excellence and client/user satisfaction through...
-
Site Reliability Engineer
2 weeks ago
Bengaluru, Karnataka, India Awign Expert Full timeWe are seeking a skilled and proactive engineer with expertise in Kubernetes, Java-based applications, and cloud platforms (AWS/Azure/GCP) , along with experience in ServiceNow for support ticket management. The ideal candidate will be responsible for maintaining cloud-native applications, troubleshooting production issues, and ensuring smooth operations...
-
Site Reliability Engineer
2 weeks ago
Bengaluru, Karnataka, India Awign Expert Full timeWe are seeking a skilled and proactive engineer with expertise in Kubernetes, Java-based applications, and cloud platforms (AWS/Azure/GCP) , along with experience in ServiceNow for support ticket management. The ideal candidate will be responsible for maintaining cloud-native applications, troubleshooting production issues, and ensuring smooth operations...
-
Site Reliability Engineer III
2 weeks ago
Bengaluru, Karnataka, India Chase- Candidate Experience page Full time ₹ 15,00,000 - ₹ 20,00,000 per yearThere's nothing more exciting than being at the center of a rapidly growing field in technology and applying your skillsets to drive innovation and modernize the world's most complex and mission-critical systems.As a Site Reliability Engineer III at JPMorgan Chase within the Employee Platforms team, you will solve complex and broad business problems with...
-
Site Reliability Engineer
7 days ago
Bengaluru, Karnataka, India WhiteLotus Talent Partners Full timeWe are looking for a L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes . In this role, you will focus on monitoring , basic troubleshooting , and incident response , helping to maintain high...
-
Site Reliability Engineer
3 days ago
Bengaluru, Karnataka, India WhiteLotus Talent Partners Full timeWe are looking for a L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes. In this role, you will focus on monitoring, basic troubleshooting, and incident response, helping to maintain high system...
-
Site Reliability Engineer
2 hours ago
Bengaluru, Karnataka, India H&M Full time ₹ 1,04,000 - ₹ 1,30,878 per yearJob DescriptionWe are looking for a Site Reliability Engineer within eCommerce with experience of Headless SaaS (e.g., a headless CMS experience) and API based commerce frameworks and managed cloud services (e.g. managed Kubernetes). You will work within our SRE Capability supporting the next generation customer experience by blending fashion and tech. You...
-
Site Reliability Engineer
1 week ago
Bengaluru, Karnataka, India beBeeReliability Full timeSystem Reliability ExpertWe are looking for a System Reliability Expert to join our team. The ideal candidate will have a strong background in software and systems engineering, with expertise in coding, algorithms, complexity analysis, and large-scale system design.About the RoleThe System Reliability Expert will be responsible for managing the end-to-end...
-
Site Reliability Engineer
5 days ago
Bengaluru, Karnataka, India Enterprise Minds, Inc Full timeWe're Hiring | Site Reliability Engineer | 8-10 years
-
Site Reliability Engineer
4 weeks ago
Bengaluru, Karnataka, India myGwork - LGBTQ+ Business Community Full timeJob DescriptionThis job is with eBay, an inclusive employer and a member of myGwork the largest global platform for the LGBTQ+ business community. Please do not contact the recruiter directly.At eBay, we&aposre more than a global ecommerce leader - we&aposre changing the way the world shops and sells. Our platform empowers millions of buyers and sellers in...