
Senior Site Reliability Engineer
1 week ago
Senior Site Reliability Engineer (SRE) – Job Description

Key Responsibilities

SRE & Application Reliability
- Implement and tune SLOs/SLIs, build reliability dashboards, and respond to incidents using Grafana IRM, JSM, and escalation workflows.
- Monitor application performance and availability across Kubernetes clusters using Grafana, Prometheus, Loki, Mimir, and Tempo.
- Participate in on-call rotation, postmortems, and continual improvement processes.

Application Support & Troubleshooting
- Act as the primary escalation point for production issues, whether internal or client-facing.
- Monitor logs, traces, and alerts to proactively identify and resolve incidents.
- Debug issues across the stack: Kubernetes, Helm releases, application logs, API errors, database bottlenecks.
- Coordinate with development, QA, and client teams to ensure timely and effective resolution of issues.

DevOps & Infrastructure Automation
- Implement GitOps workflows using FluxCD and ArgoCD to manage Kubernetes deployments.
- Manage and maintain infrastructure-as-code using Terraform, Terragrunt, and Azure (Preferred).
- Automate CI/CD pipelines with GitHub Actions for Docker image builds, Helm-based deployments, release tagging, etc.

Post-QA & Release Validation
- Work closely with QA engineers to validate release branches, tag images, and verify integration across services.
- Test application functionality post deployments (sanity and product functional tests).
- Assist in defining performance benchmarks (e.g., pgBench for PostgreSQL clusters) and validate pre-production readiness.

Must-Have Qualifications
- 6–8 years of experience in DevOps, SRE, or Production Support roles.
- Strong hands-on experience with Azure and Kubernetes (AKS preferred) and Helm/Kustomize.
- Solid knowledge of GitHub Actions, GitOps (FluxCD/ArgoCD), and Terraform/Terragrunt.
- Experience with monitoring/logging stacks: Grafana, Prometheus, Loki, Tempo, Mimir, and Incident Response tools.
- Experience debugging microservices written in Node.js, Go, or similar.
- Excellent troubleshooting and debugging skills across the stack.
-
Senior Site Reliability Engineer
3 weeks ago
Bengaluru, Karnataka, India Akamai Full time
Job Category: Site Reliability. Would you like to lead modernization initiatives while building a public cloud platform from scratch? Would you like to own critical services in a new public cloud platform? Join our IaaS Site Reliability Engineering (SRE) team. We design, develop, and operate infrastructure and services that power the backbone of our...
-
Site Reliability Engineer
4 weeks ago
Bengaluru, Karnataka, India WhiteLotus Talent Partners Full time
We are looking for an L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes. In this role, you will focus on monitoring, basic troubleshooting, and incident response, helping to maintain high system...
-
Site Reliability Engineer
3 weeks ago
Bengaluru, Karnataka, India WhiteLotus Talent Partners Full time
We are looking for an L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes. In this role, you will focus on monitoring, basic troubleshooting, and incident response, helping to maintain high system...
-
Site Reliability Engineer
1 week ago
Bengaluru, Karnataka, India WhiteLotus Talent Partners Full time ₹ 9,00,000 - ₹ 12,00,000 per year
We are looking for an L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes. In this role, you will focus on monitoring, basic troubleshooting, and incident response, helping to maintain high system availability,...
-
Site Reliability Engineer
2 weeks ago
Bengaluru, India WhiteLotus Talent Partners Full time
We are looking for an L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes. In this role, you will focus on monitoring, basic troubleshooting, and incident response, helping to maintain high system...
-
Site Reliability Engineer
1 week ago
Bengaluru, India Synechron Full time
We have an immediate opportunity for an SRE (Senior Site Reliability Engineer) with 5 to 9 years of experience.
Synechron – Bangalore
Job Role: SRE (Senior Site Reliability Engineer)
Job Location: Bangalore
Notice Period: Within 30 days
About Synechron
We began life in 2001 as a small, self-funded team of technology specialists. Since then, we’ve grown our organization to...
-
Senior Site Reliability Engineer
6 days ago
Bengaluru, Karnataka, India Saviynt Full time ₹ 12,00,000 - ₹ 36,00,000 per year
About the job
Saviynt's AI-powered identity platform manages and governs human and non-human access to all of an organization's applications, data, and business processes. Customers trust Saviynt to safeguard their digital assets, drive operational efficiency, and reduce compliance costs. Built for the AI age, Saviynt is today helping organizations safely...
-
Senior Site Reliability Engineer
2 weeks ago
Bengaluru, India Procore Full time
Job Description
We're looking for a Senior Site Reliability Engineer to join Procore's Product & Technology Team. Procore software solutions aim to improve the lives of everyone in construction and the people within Product & Technology are the driving force behind our innovative, top-rated global platform. We're a customer-centric group that encompasses...
-
Site reliability engineer
2 weeks ago
Bengaluru, India Synechron Full time
We have an immediate opportunity for an SRE (Senior Site Reliability Engineer) with 5 to 9 years of experience.
Synechron – Bangalore
Job Role: SRE (Senior Site Reliability Engineer)
Job Location: Bangalore
Notice Period: Within 30 days
About Synechron
We began life in 2001 as a small, self-funded team of technology specialists. Since then, we’ve grown our organization to 14,500+...