Senior Site Reliability Engineer
8 hours ago
Saviynt's AI-powered identity platform manages and governs human and non-human access to all of an organization's applications, data, and business processes. Customers trust Saviynt to safeguard their digital assets, drive operational efficiency, and reduce compliance costs. Built for the AI age, Saviynt is today helping organizations safely accelerate their deployment and usage of AI. Saviynt is recognized as the leader in identity security, with solutions that protect and empower the world's leading brands, Fortune 500 companies and government institutions. For more information, please visit Our Monitoring and Alerting team within the SaaS Operations team combines Operations Excellence with the Development Experience to deliver services at high scale, high availability with resilience by using automation and Infrastructure Code. We build reliability into our ecosystem by applying best practices in Resiliency Engineering, Automation, Observability & Chaos Testing The team comes from diverse technical backgrounds, and the responsibilities provide the opportunity for a variety of challenges. Ideal candidates will have a background in either software engineering or systems engineering with a desire to learn the other or previous experience with building and managing Monitoring and Alerting systems. We are looking for a Systems Thinking, Principal Engineer who has helped teams scale through production insights, operational automation, building observability program, developer guidance, real-time metrics, automation, automation, automation
We are looking for an experienced Senior Site Reliability Engineer to join our Product SRE team Engineering team. Reporting to the Senior Director, Site Reliability Engineering,
You'll be responsible for:
· Creating and sustaining infrastructure and tools to ensure reliable services and enhance customer experience · Collaborating with teams to enhance observability, automation, deployment, and system reliability · Developing, deploying, and managing scalable, dependable infrastructure solutions to power Zscaler's global cloud services · Collaborating with product, operations, and security teams to smoothly implement features, tools, and updates across the platform · Developing and deploying AI-powered tools to boost operational efficiency and advance engineering excellence
What We're Looking For (Minimum Qualifications) · Drive comprehensive observability for microservices and Kubernetes clusters using tools like OpenTelemetry · Build and manage automation tools to streamline deployment, patching, scaling, and infrastructure management · Build scalable portals for SRE dashboards, SLI/SLO/SLA tracking, error budgets, and executive metrics to enable data-driven decision-making · Proficient in programming and scripting with Java, Python, Go, Shell, or similar languages · Skilled in OpenStack cloud, Linux, Kafka, RabbitMQ, Prometheus, Terraform, Kubernetes, Ansible, MLOps, Generative AI, PostgreSQL, and analytics databases · Familiarity with current AWS solutions; Azure experience also considered · Containerized workloads (Prefer Helm; Related: AKS & EKS, other K8s distributions, Docker, JFrog · Logging and monitoring tools (Prefer: Prometheus, Grafana, Dataddon, AWS Cloudwatch; Related, , Azure Monitor, Log Analytics, Fluentd · Network Security (e.g. AWZ Policy, Azure Policy, VPN, Active Directory/RBAC, ACLs, NSG rules, private endpoints · Proven experience in implementing advanced observability practices and techniques at scale · Hands on experience with one or more observability tools (Prometheus, Grafana, · ELK/OpenSearch, OpenTelemetry, Datadog, etc
What Will Make You Stand Out (Preferred Qualifications) · Bachelor's in Computer Science or related field, or equivalent experience, with 4 years in Cloud-SRE, DevOps, or Systems Engineering · Strong problem-solving capabilities, excellent collaboration and communication skills, and a proactive approach to teamwork Knowledge of testing tools and frameworks
-
Senior Site Reliability Engineer
2 weeks ago
Bengaluru, Karnataka, India Akamai Full timeJob Category Site Reliability Would you like to lead modernization initiatives while building a public cloud platform from scratch Would you like to own critical services in a new public cloud platform Join our IaaS Site Reliability Engineering SRE team We design develop and operate infrastructure and services that power the backbone of our...
-
Site Reliability Engineer
3 weeks ago
Bengaluru, Karnataka, India WhiteLotus Talent Partners Full timeWe are looking for a L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes. In this role, you will focus on monitoring, basic troubleshooting, and incident response, helping to maintain high system...
-
Site Reliability Engineer
2 weeks ago
Bengaluru, Karnataka, India WhiteLotus Talent Partners Full timeWe are looking for a L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes. In this role, you will focus on monitoring, basic troubleshooting, and incident response, helping to maintain high system...
-
Site Reliability Engineer
2 weeks ago
Bengaluru, Karnataka, India Synechron Full timeWe have immediate opportunity for SRE (Senior Site Reliability Engineer) 5 to 9 years. Job Role: - SRE (Senior Site Reliability Engineer)We began life in 2001 as a small, self-funded team of technology specialists. Innovative tech solutions for business We're now a leading global digital consulting firm, providing innovative technology solutions for...
-
Site Reliability Engineer
4 days ago
Bengaluru, Karnataka, India WhiteLotus Talent Partners Full time ₹ 9,00,000 - ₹ 12,00,000 per yearWe are looking for aL0 and L1 Site Reliability Engineer (SRE) Supportto join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered byOpenStackandKubernetes. In this role, you will focus onmonitoring,basic troubleshooting, andincident response, helping to maintain high system availability,...
-
Senior Site Reliability Engineer
2 weeks ago
Bengaluru, Karnataka, India beBeeSiteReliability Full time ₹ 20,00,000 - ₹ 30,00,000As a senior site reliability engineer, you will play a critical role in ensuring the stability and scalability of financial platforms.Key Responsibilities:Ensure defined SLAs, SLOs, and SLIs are met for performance, reliability, and uptime.Build automation for deployments, monitoring, scaling, and self-healing capabilities to reduce manual effort and...
-
Senior Site Reliability Engineer
2 weeks ago
Bengaluru, Karnataka, India Procore Full time ₹ 5,00,000 - ₹ 8,00,000 per yearJob DescriptionWe're looking for a Senior Site Reliability Engineer to join Procore's Product & Technology Team. Procore software solutions aim to improve the lives of everyone in construction and the people within Product & Technology are the driving force behind our innovative, top-rated global platform. We're a customer-centric group that encompasses...
-
Senior Site Reliability Engineer
2 weeks ago
Bengaluru, Karnataka, India Procore Technologies Full time ₹ 1,04,000 - ₹ 1,30,878 per yearJob Description We're looking for a Senior Site Reliability Engineer to join Procore's Product & Technology Team. Procore software solutions aim to improve the lives of everyone in construction and the people within Product & Technology are the driving force behind our innovative, top-rated global platform. We're a customer-centric group that encompasses...
-
Site Reliability Engineer
5 hours ago
Bengaluru, Karnataka, India AppHelix Full time ₹ 9,00,000 - ₹ 12,00,000 per yearRole DescriptionThis is a full-time on-site role located in Bengaluru for a Site Reliability Engineer. The Site Reliability Engineer will be responsible for maintaining and improving the reliability of AppHelix's systems. Daily tasks include monitoring system performance, troubleshooting issues, managing infrastructure, and supporting software development....
-
Senior Site Reliability Engineer
2 weeks ago
Bengaluru, Karnataka, India Procore Technologies Full time ₹ 1,04,000 - ₹ 1,30,878 per yearJob Description We're looking for a Senior Site Reliability Engineer to join Procore's Product & Technology Team. Procore software solutions aim to improve the lives of everyone in construction and the people within Product & Technology are the driving force behind our innovative, top-rated global platform. We're a customer-centric group that encompasses...