Senior Site Reliability Engineer

3 days ago


Bangalore Karnataka, India Saviynt Full time

About the job Saviynt s AI-powered identity platform manages and governs human and non-human access to all of an organization s applications data and business processes Customers trust Saviynt to safeguard their digital assets drive operational efficiency and reduce compliance costs Built for the AI age Saviynt is today helping organizations safely accelerate their deployment and usage of AI Saviynt is recognized as the leader in identity security with solutions that protect and empower the world s leading brands Fortune 500 companies and government institutions For more information please visit Our Monitoring and Alerting team within the SaaS Operations team combines Operations Excellence with the Development Experience to deliver services at high scale high availability with resilience by using automation and Infrastructure Code We build reliability into our ecosystem by applying best practices in Resiliency Engineering Automation Observability Chaos Testing The team comes from diverse technical backgrounds and the responsibilities provide the opportunity for a variety of challenges Ideal candidates will have a background in either software engineering or systems engineering with a desire to learn the other or previous experience with building and managing Monitoring and Alerting systems We are looking for a Systems Thinking Principal Engineer who has helped teams scale through production insights operational automation building observability program developer guidance real-time metrics automation automation automation We are looking for an experienced Senior Site Reliability Engineer to join our Product SRE team Engineering team Reporting to the Senior Director Site Reliability Engineering You ll be responsible for Creating and sustaining infrastructure and tools to ensure reliable services and enhance customer experience Collaborating with teams to enhance observability automation deployment and system reliability Developing deploying and managing scalable dependable infrastructure solutions to power Zscaler s global cloud services Collaborating with product operations and security teams to smoothly implement features tools and updates across the platform Developing and deploying AI-powered tools to boost operational efficiency and advance engineering excellence What We re Looking For Minimum Qualifications Drive comprehensive observability for microservices and Kubernetes clusters using tools like OpenTelemetry Build and manage automation tools to streamline deployment patching scaling and infrastructure management Build scalable portals for SRE dashboards SLI SLO SLA tracking error budgets and executive metrics to enable data-driven decision-making Proficient in programming and scripting with Java Python Go Shell or similar languages Skilled in OpenStack cloud Linux Kafka RabbitMQ Prometheus Terraform Kubernetes Ansible MLOps Generative AI PostgreSQL and analytics databases Familiarity with current AWS solutions Azure experience also considered Containerized workloads Prefer Helm Related AKS EKS other K8s distributions Docker JFrog Logging and monitoring tools Prefer Prometheus Grafana Dataddon AWS Cloudwatch Related Azure Monitor Log Analytics Fluentd Network Security e g AWZ Policy Azure Policy VPN Active Directory RBAC ACLs NSG rules private endpoints Proven experience in implementing advanced observability practices and techniques at scale Hands on experience with one or more observability tools Prometheus Grafana ELK OpenSearch OpenTelemetry Datadog etc What Will Make You Stand Out Preferred Qualifications Bachelor s in Computer Science or related field or equivalent experience with 4 years in Cloud-SRE DevOps or Systems Engineering Strong problem-solving capabilities excellent collaboration and communication skills and a proactive approach to teamwork Knowledge of testing tools and frameworks



  • Bangalore, India WhiteLotus Talent Partners Full time

    We are looking for a L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by Open Stack and Kubernetes . In this role, you will focus on monitoring , basic troubleshooting , and incident response , helping to maintain high...


  • Bangalore, India WhiteLotus Talent Partners Full time

    We are looking for a L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes . In this role, you will focus on monitoring , basic troubleshooting , and incident response , helping to...


  • bangalore, India WhiteLotus Talent Partners Full time

    We are looking for a L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes. In this role, you will focus on monitoring, basic troubleshooting, and incident response, helping to maintain high system...


  • Bangalore, India Synechron Full time

    We have immediate opportunity for SRE (Senior Site Reliability Engineer) 5 to 9 years. Synechron – Bangalore Job Role: - SRE (Senior Site Reliability Engineer) Job Location: - Bangalore Notice Period: Within 30days About Synechron We began life in 2001 as a small, self-funded team of technology specialists. Since then, we’ve grown our...


  • Bangalore, Karnataka, India Pearson Full time

    Job Category Technology Role Overview Learning The Associate Site Reliability Engineer s SRE primary focus will be on acquiring and honing the essential skills required to excel in the role They will work closely with more experienced engineers who will mentor and guide them throughout their journey The responsibilities will encompass various...


  • Bangalore, Karnataka, India NatWest Group Full time

    Join us as a Site Reliability Engineer In this key role you ll support the improvement of non-functional and operational characteristics such as availability performance efficiency change management monitoring security incident response and capacity planning of our products and services You ll enjoy significant stakeholder interaction working in...


  • Bangalore, Karnataka, India NatWest Group Full time

    Join us as a Site Reliability Engineer In this key role youll support the improvement of non-functional and operational characteristics such as availability performance efficiency change management monitoring security incident response and capacity planning of our products and services Youll enjoy significant stakeholder interaction working in...


  • Bangalore, Karnataka, India JPMorgan Chase Full time

    Job Category Software Engineering There s nothing more exciting than being at the center of a rapidly growing field in technology and applying your skillsets to drive innovation and modernize the world s most complex and mission-critical systems As a Site Reliability Engineer III at JPMorgan Chase within the Employee Platforms team you will solve...


  • Bangalore, India Synechron Full time

    We have immediate opportunity for SRE (Senior Site Reliability Engineer) 5 to 9 years. Synechron – Bangalore Job Role: - SRE (Senior Site Reliability Engineer) Job Location: - Bangalore Notice Period: Within 30days About Synechron We began life in 2001 as a small, self-funded team of technology specialists. Since then, we’ve grown...


  • bangalore, India Synechron Full time

    We have immediate opportunity for SRE (Senior Site Reliability Engineer) 5 to 9 years.Synechron – BangaloreJob Role: - SRE (Senior Site Reliability Engineer)Job Location: - BangaloreNotice Period: Within 30daysAbout SynechronWe began life in 2001 as a small, self-funded team of technology specialists. Since then, we’ve grown our organization to 14,500+...