Senior Cloud Reliability Advocate

1 day ago


Amrāvati, Maharashtra, India | beBeeReliability | Full time | US$ 1,45,824 - US$ 2,43,984
Job Overview:

The Senior Site Reliability Engineer is responsible for the availability, latency, performance, and efficiency of our SaaS platform on Azure.

This individual will define and enforce reliability standards, lead high-impact projects, mentor engineers, and eliminate toil at scale. They will report directly to the Director of Site Reliability.

Key responsibilities include:

  • Defining customer-centric Service Level Indicators (SLIs) and Service Level Objectives (SLOs) for Tier-0/Tier-1 services.
  • Implementing an error-budget policy with multi-window, multi-burn-rate alerts, clear runbooks, and paging thresholds.
  • Gating changes by budget status (freeze/relax rules) wired into CI/CD.
  • Maintaining SLO and error-budget dashboards (Azure Monitor, Grafana/Prometheus, App Insights) and running weekly SLO reviews with engineering and product.
  • Driving roadmap tradeoffs when budgets are at risk; landing reliability epics.
  • Incident management without drama: Leading SEV1/SEV2 incidents, owning comms, running blameless postmortems, and making corrective actions stick.
  • Engineering reliability in: Multi-AZ/region patterns (active-active/DR), PDBs/Pod Topology Spread, HPA/VPA/KEDA, resilient rollout/rollback.
  • AKS at scale: Hardening clusters (network, identity, policy), optimizing node/pod density, and managing ingress (AGIC/NGINX); service mesh optional.
  • Observability that works: Metrics/traces/logs with Azure Monitor/App Insights, Log Analytics, Prometheus/Grafana, OpenTelemetry. Alerting on symptoms, not noise.
  • IaC & policy: Terraform/Bicep modules, GitOps (Flux/Argo), policy-as-code (Azure Policy/OPA Gatekeeper). No snowflakes.
  • CI/CD reliability: Azure DevOps/GitHub Actions with canary/blue-green, progressive delivery, auto-rollback, Key Vault-backed secrets.
  • Capacity & performance: Load testing, right-sizing, autoscaling; partnering with FinOps to reduce spend without hurting SLOs.
  • Disaster Recovery you can trust: Defining RTO/RPO, testing backups/restore, running game days/chaos drills, validating ASR and multi-region failover.
  • Secure by design: Entra ID (Azure AD), managed identities, Key Vault rotation, VNets/NSGs/Private Link, shift-left checks in CI.
  • Reducing toil: Automating recurring ops, building self-service runbooks/chatops, publishing golden paths for product teams.
  • Customer escalations: Being the technical owner on calls; communicating tradeoffs and recovery plans with authority.
  • Documenting for scale: Architectures, runbooks, postmortems, SLIs/SLOs—kept current and discoverable.

Minimum Qualifications:

  • Bachelor’s in Computer Science/Engineering (or equivalent experience).
  • 12+ years in production ops/platform/SRE, including 5+ years on Azure.
  • PostgreSQL (must-have): Deep operational expertise, including HA/DR, logical/physical replication, performance tuning (indexes, EXPLAIN/ANALYZE, pg_stat_statements), autovacuum strategy, partitioning, backup/restore testing, and connection pooling (PgBouncer). Experience with Azure Database for PostgreSQL – Flexible Server is preferred.
  • Azure core: AKS (must-have); Front Door/App Gateway, API Management, VNets/NSGs/Private Link, Storage, Key Vault, Redis, Service Bus/Event Hubs.
  • Observability: Azure Monitor/App Insights, Log Analytics, Prometheus/Grafana; SLO design and error-budget operations.
  • IaC/automation: Terraform and/or Bicep; PowerShell and Python; GitOps (Flux/Argo). Pipelines in Azure DevOps or GitHub Actions.
  • Proven incident leadership at scale, blameless postmortems, and SLO/error-budget governance with change gating.
  • Mentorship and crisp written/verbal communication.

Preferred Qualifications:

  • Apache NiFi, Apache Flink, Apache Kafka or Redpanda (self-managed on AKS or managed equivalents); schema management, exactly-once semantics, backpressure, dead-letter/replay patterns.
  • Azure Solutions Architect Expert, CKA/CKAD.
  • ITSM (ServiceNow), on-call tooling (PagerDuty/Opsgenie).
  • Compliance/SecOps (SOC 2, ISO 27001), policy-as-code, workload identity.
  • OpenTelemetry, eBPF tooling, or service mesh.
  • Multi-tenant SaaS and cost optimization at scale.

