High Availability Specialist

17 hours ago


Kurnool, Andhra Pradesh, India beBeeReliability Full time ₹ 80,00,000 - ₹ 1,50,00,000
Site Reliability Engineering Leader

You will be responsible for owning and delivering high availability, performance, and efficiency across our SaaS platform on Azure.

As a Site Reliability Engineer, you will define and enforce reliability standards, lead impactful projects, mentor engineers, and eliminate toil at scale.

  • Service Level Indicators/Service Level Objectives & Contracts: Define customer-centric SLIs/SLOs for Tier-0/Tier-1 services. Publish, review quarterly, and align teams to them.
  • Error Budgeting (Policy & Tooling):
  • Run the error-budget policy with multi-window, multi-burn-rate alerts; clear runbooks and paging thresholds.
  • Gate changes by budget status (freeze/relax rules) wired into CI/CD.
  • Maintain SLO/EB dashboards (Azure Monitor, Grafana/Prometheus, App Insights). Run weekly SLO reviews with engineering/product.
  • Drive roadmap tradeoffs when budgets are at risk; land reliability epics.
  • Incidents without Drama: Lead SEV1/SEV2, own comms, run blameless postmortems, and make corrective actions stick.
  • Engineer reliability in: Multi-AZ/region patterns (active-active/DR), PDBs/Pod Topology Spread, HPA/VPA/KEDA, resilient rollout/rollback.
  • Azure Kubernetes Service (AKS) at Scale: Harden clusters (network, identity, policy), optimize node/pod density, ingress (AGIC/Nginx); mesh optional.
  • Observability that works: Metrics/traces/logs with Azure Monitor/App Insights, Log Analytics, Prometheus/Grafana, OpenTelemetry. Alert on symptoms, not noise.
  • Infra as Code & Policy: Terraform/Bicep modules, GitOps (Flux/Argo), policy-as-code (Azure Policy/OPA Gatekeeper). No snowflakes.
  • CI/CD reliability: Azure DevOps/GitHub Actions with canary/blue-green, progressive delivery, auto-rollback, Key Vault-backed secrets.
  • Capacity & Performance: Load testing, right-sizing, autoscaling; partner with FinOps to reduce spend without hurting SLOs.
  • Disaster Recovery (DR) you can trust: Define RTO/RPO, test backups/restore, run game days/chaos drills, validate ASR and multi-region failover.
  • Secure by Default: Entra ID (Azure AD), managed identities, Key Vault rotation, VNets/NSGs/Private Link, shift-left checks in CI.
  • Reduce toil: Automate recurring ops, build self-service runbooks/chatops, publish golden paths for product teams.
  • Customer Escalations: Be the technical owner on calls; communicate tradeoffs and recovery plans with authority.
  • Document to scale: Architectures, runbooks, postmortems, SLIs/SLOs—kept current and discoverable.
  • (If applicable) Streaming/ETL reliability: Apply SRE practices (SLOs, backpressure, idempotency, replay) to NiFi/Flink/Kafka/Redpanda data flows.


  • Kurnool, Andhra Pradesh, India beBeeInfrastructure Full time ₹ 90,00,000 - ₹ 1,20,00,000

    Job Title: High Availability EngineerProvide technical leadership through knowledge sharing, code reviews, and solution design.Role Responsibilities:Enhance system performance and reliability.Automate manual processes.Collaborate with globally dispersed SRE teams.Implement solutions using Infrastructure as Code.Monitor and optimize databases for high...


  • Kurnool, Andhra Pradesh, India beBeeAvailability Full time ₹ 1,50,00,000 - ₹ 2,00,00,000

    Job Description">">We are looking for a seasoned Senior Site Reliability Engineer with deep expertise in the Elastic Stack (ELK) to join our Platform Engineering Practice. This is a high-impact engineering opportunity focused on performance, observability, and operational excellence at scale.">">Responsibilities">">Design, manage, and scale large-scale ELK...


  • Kurnool, Andhra Pradesh, India beBeePerformance Full time ₹ 2,00,00,000 - ₹ 2,50,00,000

    Job Title: High Performance SpecialistLocation: Hyderabad/Pune/Mysore/BangaloreTo excel in this role, you should have a strong background in software development or performance testing and engineering with systems analysis. Experience working with agile practices is highly valued.You will be responsible for engaging in the entire lifecycle of services - from...


  • Kurnool, Andhra Pradesh, India beBeeSystemReliability Full time ₹ 1,00,00,000 - ₹ 2,00,00,000

    About the RoleWe are seeking a skilled Site Reliability Engineer to join our team and contribute to ensuring the reliability, scalability, and performance of our critical systems.Key Responsibilities:Design and implement scalable and reliable infrastructure for production systems.Automate operational tasks, deployments, and monitoring to improve...


  • Kurnool, Andhra Pradesh, India beBeePerformance Full time ₹ 18,00,000 - ₹ 22,00,000

    Senior Performance Testing SpecialistYou will be responsible for ensuring the high-performance and scalability of software applications.Plan, design, and execute performance tests to identify bottlenecks and areas for improvement.Develop detailed test plans, scripts, and reports to ensure thorough testing coverage.Collaborate with cross-functional teams to...


  • Kurnool, Andhra Pradesh, India beBeeAutomation Full time ₹ 18,00,000 - ₹ 24,00,000

    Automation Specialist Role We are seeking an experienced Automation Specialist to join our engineering team. This key member will design, develop and maintain automated test suites using Selenium WebDriver and related frameworks. The ideal candidate will contribute to mobile automation with Appium and web automation with Playwright, perform API testing...


  • Kurnool, Andhra Pradesh, India beBeeDatabase Full time ₹ 10,00,000 - ₹ 15,00,000

    Job Title: Senior Oracle Database SpecialistKey Responsibilities:Manage and maintain high-performance Oracle databases in a production environment, ensuring optimal performance and availability.Oversee the installation, configuration, performance tuning, backup, and recovery of Oracle databases to meet business requirements.Design and implement...


  • Kurnool, Andhra Pradesh, India beBeeCloud Full time ₹ 15,00,000 - ₹ 25,00,000

    Cloud Architecture Specialist JobWe are seeking a highly skilled Cloud Architecture Specialist to design and implement scalable cloud solutions.Design and implement cloud architectures for optimal scalability and performanceDevelop strategies for automated infrastructure provisioning and deployment using TerraformMigrate applications to the cloud ensuring...


  • Kurnool, Andhra Pradesh, India beBeeVisual Full time ₹ 8,00,000 - ₹ 10,00,000

    Visual Design SpecialistAbout the RoleWe are seeking meticulous and creatively attuned individuals to contribute to the training and refinement of large language models (LLMs).Evaluate and Review: Critically assess visual design tasks and outputs based on internal guidelines.Quality Assurance: Ensure submitted data and responses align with principles of...


  • Kurnool, Andhra Pradesh, India beBeeReliability Full time ₹ 20,00,000 - ₹ 25,00,000

    System Reliability ExpertWe are seeking an experienced System Reliability Expert to join our team. This role involves managing the entire application and system stack, ensuring high reliability, scalability, and performance of distributed systems.Key Responsibilities:Software Development Lifecycle: Engage in and improve the software development lifecycle –...