Site Reliability Engineer

11 hours ago


Malappuram, Kerala, India beBeeReliability Full time ₹ 1,80,00,000 - ₹ 2,50,00,000
Job Description

As a seasoned site reliability engineer, you will be responsible for owning availability, latency, and performance of our SaaS on Azure.

You will define and enforce reliability standards, lead high-impact projects, mentor engineers, and eliminate toil at scale. You will report directly to the Director of Site Reliability Engineering.

Key Responsibilities:
  • Define customer-centric service level indicators (SLIs) and service level objectives (SLOs) for Tier-0/Tier-1 services, publish, review quarterly, and align teams to them.
  • Implement error budgeting policy and tooling, including multi-window alerts, clear runbooks, and paging thresholds.
  • Gate changes by budget status (freeze/relax rules) wired into continuous integration and deployment (CI/CD).
  • Maintain SLO/error budget dashboards (Azure Monitor, Grafana/Prometheus, App Insights). Run weekly SLO reviews with engineering/product.
  • Drive roadmap tradeoffs when budgets are at risk; land reliability epics.
  • Lead severe incident responses without drama: own communications, run blameless postmortems, and make corrective actions stick.
  • Engineer reliability in: Multi-AZ/region patterns (active-active/disaster recovery), Pod Topology Spread, Horizontal Pod Autoscaling (HPA), resilient rollout/rollback.
  • Harden Kubernetes clusters (network, identity, policy), optimize node/pod density, ingress (AGIC/Nginx); mesh optional.
  • Implement observability that works: Metrics/traces/logs with Azure Monitor/App Insights, Log Analytics, Prometheus/Grafana, OpenTelemetry. Alert on symptoms, not noise.
  • Use infrastructure-as-code and policy-as-code: Terraform/Bicep modules, GitOps (Flux/Argo), policy-as-code (Azure Policy/Opa Gatekeeper). No custom solutions.
  • Ensure CI/CD reliability: Azure DevOps/GitHub Actions with canary/blue-green deployments, progressive delivery, auto-rollback, Key Vault-backed secrets.
  • Partner with FinOps to reduce spend without hurting SLIs through load testing, right-sizing, autoscaling.
  • Define disaster recovery you can trust: Recovery Time Objective (RTO), Recovery Point Objective (RPO), test backups/restore, run game days/chaos drills, validate asynchronous replication and multi-region failover.
  • Secure by default: Entra ID (Azure AD), managed identities, Key Vault rotation, VNets/NSGs/Private Link, shift-left checks in CI.
  • Reduce toil: Automate recurring operations, build self-service runbooks/chatops, publish golden paths for product teams.
  • Be the technical owner on customer escalations; communicate tradeoffs and recovery plans with authority.
  • Document architectures, runbooks, postmortems, SLIs/SLOs—kept current and discoverable.

Benefits: Opportunity to work on large-scale systems, collaborate with experienced engineers, and develop your skills in reliability engineering.

Required Skills and Qualifications: Proven experience in site reliability engineering, strong understanding of cloud computing and containerization, excellent communication and problem-solving skills.



  • Malappuram, Kerala, India CorroHealth Full time

    We are seeking a highly skilled Site Reliability Engineer (SRE) to join our team. The ideal candidate will have a deep understanding of both software engineering and systems administration, with a focus on creating scalable and reliable systems. You will work closely with development and operations teams to ensure the reliability, availability, and...


  • Malappuram, Kerala, India beBeeEngineering Full time ₹ 1,20,00,000

    Job Title:Site Reliability Engineering LeaderJob SummaryWe are seeking a seasoned Site Reliability Engineer to lead our remote team in driving operational excellence and fostering a high-performing culture.Main Responsibilities:To provide leadership and management to a remote team of Site Reliability Engineers, ensuring seamless collaboration and efficient...


  • Malappuram, Kerala, India beBeeSre Full time ₹ 1,80,00,000 - ₹ 2,50,00,000

    Job Title: Site Reliability Engineering ManagerThe SRE Manager will lead the reliability engineering function, ensuring infrastructure resiliency and optimal operational performance.This role blends technical leadership with team mentorship and cross-functional coordination.Establish and lead the implementation of organizational reliability strategies,...


  • Malappuram, Kerala, India beBeeResilience Full time ₹ 23,00,000 - ₹ 25,00,000

    Senior SRE Role OverviewThis position focuses on advancing Site Reliability Engineering (SRE) best practices within an enterprise-level organization.


  • Malappuram, Kerala, India beBeeReliability Full time ₹ 30,00,000 - ₹ 40,00,000

    Transform your career with an exciting opportunity as a VP – Site Reliability Engineering.As a member of our elite team, you will work closely with the latest technologies and collaborate with some of the brightest minds in the industry to shape the future of site reliability engineering.This is a unique chance to drive innovation and growth by defining,...


  • Malappuram, Kerala, India beBeeReliability Full time ₹ 1,20,00,000 - ₹ 1,80,00,000

    Site Reliability EngineerWe are seeking a skilled and experienced Site Reliability Engineer to join our team. As a key member of our organization, you will play a vital role in ensuring the reliability and scalability of our products and services.This is an exceptional opportunity for someone who enjoys solving complex problems and working collaboratively...


  • Malappuram, Kerala, India beBeeSite Full time ₹ 90,00,000 - ₹ 1,54,00,000

    About the RoleThe Senior Site Reliability Engineer will play a crucial role in driving operational excellence by ensuring the reliability, scalability, and performance of mission-critical systems.


  • Malappuram, Kerala, India beBeeReliabilityEngineer Full time ₹ 40,00,000 - ₹ 50,00,000

    Senior Site Reliability EngineerWe are seeking an experienced Senior Site Reliability Engineer to play a critical role in ensuring the stability and scalability of financial platforms.About the Role:Ensure that financial systems meet defined performance, reliability, and uptime standards.Build automation for deployments, monitoring, scaling, and self-healing...


  • Malappuram, Kerala, India beBeeSoftware Full time ₹ 1,50,00,000 - ₹ 2,00,00,000

    We are seeking an experienced Software Engineer with a focus on Site Reliability to join our team. The ideal candidate will have a strong background in ensuring the stability, scalability, and operational excellence of accounting and finance platforms.

  • Site Engineer

    5 days ago


    Malappuram, Kerala, India Sketis Group Full time ₹ 2,40,000 per year

    We are seeking a dynamic and efficient Site Engineer to oversee construction projects from inception to completion. You will provide technical guidance to teams, monitor project progress, and ensure work aligns with engineering standards, safety regulations, and design specifications.ResponsibilitiesSupervise and manage daily construction activities onsite...