
Site Reliability Engineer
6 days ago
Job Title: Senior Site Reliability Engineer II
We are seeking a highly skilled Senior Site Reliability Engineer to join our team. The ideal candidate will have a strong background in production operations, platform engineering, and SRE, with expertise in Azure.
About the RoleThis role is responsible for ensuring the availability, latency, performance, and efficiency of our SaaS on Azure. You will define and enforce reliability standards, lead high-impact projects, mentor engineers, and eliminate toil at scale.
Key Responsibilities- Define SLIs/SLOs & contracts: Define customer-centric SLIs/SLOs for Tier-0/Tier-1 services, publish quarterly reviews, and align teams with them.
- Error budgeting: Run the error-budget policy with multi-window, multi-burn-rate alerts; clear runbooks and paging thresholds.
- Gates by budget status: Wire freeze/relax rules into CI/CD.
- Maintain SLO/EB dashboards: Use Azure Monitor, Grafana/Prometheus, App Insights, and run weekly SLO reviews with engineering/product.
- Roadmap tradeoffs: Land reliability epics when budgets are at risk.
- Incident leadership: Lead SEV1/SEV2 incidents without drama; own comms, run blameless postmortems, and make corrective actions stick.
- Engineer reliability in: Multi-AZ/region patterns, PDBs/Pod Topology Spread, HPA/VPA/KEDA, resilient rollout/rollback.
- Azure Kubernetes Service (AKS): Harden clusters, optimize node/pod density, ingress (AGIC/Nginx), mesh optional.
- Observability: Metrics/traces/logs with Azure Monitor/App Insights, Log Analytics, Prometheus/Grafana, OpenTelemetry. Alert on symptoms, not noise.
- IaC & policy: Terraform/Bicep modules, GitOps (Flux/Argo), policy-as-code (Azure Policy/OPA Gatekeeper). No snowflakes.
- CI/CD reliability: Azure DevOps/GitHub Actions with canary/blue-green, progressive delivery, auto-rollback, Key Vault-backed secrets.
- Capacity & performance: Load testing, right-sizing, autoscaling; partner with FinOps to reduce spend without hurting SLOs.
- Disaster recovery: Define RTO/RPO, test backups/restore, run game days/chaos drills, validate ASR and multi-region failover.
- Security: Entra ID (Azure AD), managed identities, Key Vault rotation, VNets/NSGs/Private Link, shift-left checks in CI.
- Reduce toil: Automate recurring ops, build self-service runbooks/chatops, publish golden paths for product teams.
- Customer escalations: Be the technical owner on calls; communicate tradeoffs and recovery plans with authority.
- Documentation: Architectures, runbooks, postmortems, SLIs/SLOs—kept current and discoverable.
- Bachelor's degree: In CS/Engineering or equivalent experience.
- Production ops/platform/SRE experience: 12+ years, including 5+ years on Azure.
- PostgreSQL expertise: Include HA/DR, logical/physical replication, performance tuning, autovacuum strategy, partitioning, backup/restore testing, and connection pooling (pgBouncer).
- Azure core skills: AKS, Front Door/App Gateway, API Management, VNets/NSGs/Private Link, Storage, Key Vault, Redis, Service Bus/Event Hubs.
- Observability skills: Azure Monitor/App Insights, Log Analytics, Prometheus/Grafana.
- IaC/automation skills: Terraform and/or Bicep, PowerShell and Python, GitOps (Flux/Argo).
- Proven incident leadership: At scale, blameless postmortems, and SLO/error-budget governance with change gating.
- Mentorship and communication: Proven written/verbal communication.
- Apache NiFi/Apache Flink/Apache Kafka/Redpanda skills: Desired.
- Azure Solutions Architect Expert/Certified Kubernetes Administrator/CKAD certifications: Desired.
- ITSM/on-call tooling: ServiceNow, PagerDuty/Opsgenie.
- Compliance/SecOps skills: SOC 2, ISO 27001, policy-as-code, workload identity.
- OpenTelemetry/eBPF tooling/service mesh: Desired.
- Multi-tenant SaaS/cost optimization: At scale.
We offer a competitive salary and benefits package. If you're passionate about SRE and want to work with a talented team, apply today
-
Site Reliability Engineering Leader
7 days ago
Belgaum, Karnataka, India beBeesreleader Full time ₹ 25,00,000 - ₹ 40,00,000Job Title: Site Reliability Engineering LeaderWe are seeking an experienced SRE leader to drive the development of our site reliability function. This role offers the opportunity to shape the SRE team and contribute to the company's operational efficiency.
-
Site Reliability Engineer Manager
2 weeks ago
Belgaum, Karnataka, India beBeeSre Full time ₹ 1,50,00,000 - ₹ 2,00,00,000Job Title: SRE Lead (Engineering & Reliability)We are seeking a seasoned and accomplished Site Reliability Engineering (SRE) professional to spearhead the reliability, scalability, and performance of our core systems. As an SRE Lead, you will play a pivotal role in establishing and implementing SRE best practices, leading a team of engineers, and driving...
-
Reliable Financial Systems Engineer
5 days ago
Belgaum, Karnataka, India beBeeSite Full time US$ 1,50,000 - US$ 1,80,000Job OverviewThis role is focused on delivering stable and scalable financial applications and data services that meet demanding requirements for accuracy, compliance, and availability.The Site Reliability Engineer will play a critical role in ensuring the stability, scalability, and operational excellence of accounting platforms.As an SRE, you will build...
-
Reliability Engineering Team Lead
1 week ago
Belgaum, Karnataka, India beBeeEngineering Full time ₹ 2,50,00,000 - ₹ 3,00,00,000**Job Description**Develop a strategic approach to implement an "Automate-first" culture in service operations, enhancing efficiency and reducing toil.**Key Responsibilities**Design and implement monitoring strategies with the engineering team to ensure effective capabilities are in place.Promote a collaborative environment with teams to establish...
-
Reliability Expert Wanted
3 days ago
Belgaum, Karnataka, India beBeeSre Full time ₹ 12,00,000 - ₹ 25,00,000Reliability Engineer PositionWe are seeking a highly skilled Reliability Engineer to join our team.The ideal candidate will have strong experience in Site Reliability Engineering and DevOps skills, including Continuous Integration/Continuous Deployment, monitoring, automation, and infrastructure as code.Troubleshoot complex issues independently and persevere...
-
Senior Network Reliability Engineer
7 days ago
Belgaum, Karnataka, India beBeeReliability Full time ₹ 1,50,00,000 - ₹ 1,70,00,000Job Title: Senior Network Reliability EngineerAbout this roleThis is a challenging opportunity to work as a senior network reliability engineer in a compliance-driven environment. The primary focus is on securely designing, deploying, automating, and monitoring both traditional and cloud network infrastructure.Network Design & Deployment: Lead the secure...
-
Senior System Reliability Specialist
7 days ago
Belgaum, Karnataka, India beBeeSiteReliabilityEngineer Full time ₹ 40,00,000 - ₹ 50,00,000System Reliability EngineerWe are seeking a seasoned Site Reliability Engineer to play a pivotal role in the success of our organisation.The successful candidate will be responsible for defining, driving and implementing the SRE strategy, promoting an 'automate-first' culture in operating services by reducing toil.They will develop methodologies and...
-
Reliability Engineering Leadership Role
2 weeks ago
Belgaum, Karnataka, India beBeeSeniority Full time ₹ 20,00,000 - ₹ 30,00,000We are seeking an experienced Senior Site Reliability Engineer to drive business growth by empowering users with high-quality software solutions. This role requires a unique blend of technical expertise and leadership skills, enabling the candidate to design robust cloud infrastructure, automate deployment processes, and monitor performance metrics. Key...
-
Reliability and Efficiency Engineer
5 days ago
Belgaum, Karnataka, India beBeeReliability Full time ₹ 90,00,000 - ₹ 1,25,00,000Job Title: Reliability and Efficiency EngineerWe are seeking a skilled professional to play a critical role in ensuring the stability, scalability, and operational excellence of our platforms.About the RoleEnsure our platforms meet defined performance, reliability, and uptime standards.Build automation for deployments, monitoring, scaling, and self-healing...
-
Chief System Reliability Engineer
2 weeks ago
Belgaum, Karnataka, India beBeeReliability Full time ₹ 1,75,00,000 - ₹ 2,25,00,000Reliability Engineering LeadWe are seeking an experienced and dynamic reliability engineering leader to oversee the effectiveness, scalability, and performance of our critical systems.As a reliability engineering lead, you will play a pivotal role in establishing and implementing reliability practices, leading a team of engineers, and driving automation,...