
Highly Skilled Site Reliability Engineer
2 weeks ago
Reliability Expert Wanted
Job DescriptionSenior site reliability engineers are responsible for ensuring the availability, latency, performance, and efficiency of software applications. This includes defining and enforcing reliability standards, leading high-impact projects, mentoring engineers, and eliminating toil at scale.
Key responsibilities include:
- Define SLIs/SLOs: Create customer-centric service level indicators (SLIs) and service level objectives (SLOs) for tier-0/tier-1 services.
- Error Budgeting: Implement error budgeting policies with multi-window and multi-burn-rate alerts, clear runbooks, and paging thresholds.
- Run the Error-Budget Policy: Freeze or relax rules based on budget status and integrate them into CI/CD pipelines.
- Maintain SLO/EB Dashboards: Develop and maintain dashboards in Azure Monitor, Grafana/Prometheus, and App Insights to track SLOs and error budgets.
- Drive Roadmap Tradeoffs: Collaborate with engineering and product teams to make informed decisions when error budgets are at risk.
- Incident Response: Lead SEV1/SEV2 incidents without drama, own communications, and facilitate blameless postmortems.
- Engineer Reliability: Apply reliability principles to design and implement resilient systems, including multi-AZ/region patterns, PDBs/Pod Topology Spread, HPA/VPA/KEDA, and AKS at scale.
- Production Experience: 12+ years of experience in production operations, platform engineering, or site reliability engineering.
- Azure Expertise: Deep knowledge of Azure core services, including AKS, Azure Database for PostgreSQL – Flexible Server, Front Door/App Gateway, API Management, VNets/NSGs/Private Link, Storage, Key Vault, Redis, Service Bus/Event Hubs.
- Observability: Strong understanding of observability tools, including Azure Monitor/App Insights, Log Analytics, Prometheus/Grafana, and OpenTelemetry.
- IaC/Automation: Experience with infrastructure as code (IaC) tools like Terraform and/or Bicep, automation frameworks like PowerShell and Python, and GitOps practices.
- Leadership: Proven incident leadership at scale, ability to facilitate blameless postmortems, and experience with change gating.
- Mentorship: Excellent written and verbal communication skills, with a focus on mentorship and team collaboration.
- Advanced Technical Skills: Knowledge of advanced technical skills, including Apache NiFi, Apache Flink, Apache Kafka or Redpanda, schema management, exactly-once semantics, backpressure, dead-letter/replay patterns.
- Certifications: Azure Solutions Architect Expert, CKA/CKAD certifications.
- ITSM Tooling: Experience with IT service management (ITSM) tooling, including ServiceNow, on-call tooling like PagerDuty/Opsgenie.
-
Morādābād, Uttar Pradesh, India beBeeSRE Full time US$ 10,00,000 - US$ 12,00,000Job Overview:The Site Reliability Engineering team at Cvent is responsible for ensuring the reliability and performance of our products. We are looking for a Senior SRE to join our Observability team.About this role:Ensure the availability, scalability, and performance of our systemsDevelop and maintain monitoring, logging, and alerting toolsCollaborate with...
-
Site Reliability Engineer
2 weeks ago
Morādābād, Uttar Pradesh, India HireAlpha Full timeJob Description We are looking for an engineer to focus on Developer Experience and who can help us design, build, and maintain high-performance, scalable, and reliable services. As Company provides a Contact Center service, we play a very critical role in our Customer's business operations and therefore need to provide a highly available and fault tolerant...
-
Site Reliability Engineer
2 weeks ago
Morādābād, Uttar Pradesh, India beBeeReliability Full time ₹ 17,50,000 - ₹ 21,50,000About the JobThe Senior Site Reliability Engineer will play a critical role in ensuring the stability, scalability, and operational excellence of financial systems.Key Responsibilities:Ensure financial platforms meet defined performance, reliability, and uptime standards.Build automation for deployments, monitoring, scaling, and self-healing capabilities to...
-
Site Reliability Engineer
2 weeks ago
Morādābād, Uttar Pradesh, India beBeeObservability Full time ₹ 12,99,133 - ₹ 1,99,93,462Observability Engineer Job OpportunityWe are seeking a skilled Observability Engineer to join our team. In this role, you will be responsible for building and maintaining our Observability platform.Job Description:As an Observability Engineer, you will be working closely with our performance team, data ingestion team, DevOps team, and data visualization team...
-
Morādābād, Uttar Pradesh, India beBeeReliability Full time ₹ 2,00,00,000 - ₹ 3,00,00,000We are seeking a seasoned reliability engineer to oversee the design and development of service level agreements, objectives, and indicators within our business unit.The ideal candidate will possess extensive experience in reliability engineering, handling high-traffic production systems independently, troubleshooting middleware and infrastructure,...
-
Highly Skilled Cloud Operations Engineer
2 weeks ago
Morādābād, Uttar Pradesh, India beBeeKubernetes Full time ₹ 1,50,00,000 - ₹ 2,50,00,000Job DescriptionWe are seeking a skilled Cloud Operations Specialist with a strong Site Reliability Engineering (SRE) mindset to join our team. This role will be critical in ensuring availability, reliability, and performance of our cloud-based platform services and applications, particularly those supporting Radio Access Network (RAN) and Core Network...
-
Reliable Financial System Specialist
2 weeks ago
Morādābād, Uttar Pradesh, India beBeeAutomation Full time ₹ 2,00,00,000 - ₹ 2,50,00,000Site Reliability EngineerWe are seeking a skilled Site Reliability Engineer to join our team.The ideal candidate will have experience in ensuring the stability and operational excellence of financial platforms, building automation, implementing monitoring, improving incident response, and championing DevOps practices.This role focuses on delivering highly...
-
Reliable Infrastructure Expert
2 weeks ago
Morādābād, Uttar Pradesh, India beBeeSeniorReliability Full time ₹ 1,80,00,000 - ₹ 2,25,00,000Highly Experienced Senior Site Reliability Engineer Needed for Terraform Scripts and Infrastructure ManagementMaintain, enhance and automate Terraform scripts to ensure seamless infrastructure operations.Collaborate with teams to implement robust environment setup, deployments and key rotation practices using AWS IAM, RSA key management and 1Password...
-
Reliable System Specialist
2 weeks ago
Morādābād, Uttar Pradesh, India beBeeSystem Full time US$ 1,00,000 - US$ 1,25,000Site Reliability EngineerWe are looking for a skilled Site Reliability Engineer to join our team. The ideal candidate will have a strong background in operations, DevOps, or software engineering.Key Responsibilities:Engineer reliability by identifying potential system issues early and implementing preventive measures to boost system resilience.Automate tasks...
-
Full-Time On-Site Site Engineer Opportunity
2 weeks ago
Morādābād, Uttar Pradesh, India beBeeCivil Full time ₹ 8,00,000 - ₹ 12,00,000Site Engineer Job DescriptionWe are seeking a skilled Site Engineer to join our team. As a Site Engineer, you will be responsible for ensuring the successful delivery of projects.This full-time role is an exciting opportunity to work on-site and contribute to innovative projects.The ideal candidate will have:A Diploma/Degree in Civil Engineering or...