
Senior Site Reliability Engineer
3 days ago
WHO WE ARE
Sapaad is a global leader in unified commerce platforms, delivering world-class software solutions for the food and beverage industry. Our flagship product, also named Sapaad, has achieved remarkable success over the past decade, empowering thousands of F&B businesses across 40+ countries, with many more coming onboard each day.

Driven by a passionate team of developers, designers, and product experts, Sapaad is constantly evolving, introducing innovative, industry-defining features that set the benchmark for F&B tech. Headquartered in Singapore, with offices across five countries, Sapaad is backed by seasoned technology veterans with deep expertise in web, mobility, and e-commerce.

JOB OVERVIEW
Sapaad Software Private Limited is seeking a Senior Site Reliability Engineer (SRE) to lead our infrastructure reliability efforts and mentor a growing SRE team.

This is a strategic, hands-on leadership position responsible for ensuring the reliability, scalability, and performance of our global cloud-based restaurant management platform serving thousands of customers worldwide.

As a senior member of our engineering organization, you will take ownership of system availability, drive automation initiatives, and establish SRE best practices across the company. You’ll work at the intersection of development and operations, embedding reliability into every layer of our technology stack while building and leading a team focused on operational excellence.

This role is ideal for an experienced SRE professional who is passionate about building resilient systems at scale, mentoring engineering talent, and shaping the reliability culture of a fast-growing SaaS organization.

WHAT YOU’LL DO
- Own the reliability, availability, and performance of all production systems supporting our multi-tenant SaaS platform.
- Define and manage SLIs, SLOs, and error budgets across critical services.
- Architect and implement highly available, fault-tolerant systems on AWS and Heroku.
- Proactively monitor and analyze performance to predict capacity needs and prevent issues.
- Lead incident management and postmortem processes, driving root cause analysis and preventive actions.
- Develop incident response playbooks, implement chaos engineering, and reduce MTTD and MTTR.
- Design and implement comprehensive observability solutions (monitoring, logging, and alerting) for microservices and distributed systems.
- Enforce security and compliance standards, including access controls, vulnerability management, and patching.
- Mentor and lead SRE and infrastructure engineers, driving team growth, knowledge sharing, and operational maturity.
- Collaborate with development, DevOps, and product teams to embed reliability practices into every stage of the software lifecycle.

YOU’RE A STRONG FIT IF YOU HAVE
- 5–8 years of experience in SRE, DevOps, or Systems Engineering roles within SaaS or cloud-based environments.
- 2+ years in a technical leadership or mentoring capacity.
- Proven experience maintaining large-scale, high-availability systems (99.9%+ uptime).
- Expertise with AWS (EC2, RDS, S3, ECS/EKS, Lambda) and Heroku.
- Proficiency in Infrastructure as Code (Terraform, CloudFormation) and containerization (Docker, Kubernetes).
- Strong scripting and automation skills in Python, Bash, or PowerShell.
- Experience with CI/CD pipelines (Jenkins, GitLab CI, GitHub Actions) and configuration management tools (Chef, Ansible, Puppet).
- Deep understanding of SRE principles: SLIs, SLOs, toil reduction, blameless postmortems, and incident management frameworks.
- Hands-on experience with monitoring tools (Prometheus, Grafana, Datadog, New Relic, CloudWatch, ELK).
- Excellent leadership, analytical, and communication skills with a customer-first mindset.

PREFERRED QUALIFICATIONS
- AWS Certified Solutions Architect – Associate or Professional certification.
- Experience with SOC 2, ISO 27001, GDPR, or PCI DSS compliance frameworks.
- Background in microservices architectures, disaster recovery planning, or cost optimization.
- Experience in the restaurant, hospitality, or retail technology sectors.
-
Senior Site Reliability Engineer
1 day ago
New Delhi, India Tata Consultancy Services Full time Dear Candidates, Greetings from TCS!!! TCS is looking for Senior Site Reliability Engineer – AWS Experience: 8-12 years Location: Chennai Must have skills: - Design, implement, and maintain scalable, secure, and highly available infrastructure on AWS - Develop and improve CI/CD pipelines, Infrastructure as Code (IaC) using Terraform, Harness - Own and implement...
-
Site Engineer
1 week ago
Delhi, Delhi, India Engineer Department Full time ₹ 6,00,000 - ₹ 12,00,000 per year Company Description: Engineer Department is a company dedicated to providing efficient and effective engineering solutions for public infrastructure and services. Our team is committed to ensuring the highest standards in project management and execution, serving the community with integrity and professionalism. Role Description: This is a full-time...
-
Site Engineer
3 weeks ago
Delhi, India Engineer Department Full time Company Description: Engineer Department is a company dedicated to providing efficient and effective engineering solutions for public infrastructure and services. Our team is committed to ensuring the highest standards in project management and execution, serving the community with integrity and professionalism. Role Description: This is a full-time...
-
Senior Site Reliability Engineer
4 days ago
Delhi, India Sapaad Full time WHO WE ARE: Sapaad is a global leader in unified commerce platforms, delivering world-class software solutions for the food and beverage industry. Our flagship product, also named Sapaad, has achieved remarkable success over the past decade, empowering thousands of F&B businesses across 40+ countries, with many more coming onboard each day. Driven by a...
-
Site Reliability Engineer
2 weeks ago
Delhi, India Elgebra Full time Hiring: Site Reliability Engineer – 7+ Years Location: Bangalore / Chennai Payroll: Elgebra Client: Qincline Joining: Immediate to 15 Days Role Overview: We are looking for an experienced Site Reliability Engineer (SRE) with over 6 years of expertise to join our team. The ideal candidate will have strong technical skills, a problem-solving mindset, and...
-
Senior Site Reliability Engineer
1 week ago
Delhi, NCR, New Delhi, Pune, India Ithena Full time ₹ 20,00,000 - ₹ 25,00,000 per year Senior Site Reliability Engineer (SRE) Backend Systems Location: Remote (India) Pune/Delhi/Delhi NCR/Mumbai Experience: 8+ years We're looking for a Senior SRE to join our backend team and help scale our real-time, event-driven platform. This role goes beyond traditional DevOps: we're seeking engineers who can write high-quality code, debug complex distributed...
-
Senior Site Reliability Engineer- ELK Expert
2 weeks ago
Delhi, India iVedha Inc. Full time Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering Practice Location: India (Remote) - Must be available to work in the EST (US/Canada) Time Zone. Role Summary: Are you a Senior Site Reliability Engineer (SRE) with deep ELK expertise, ready to take ownership of large-scale observability infrastructure? We're looking for an SRE with 7+...
-
Site Reliability Engineer
2 weeks ago
Delhi, India Concord Full time SRE Sr. Engineers (Individual Contributors) Key Attributes: - Strong SRE (Site Reliability Engineering) experience - DevOps skills – CI/CD, monitoring, automation, infrastructure as code, etc. - Excellent troubleshooting and debugging skills (infrastructure + application level) - Perseverance – must push through complex/challenging issues without giving up -...