
Sr. Site Reliability Engineer
1 day ago
About the Role
We are seeking a highly skilled Sr. Site Reliability Engineer (SRE) to lead the implementation, optimization, and management of our observability stack across cloud infrastructure. You will play a key role in ensuring the reliability, scalability, and performance of our platform, spanning microservices on Kubernetes/EC2 and mission-critical systems. This role requires strong problem-solving skills, an automation mindset, and a proactive approach to incident management.
Key Responsibilities
Design, implement, and manage monitoring, logging, and alerting systems across production and non-production environments.
Lead incident response, root cause analysis, and post-mortem practices for continuous improvement.
Define and implement disaster recovery strategies with regular testing.
Collaborate with development teams to define and track SLAs/SLOs for critical services.
Optimize AWS cloud infrastructure for cost efficiency, reliability, and scalability.
Build and maintain automation frameworks for deployment, scaling, and recovery using Terraform, GitLab CI/CD, and Kubernetes.
Administer Kubernetes clusters, troubleshoot performance bottlenecks, and ensure high availability.
Manage databases (PostgreSQL or similar), including replication and disaster recovery strategies.
Contribute to infrastructure security, compliance, and best practices.
Participate in the on-call rotation and handle high-priority incidents under pressure.
Required Skills & Experience
4+ years of experience in an SRE, DevOps, or similar role.
Strong hands-on experience with AWS services: EC2, EKS, RDS, Cognito, CloudWatch, etc.
Proven expertise in Kubernetes administration in production environments.
Proficiency in scripting and configuration management: Python, Bash, Chef (recipes, cookbooks), Ansible.
Strong knowledge of Infrastructure as Code (Terraform/CloudFormation).
Deep experience with observability tools: Prometheus, Grafana, ELK stack, distributed tracing.
Database administration experience with PostgreSQL or similar systems.
Understanding of network protocols, load balancing, and security best practices.
Experience with CI/CD pipelines and GitOps workflows.
Ability to handle multiple incidents and prioritize effectively under pressure.
Exposure to monitoring solutions such as Splunk, Datadog, and Dynatrace.
Preferred Qualifications
AWS Certified Solutions Architect or AWS DevOps Engineer certification.
Certified Kubernetes Administrator (CKA).
Why Join Us
Be part of a fast-growing HealthTech startup transforming healthcare technology.
Work with modern tools, cutting-edge infrastructure, and a collaborative team.
Opportunity to own end-to-end infrastructure reliability and automation.
Competitive salary and growth opportunities.
-
Sr. Site Reliability Engineer
1 day ago
Mohali, Punjab, India Wits Innovation Lab Full time ₹ 15,00,000 - ₹ 20,00,000 per year
About the Role
We are seeking a highly skilled Sr. Site Reliability Engineer (SRE) to lead the implementation, optimization, and management of our observability stack across cloud infrastructure. You will play a key role in ensuring the reliability, scalability, and performance of our platform, spanning microservices on Kubernetes/EC2 and mission-critical...
-
Sr. Site Reliability Engineer
1 day ago
Mohali, Punjab, India HRS Group Full time ₹ 1,04,000 - ₹ 1,30,878 per year
HRS as a Company
HRS, a pioneer in business travel, aims to elevate every stay through innovative technology. With over 50 years of experience, their digital platform, driven by ProcureTech, TravelTech, and FinTech, transforms how companies and travelers Stay, Work, and Pay.
ProcureTech digitally revolutionizes lodging procurement, connecting corporations and...
-
Senior Site Reliability Engineer
2 days ago
Mohali, Punjab, India Wits Innovation Lab Full time
Job Description: Sr. Site Reliability Engineer (SRE)
We are seeking an experienced and results-driven Sr. Site Reliability Engineer (SRE) to join our team. The SRE will be responsible for ensuring the reliability, scalability, performance, and observability of our infrastructure and services. This role requires strong expertise in cloud computing,...
-
Sr. Site Reliability Engineer
7 days ago
Mohali, Punjab, India HRS Group Full time US$ 90,000 - US$ 1,20,000 per year
HRS as a Company
HRS, a pioneer in business travel, aims to elevate every stay through innovative technology. With over 50 years of experience, their digital platform, driven by ProcureTech, TravelTech, and FinTech, transforms how companies and travelers Stay, Work, and Pay.
ProcureTech digitally revolutionizes lodging procurement, connecting corporations and...
-
Senior Site Reliability Engineer
1 day ago
Mohali, Punjab, India Wits Innovation Lab Full time ₹ 15,00,000 - ₹ 20,00,000 per year
Job Overview
The Sr. SRE will lead the implementation and management of the observability stack across cloud infrastructure, ensuring reliability, scalability, performance, and cost-efficiency. The role spans Kubernetes, AWS, automation, incident response, and platform reliability.
Key Responsibilities
Build and maintain monitoring, logging, and alerting...
-
Senior Site Reliability Engineer
2 days ago
Mohali, Punjab, India Wits Innovation Lab Full time
Job Overview
The Sr. SRE will lead the implementation and management of the observability stack across cloud infrastructure, ensuring reliability, scalability, performance, and cost-efficiency. The role spans Kubernetes, AWS, automation, incident response, and platform reliability.
Key Responsibilities
Build and maintain monitoring, logging, and...
-
Senior Site Reliability Engineer
2 days ago
Mohali, Punjab, India Wits Innovation Lab Full time
Site Reliability Engineer (SRE), Senior Role
Location: Mohali
Experience: 4+ years
We are looking for an experienced Site Reliability Engineer (SRE) to strengthen our cloud and infrastructure team. The role involves owning reliability, availability, and scalability of distributed platforms, while driving automation and observability best practices.
Key...
-
Site Reliability Engineer
2 days ago
Mohali, Punjab, India Wits Innovation Lab Full time
Key Responsibilities
Design, implement, and maintain comprehensive monitoring, logging, and alerting solutions across our production and other environments
Lead incident response and post-mortem analyses, establishing best practices for problem resolution
Design and implement disaster recovery strategies and ensure regular testing
Collaborate with...
-
Senior Site Reliability Engineer
1 day ago
Mohali, Punjab, India WITS INNOVATION LAB Full time ₹ 13,00,000 per year
Key Responsibilities
Design, implement, and maintain comprehensive monitoring, logging, and alerting solutions across our production and other environments
Lead incident response and post-mortem analyses, establishing best practices for problem resolution
Design and implement disaster recovery strategies and ensure regular testing
Collaborate with development teams...
-
Site Engineer
1 day ago
Mohali, Punjab, India One Point Realty Full time ₹ 8,00,000 - ₹ 12,00,000 per year
Responsibilities:
Manage site operations & civil works
Ensure building construction compliance
Oversee site planning & execution
Control labour resources
Monitor site conditions
Health insurance
Provident fund
Annual bonus