Site Reliability Engineer
3 weeks ago
About InstaService
InstaService is revolutionizing the home services industry through AI-driven technology, connecting customers with trusted professionals instantly. We're growing fast across 23+ states and expanding nationwide, backed by strong traction, rapid adoption, and a mission to simplify how people get work done at home. We're looking for a Senior Site Reliability Engineer (SRE) to join our core engineering team and scale our infrastructure to serve millions of users reliably.

What You'll Do
Lead incident response, conduct root cause analysis, and implement permanent preventive measures.
Design and optimize CI/CD pipelines, automate deployments, and enforce release stability.
Build and manage scalable infrastructure on AWS, GCP, or Azure using Terraform, Ansible, and Kubernetes.
Continuously monitor system health with Prometheus, Grafana, ELK, and CloudWatch.
Conduct load and performance testing (k6, JMeter, Locust) and optimize systems for high-traffic events.
Improve observability, reduce alert noise, and enhance signal clarity for faster debugging.
Collaborate with developers and architects to ensure systems meet SLOs, SLIs, and SLAs.
Develop automation scripts and tools in Python, Go, Node.js, or Shell to streamline operations.
Manage distributed systems and message queues like Kafka or RabbitMQ.
Drive a culture of reliability, automation, and scalability across teams.

What We're Looking For
4–7 years of experience in SRE or DevOps roles (preferably in high-scale or e-commerce environments).
Strong hands-on experience with Kubernetes, Docker, Terraform, Ansible, and CI/CD pipelines.
Deep understanding of Linux systems, networking, and distributed architecture.
Solid programming skills in Python, Go, or Node.js.
Experience managing cloud platforms (AWS, GCP, or Azure).
Proven track record of maintaining production uptime and optimizing system performance.

Nice to Have
Experience with observability stacks, distributed tracing, and incident automation.
Familiarity with microservices and event-driven systems.
Exposure to cost optimization and capacity planning in multi-cloud environments.

Why Join InstaService?
Fast-growing startup reshaping a massive industry
Work on high-scale systems and impactful technology
Collaborative and innovation-driven team
Competitive compensation and growth opportunities
-
Site Reliability Engineer
4 days ago
India Pagos Consultants Full time
We are looking for experienced site reliability engineers to join a founding team of startup-minded individuals that will lay the groundwork for our new fintech offering. This team will play a pivotal role in spearheading innovation. As such, you will have the opportunity to shape the early architecture and design of the system and set the trajectory for its...
-
Site Reliability Engineer
2 weeks ago
India Datum Technologies Group Full time
Job Title: Site Reliability Engineer (SRE) – AWS
Experience: 8+ years
Location: Chennai / Mumbai
Work Mode: Hybrid
Key Skills: AWS, Terraform, Kubernetes, Docker, Grafana, Prometheus, Datadog
Job Summary: We are looking for a skilled Site Reliability Engineer (SRE) with strong AWS experience and a solid background in DevOps, automation, observability, and...
-
Site Reliability Engineer
1 day ago
India InOrg Full time ₹ 12,00,000 - ₹ 36,00,000 per year
About VivaOps: VivaOps is a leading DevSecOps platform company specializing in GitLab, the comprehensive DevOps platform, to transform and secure software development processes. We help organizations streamline their DevSecOps journey by offering a complete range of GitLab services, from advisory to implementation and managed services, to accelerate...
-
Site Reliability Engineer
5 days ago
India Jobgether Full time ₹ 10,00,000 - ₹ 12,00,000 per year
This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Site Reliability Engineer (India 3rd Shift) in India. This role is ideal for an engineer who thrives in high-availability, mission-critical environments and enjoys ensuring systems operate reliably at scale. As a Site Reliability Engineer, you will work during...
-
Site Reliability Engineer
5 days ago
India Akamai Full time ₹ 8,00,000 - ₹ 24,00,000 per year
Do you like collaborating across teams to solve complex problems? Do you enjoy solving large-scale distributed content delivery challenges?
Join our highly skilled Compute Site Reliability team
Our team designs, develops, and manages applications and infrastructure that support Akamai's Compute products and services. We specialize in creating solutions that...
-
Site Reliability Engineer
1 week ago
Chennai, India Datum Technologies Group Full time
Job Description
Job Title: Site Reliability Engineer (SRE) Azure & AI
Experience: 7+ years
Work Mode: Hybrid
Work Location: Chennai/Mumbai/Gurgaon
Job Summary: We are looking for an experienced Site Reliability Engineer (SRE) with strong expertise in Microsoft Azure, AI infrastructure, and automation. The ideal candidate will have a solid background in...
-
Site Reliability Engineer
6 days ago
Chennai, India Datum Technologies Group Full time
Job Description
Job Title: Site Reliability Engineer (SRE) AWS
Experience: 8+ years
Location: Chennai / Mumbai
Work Mode: Hybrid
Key Skills: AWS, Terraform, Kubernetes, Docker, Grafana, Prometheus, Datadog
Job Summary: We are looking for a skilled Site Reliability Engineer (SRE) with strong AWS experience and a solid background in DevOps, automation,...