Site Reliability Engineer
2 days ago
About InstaService InstaService is revolutionizing the home services industry through AI-driven technology, connecting customers with trusted professionals instantly. We’re growing fast across 23+ states and expanding nationwide — backed by strong traction, rapid adoption, and a mission to simplify how people get work done at home. We’re looking for a Senior Site Reliability Engineer (SRE) to join our core engineering team and scale our infrastructure to serve millions of users reliably. What You’ll Do Lead incident response , conduct root cause analysis , and ensure permanent preventive measures. Design and optimize CI/CD pipelines , automate deployments, and enforce release stability. Build and manage scalable infrastructure on AWS, GCP, or Azure using Terraform , Ansible , and Kubernetes . Continuously monitor system health with Prometheus , Grafana , ELK , and CloudWatch . Conduct load and performance testing (k6, JMeter, Locust) and optimize systems for high-traffic events. Improve observability , reduce alert noise, and enhance signal clarity for faster debugging. Collaborate with developers and architects to ensure systems meet SLOs, SLIs, and SLAs . Develop automation scripts and tools in Python, Go, Node.js, or Shell to streamline operations. Manage distributed systems and message queues like Kafka or RabbitMQ . Drive a culture of reliability, automation, and scalability across teams. What We’re Looking For 4–7 years of experience in SRE or DevOps roles (preferably in high-scale or e-commerce environments). Strong hands-on experience with Kubernetes , Docker , Terraform , Ansible , and CI/CD pipelines . Deep understanding of Linux systems , networking , and distributed architecture . Solid programming skills in Python , Go , or Node.js . Experience managing cloud platforms (AWS, GCP, or Azure). Proven track record of maintaining production uptime and optimizing system performance . Nice to Have Experience with observability stacks , distributed tracing , and incident automation . Familiarity with microservices and event-driven systems . Exposure to cost optimization and capacity planning in multi-cloud environments. Why Join InstaService? Fast-growing startup reshaping a massive industry Work on high-scale systems and impactful technology Collaborative and innovation-driven team Competitive compensation and growth opportunities
-
Site Reliability Engineer
2 weeks ago
Bengaluru, India Relanto Full timeJob Description Job Title: Site Reliability Engineer Summary We are looking for a Site Reliability Engineer to join our Digital & Transformation department. The ideal candidate will have 2-3 years of experience in this field and will be responsible for ensuring the reliability, availability, and performance of our systems and applications. Roles And...
-
Site Reliability Engineer
2 days ago
India InOrg Full time ₹ 12,00,000 - ₹ 36,00,000 per yearAbout VivaOps :VivaOps is a leading DevSecOps platform company specializing in GitLab - The comprehensive DevOps platform, to transform and secure software development processes. We help organizations to streamline their DevSecOps journey by offering a complete range of GitLab services, from advisory, to implementation and managed services, to accelerate...
-
Site Reliability Engineer
4 weeks ago
, India, IN Sonata Software Full timeWe're Hiring: Senior Site Reliability Engineer Location: Onsite (Office: Hyderabad – Mandatory from Day 1) Employment Type: Full-time Notice Period: Immediate to 15 Days Only Experience: 8+ Years About the RoleWe’re looking for a Senior Site Reliability Engineer (SRE) to lead reliability initiatives across our production systems. This is a high-impact...
-
Site Reliability Engineer
1 week ago
India Akamai Technologies Full timeJob Description Job Description Do you like collaborating across teams to solve complex problems Do you enjoy solving large scale distributed content delivery challenges Join our highly skilled Compute Site Reliability team Our team designs, develops, and manages applications and infrastructure that support Akamai's Compute products and services. We...
-
Site Reliability Engineer
6 days ago
India Akamai Full time ₹ 8,00,000 - ₹ 24,00,000 per yearDo you like collaborating across teams to solve complex problems?Do you enjoy solving large scale distributed content delivery challenges?Join our highly skilled Compute Site Reliability teamOur team designs, develops, and manages applications and infrastructure that support Akamai's Compute products and services. We specialize in creating solutions that...
-
Site Reliability Engineer
4 days ago
India LivePerson Full time ₹ 9,00,000 - ₹ 12,00,000 per yearLivePerson (NASDAQ: LPSN) is a leading customer engagement company, creating digital experiences powered by Curiously Human AI. Every person is unique, and our technology makes it possible for companies, including leading brands like HSBC, Orange, and GM Financial, to treat their audiences that way at scale. Nearly a billion conversational interactions are...
-
Site Reliability Engineer
4 days ago
India LivePerson Full time ₹ 12,00,000 - ₹ 36,00,000 per yearLivePerson (NASDAQ: LPSN) is a leading customer engagement company, creating digital experiences powered by Curiously Human AI. Every person is unique, and our technology makes it possible for companies, including leading brands like HSBC, Orange, and GM Financial, to treat their audiences that way at scale. Nearly a billion conversational interactions are...
-
Site Reliability Engineer
2 weeks ago
India CareerUS Solutions Full timeJob Description Position Overview: The Site Reliability Engineer (SRE) is responsible for ensuring the stability, scalability, performance, and reliability of production systems and services. This role bridges software development and operations, using automation, monitoring, and performance optimization to build resilient systems that can scale efficiently...
-
Site Reliability Engineer
4 weeks ago
Bengaluru, Karnataka, India, Karnataka WhiteLotus Talent Partners Full timeWe are looking for a L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes. In this role, you will focus on monitoring, basic troubleshooting, and incident response, helping to maintain high system...
-
Site Reliability Engineer
4 days ago
India CitNOW Group Full timeAbout us Founded in 2008, CitNOW is an innovative, enterprise-level software product suite that allows automotive dealerships globally to sell more vehicles and parts more profitably. CitNOW’s app-based platform provides a secure, brand-compliant solution – for dealers to build trust, transparency and long-lasting relationships. CitNOW Group was formed...