Site reliability engineer

3 weeks ago


Kolkata, India InstaService Full time

About Insta Service Insta Service is revolutionizing the home services industry through AI-driven technology, connecting customers with trusted professionals instantly. We’re growing fast across 23+ states and expanding nationwide — backed by strong traction, rapid adoption, and a mission to simplify how people get work done at home. We’re looking for a Senior Site Reliability Engineer (SRE) to join our core engineering team and scale our infrastructure to serve millions of users reliably. What You’ll Do Lead incident response , conduct root cause analysis , and ensure permanent preventive measures. Design and optimize CI/CD pipelines , automate deployments, and enforce release stability. Build and manage scalable infrastructure on AWS, GCP, or Azure using Terraform , Ansible , and Kubernetes . Continuously monitor system health with Prometheus , Grafana , ELK , and Cloud Watch . Conduct load and performance testing (k6, JMeter, Locust) and optimize systems for high-traffic events. Improve observability , reduce alert noise, and enhance signal clarity for faster debugging. Collaborate with developers and architects to ensure systems meet SLOs, SLIs, and SLAs . Develop automation scripts and tools in Python, Go, Node.js, or Shell to streamline operations. Manage distributed systems and message queues like Kafka or Rabbit MQ . Drive a culture of reliability, automation, and scalability across teams. What We’re Looking For 4–7 years of experience in SRE or Dev Ops roles (preferably in high-scale or e-commerce environments). Strong hands-on experience with Kubernetes , Docker , Terraform , Ansible , and CI/CD pipelines . Deep understanding of Linux systems , networking , and distributed architecture . Solid programming skills in Python , Go , or Node.js . Experience managing cloud platforms (AWS, GCP, or Azure). Proven track record of maintaining production uptime and optimizing system performance . Nice to Have Experience with observability stacks , distributed tracing , and incident automation . Familiarity with microservices and event-driven systems . Exposure to cost optimization and capacity planning in multi-cloud environments. Why Join Insta Service? Fast-growing startup reshaping a massive industry Work on high-scale systems and impactful technology Collaborative and innovation-driven team Competitive compensation and growth opportunities



  • Kolkata, West Bengal, India Qiskitq Technology Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Job Title: Senior Site Reliability Engineer (SRE) Datadog ObservabilityExperience Required: 7+ years overall in SRE and Infrastructure Operations with minimum 3+ years hands-on experience in DatadogLocation: remoteJob Summary:We are seeking an experienced Site Reliability Engineer (SRE) to lead end-to-end SRE implementation initiatives with a strong focus on...


  • Kolkata, India CloudHire Full time

    Job SummaryThe Technical Manager for Site Reliability Engineering (SRE) will lead a remote team of Site Reliability Engineers, ensuring operational excellence and fostering a high-performing team culture. Reporting to the US-based Director of Systems and Security, this role is responsible for overseeing day-to-day operations, technical mentorship, and...


  • Greater Kolkata Area, India Meazure Learning Full time ₹ 12,00,000 - ₹ 24,00,000 per year

    At Meazure Learning, we believe in transforming learning and assessment experiences to unlock human potential. As a global leader in online testing and exam services, we support credentialing, licensure, workforce education, and higher education through purpose-built solutions that are secure, accessible, and deeply human-centered. With a global footprint...


  • Kolkata, India CloudHire Full time

    Job SummaryThe Technical Manager for Site Reliability Engineering (SRE) will lead a remote team of Site Reliability Engineers, ensuring operational excellence and fostering a high-performing team culture. Reporting to the US-based Director of Systems and Security, this role is responsible for overseeing day-to-day operations, technical mentorship, and...


  • Kolkata, India CloudHire Full time

    Job Summary The Technical Manager for Site Reliability Engineering (SRE) will lead a remote team of Site Reliability Engineers, ensuring operational excellence and fostering a high-performing team culture. Reporting to the US-based Director of Systems and Security, this role is responsible for overseeing day-to-day operations, technical mentorship, and...


  • Kolkata, India CloudHire Full time

    Job Summary The Technical Manager for Site Reliability Engineering (SRE) will lead a remote team of Site Reliability Engineers, ensuring operational excellence and fostering a high-performing team culture. Reporting to the US-based Director of Systems and Security, this role is responsible for overseeing day-to-day operations, technical mentorship, and...


  • Kolkata, India CloudHire Full time

    Job SummaryThe Technical Manager for Site Reliability Engineering (SRE) will lead a remote team of Site Reliability Engineers, ensuring operational excellence and fostering a high-performing team culture. Reporting to the US-based Director of Systems and Security, this role is responsible for overseeing day-to-day operations, technical mentorship, and...


  • Greater Kolkata Area, India Atlassian Full time ₹ 1,20,000 - ₹ 2,60,000 per year

    OverviewWe are looking for a reliability expert who is passionate about scaling Cloud services to join our growing Site Reliability Engineering (SRE) teams. You are someone who is aware of current industry trends (particularly those related to reliability) and who values working with a diverse set of partners, who can articulate the business impact of a...


  • Kolkata, India Genpact Full time

    Job Description Ready to build the future with AI At Genpact, we don't just keep up with technology-we set the pace. AI and digital innovation are redefining industries, and we're leading the charge. Genpact's AI Gigafactory, our industry-first accelerator, is an example of how we're scaling advanced technology solutions to help global enterprises work...


  • Kolkata, India Genpact Full time

    Job Description Ready to build the future with AI At Genpact, we don't just keep up with technology-we set the pace. AI and digital innovation are redefining industries, and we're leading the charge. Genpact's AI Gigafactory, our industry-first accelerator, is an example of how we're scaling advanced technology solutions to help global enterprises work...