Site Reliability Engineer

7 days ago


Bengaluru, Karnataka, India | Exotel Techcom | Full time | ₹ 12,00,000 - ₹ 24,00,000 per year

About Us

Exotel is one of Asia's largest customer communication platforms. We are on a mission to move enterprise customer communication to the cloud. In 2020, we powered over 4 billion calls and connected over 320 million people. We work with some of the most innovative companies, such as Ola, Swiggy, Zerodha, Whitehat Jr, Practo, Flipkart, and GoJek. We also power customer communication for some of the top banks in the country. Join us on this journey to improve how companies look at customer communication.

 
About the Role

The Site Reliability Engineer (SRE) team at Exotel ensures that our large-scale, distributed production systems are reliable, scalable, and efficient. As an SRE, you will own uptime, monitoring, and incident response while driving automation to minimise manual work. You will be the bridge between infrastructure and engineering teams to ensure new services are production-ready.

If you're passionate about building reliable systems, automating away repetitive tasks, and solving complex production challenges, this is the role for you.

What You'll Do
  • Manage and support production-grade infrastructure across cloud and data centers.

  • Take ownership of monitoring and troubleshooting production systems (on-call or shift-based support).

  • Dive deep into Linux system internals and networking to debug production issues.

  • Build and improve observability stacks using Prometheus, Grafana, ELK/EFK, or equivalent.

  • Develop and maintain automation scripts/tools (Python, Bash, or similar).

  • Work with CI/CD tools (Jenkins, GitHub Actions, GitLab CI) to support reliable deployments.

  • Drive incident management, root cause analysis (RCA), and long-term fixes.

  • Partner with developers to ensure new features/services are production-ready (monitoring, logging, failover strategies).

  • Continuously improve system availability, reliability, and performance through automation and process improvements.

 
What We're Looking For
 
Must-Haves
  • 7+ years of hands-on experience managing production systems at scale.

  • Strong proficiency in Linux system administration, internals, and networking.

  • Proven experience in monitoring & troubleshooting production systems.

  • Hands-on experience with monitoring/alerting/logging tools (Prometheus, Grafana, ELK, Nagios, etc.).

  • Proficiency in at least one scripting language (Python, Bash, Go, etc.).

 
Good-to-Haves
  • Experience with CI/CD and deployment automation (Jenkins, GitHub Actions, Ansible, Terraform, etc.).

  • Demonstrated ability to automate operational tasks to reduce MTTR.

  • Exposure to cloud platforms like AWS (VPC, EC2, RDS, CloudWatch, IAM).

  • Strong debugging and root cause analysis skills in complex, distributed environments.

Mindset We Value
  • You don't just fix problems — you build systems to prevent them.

  • You believe monitoring + automation = reliability at scale.

  • You thrive in high-availability, high-scale environments.

  • You have an SRE mindset: you own what you set up, and you engineer for reliability.

Why Exotel?
  • Opportunity to work at scale — billions of calls, millions of users.

  • Be part of a team that deeply values automation, ownership, and reliability.

  • Work with cutting-edge tech and solve complex reliability challenges in real-world production systems.

  • Collaborative, fast-paced, and impact-driven environment.


