Site Reliability Engineer

4 days ago


Chennai Mumbai, India Datum Software Full time ₹ 12,00,000 - ₹ 24,00,000 per year

Job Title: Site Reliability Engineer (SRE) Azure & AI

Work Mode: Hybrid

Notice Period: Immediate - 30 days

Location: Chennai, Mumbai, Gurugram.

Job Summary

We are looking for an experienced Site Reliability Engineer (SRE) with a strong background in Azure Cloud, AI infrastructure, and automation. The ideal candidate will have hands-on experience designing and maintaining reliable, secure, and scalable environments for AI workloads and enterprise applications. Youll collaborate with cross-functional teams to build robust CI/CD pipelines, automate deployments, and ensure seamless performance in production systems.

Key Responsibilities

  • Design, build, and maintain scalable, resilient cloud infrastructure on Microsoft Azure.
  • Automate provisioning and deployments using tools like Terraform, Argo, and Helm.
  • Manage and optimize Azure Kubernetes Service (AKS) clusters for AI and microservices workloads.
  • Support AI model hosting and serving (e.g., Hugging Face Transformers, vLLM, ) on Azure OpenAI, Azure VMs, or GPU environments.
  • Build and maintain CI/CD pipelines using GitHub Actions, integrated with JFrog Artifactory.
  • Monitor infrastructure health, reliability, and performance using Grafana and implement proactive measures.
  • Collaborate closely with software engineering teams to align infrastructure with application needs.
  • Ensure compliance with networking, data security, and infrastructure governance best practices.
  • Enhance caching and data performance using Redis and related tools.

Required Skills & Technologies

Microsoft Azure Cloud Services (including Azure OpenAI)

  • AI Model Hosting & Infrastructure Management
  • GitHub Actions / Azure DevOps (CI/CD)
  • Azure Kubernetes Service (AKS)
  • Argo, Helm, Terraform, Docker
  • JFrog Artifactory
  • Grafana (Monitoring & Observability)
  • Networking & Security
  • Redis (Caching & Data Layer Optimization)


  • Chennai, Tamil Nadu, India Elgebra Full time ₹ 6,00,000 - ₹ 18,00,000 per year

    Hiring: Site Reliability Engineer – 7+ YearsLocation: Bangalore / Chennai Payroll: Elgebra Client: Qincline Joining: Immediate to 15 DaysRole Overview:We are looking for an experienced Site Reliability Engineer (SRE) with over 6 years of expertise to join our team. The ideal candidate will have strong technical skills, a problem-solving mindset, and the...


  • Chennai, India Datum Technologies Group Full time

    Job Title: Site Reliability Engineer (SRE) – Azure & AIExperience: 7+ yearsWork Mode: HybridWork Location: Chennai/Mumbai/GurgaonJob Summary:We are looking for an experienced Site Reliability Engineer (SRE) with strong expertise in Microsoft Azure, AI infrastructure, and automation. The ideal candidate will have a solid background in managing cloud...


  • Chennai, India Datum Technologies Group Full time

    Job Title: Site Reliability Engineer (SRE) – Azure & AIExperience: 7+ yearsWork Mode: HybridWork Location: Chennai/Mumbai/GurgaonJob Summary:We are looking for an experienced Site Reliability Engineer (SRE) with strong expertise in Microsoft Azure, AI infrastructure, and automation. The ideal candidate will have a solid background in managing cloud...


  • Chennai, India Datum Technologies Group Full time

    Job Title: Site Reliability Engineer (SRE) – Azure & AIExperience: 7+ yearsWork Mode: HybridWork Location: Chennai/Mumbai/GurgaonJob Summary:We are looking for an experienced Site Reliability Engineer (SRE) with strong expertise in Microsoft Azure, AI infrastructure, and automation. The ideal candidate will have a solid background in managing cloud...


  • Chennai, India Zyoin Group Full time

    Description : MoneyForward is seeking a Site Reliability Engineer (SRE) to lead the reliability, scalability, and performance of our products. This role involves making critical technical decisions, collaborating with development and platform engineering teams, and ensuring that our systems remain resilient and scalable to support stable business...


  • chennai, India Tata Consultancy Services Full time

    Role: Site Reliability Engineer Experience: 4 to 7 Years Locations: Chennai/Pune/Kolkata


  • Chennai, India Tata Consultancy Services Full time

    Role: Site Reliability Engineer Experience: 4 to 7 Years Locations: Chennai/Pune/Kolkata


  • Chennai, India Tata Consultancy Services Full time

    Role: Site Reliability Engineer Experience: 4 to 7 Years Locations: Chennai/Pune/Kolkata


  • Chennai, India Datum Technologies Group Full time

    Job Title: Site Reliability Engineer (SRE) – Azure & AI Experience: 7+ years Work Mode: Hybrid Work Location: Chennai/Mumbai/Gurgaon Job Summary: We are looking for an experienced Site Reliability Engineer (SRE) with strong expertise in Microsoft Azure, AI infrastructure, and automation. The ideal candidate will have a solid background in managing cloud...


  • Chennai, India Elgebra Full time

    Role Overview :We are seeking a highly experienced and technically proficient Site Reliability Engineer (SRE) to join our team in support of our client, Qincline. The ideal candidate will have 7 or more years of dedicated experience in Site Reliability Engineering or a closely related discipline. This pivotal role requires a strong focus on ensuring the...