Site Reliability Engineer

4 weeks ago


New Delhi, India Datum Technologies Group Full time

Job Title: Site Reliability Engineer (SRE) – Azure & AI
Experience: 7+ years
Work Mode: Hybrid
Work Location: Chennai/Mumbai/Gurgaon

Job Summary:
We are looking for an experienced Site Reliability Engineer (SRE) with strong expertise in Microsoft Azure, AI infrastructure, and automation. The ideal candidate will have a solid background in managing cloud environments using GitHub/Azure DevOps, and hands-on experience in AI model deployment and scaling. This role involves working closely with engineering teams to deliver reliable, secure, and scalable cloud infrastructure that supports AI workloads and enterprise applications.

Key Responsibilities:
- Design, build, and maintain scalable cloud infrastructure on Microsoft Azure.
- Automate infrastructure provisioning and deployment using Terraform, Argo, and Helm.
- Manage and optimize Azure Kubernetes Service (AKS) for AI and microservices workloads.
- Support AI model hosting using frameworks such as Huggingface Transformers, vLLM, or Llama.cpp on Azure OpenAI, VMs, or GPUs.
- Implement CI/CD pipelines using GitHub Actions and integrate with JFrog Artifactory.
- Monitor and maintain system performance and reliability using Grafana, ensuring proactive issue resolution.
- Collaborate with development teams to align infrastructure with application requirements.
- Enforce networking and information security best practices.
- Manage and optimize caching and data layer performance using Redis.

Required Skills & Technologies:
- Azure Cloud Services (including Azure OpenAI)
- AI Model Hosting & Infrastructure
- GitHub (CI/CD, workflows)
- Azure Kubernetes Service (AKS)
- Argo, Helm, Terraform
- Docker, JFrog, Grafana
- Networking & Security, Redis



  • New Delhi, India Tata Consultancy Services Full time

    TCS has been a great pioneer in feeding the fire of young techies like you. We are a global leader in the technology arena and there’s nothing that can stop us from growing together. What we are looking for: Role: Site Reliability Engineering (SRE). Experience Range: 5 – 15 Years. Location: Chennai/Pune. Candidates should come to office for Walk in Drive (Face to...


  • New Delhi, India Relanto Full time

    We’re Hiring: Site Reliability Engineer (4+ Years Experience). Location: Bangalore (WFO only - 5 days). We are looking for a passionate Site Reliability Engineer (SRE) to join our growing team! If you love building reliable, scalable systems and enjoy solving complex problems, this role is for you. What You’ll Do: Ensure high availability and performance of...


  • New Delhi, India Datum Technologies Group Full time

    Job Title: Site Reliability Engineer (SRE) – AWS. Experience: 8+ years. Location: Chennai / Mumbai. Work Mode: Hybrid. Key Skills: AWS, Terraform, Kubernetes, Docker, Grafana, Prometheus, Datadog. Job Summary: We are looking for a skilled Site Reliability Engineer (SRE) with strong AWS experience and a solid background in DevOps, automation, observability, and...


  • New Delhi, India Enterprise Minds, Inc Full time

    Senior Site Reliability Engineer (GCP | Terraform | Ansible | SRE | On-Call). We are looking for a high-impact Site Reliability Engineer (SRE) who will play a key role in ensuring the reliability, availability, and scalability of our production systems on Google Cloud Platform (GCP). If you thrive in fast-paced environments, excel in incident management, and...


  • New Delhi, India Grootan Technologies Full time

    About the Role: We are seeking a skilled Site Reliability Engineer (SRE) with 4–5 years of hands-on experience to join our engineering team. In this role, you will be responsible for building and maintaining reliable, scalable, and secure infrastructure to support our applications. You will leverage your expertise in automation, cloud platforms, and...


  • New Delhi, India Insight Global Full time

    Company: Insight Global Duration: Approved for 1 year Location: Remote (India) Type: Contract with Insight Global Client Compensation: 14 LPA – 20 LPA Working Hours: Normal IST hours Start Date: Immediate (No notice period). About the Role: Join our Site Reliability Engineering (SRE) team as a Fullstack Developer, focused on building and maintaining highly reliable,...