Site Reliability Engineer

23 hours ago


Chennai, India Datum Technologies Group Full time

Job Description

Job Title: Site Reliability Engineer (SRE) – Azure & AI
Experience: 7+ years
Work Mode: Hybrid
Work Location: Chennai/Mumbai/Gurgaon

Job Summary:
We are looking for an experienced Site Reliability Engineer (SRE) with strong expertise in Microsoft Azure, AI infrastructure, and automation. The ideal candidate will have a solid background in managing cloud environments using GitHub/Azure DevOps, and hands-on experience in AI model deployment and scaling. This role involves working closely with engineering teams to deliver reliable, secure, and scalable cloud infrastructure that supports AI workloads and enterprise applications.

Key Responsibilities:
- Design, build, and maintain scalable cloud infrastructure on Microsoft Azure.
- Automate infrastructure provisioning and deployment using Terraform, Argo, and Helm.
- Manage and optimize Azure Kubernetes Service (AKS) for AI and microservices workloads.
- Support AI model hosting using frameworks such as Hugging Face Transformers, vLLM, or llama.cpp on Azure OpenAI, VMs, or GPUs.
- Implement CI/CD pipelines using GitHub Actions and integrate with JFrog Artifactory.
- Monitor and maintain system performance and reliability using Grafana, ensuring proactive issue resolution.
- Collaborate with development teams to align infrastructure with application requirements.
- Enforce networking and information security best practices.
- Manage and optimize caching and data layer performance using Redis.

Required Skills & Technologies:
- Azure Cloud Services (including Azure OpenAI)
- AI Model Hosting & Infrastructure
- GitHub (CI/CD, workflows)
- Azure Kubernetes Service (AKS)
- Argo, Helm, Terraform
- Docker, JFrog, Grafana
- Networking & Security, Redis



  • Chennai, India Ford Motor Company Full time

    Job Description: Ford is seeking an experienced Site Reliability Engineer (SRE) to join our team and lead the development, enhancement, and extension of our global monitoring and observability platform. Enterprise Technology plays a critical part in shaping the future of mobility. If you're looking for the chance to leverage...


  • Chennai, India Siemens Full time

    Job Description: Dear Aspirant! We empower our people to stay resilient and relevant in a constantly changing world. We're looking for people who are always searching for creative ways to grow and learn. People who want to make a real impact, now and in the future. Does that sound like you? Then it seems like you'd make a great addition to our vibrant...


  • India Grootan Technologies Full time

    About the Role We are seeking a skilled Site Reliability Engineer (SRE) with 4–5 years of hands-on experience to join our engineering team. In this role, you will be responsible for building and maintaining reliable, scalable, and secure infrastructure to support our applications. You will leverage your expertise in automation, cloud platforms, and...


  • India Datum Technologies Group Full time

    Job Title: Site Reliability Engineer (SRE) – AWS Experience: 8+ years Location: Chennai / Mumbai Work Mode: Hybrid Key Skills: AWS, Terraform, Kubernetes, Docker, Grafana, Prometheus, Datadog Job Summary: We are looking for a skilled Site Reliability Engineer (SRE) with strong AWS experience and a solid background in DevOps, automation, observability, and...


  • Chennai, India Datum Technologies Group Full time

    Job Title: Site Reliability Engineer (SRE) – Azure & AI Experience: 7+ years Work Mode: Hybrid Work Location: Chennai/Mumbai/Gurgaon Job Summary: We are looking for an experienced Site Reliability Engineer (SRE) with strong expertise in Microsoft Azure, AI infrastructure, and automation. The ideal candidate will have a solid background in managing cloud...


  • Chennai, India Zyoin Group Full time

    Description: MoneyForward is seeking a Site Reliability Engineer (SRE) to lead the reliability, scalability, and performance of our products. This role involves making critical technical decisions, collaborating with development and platform engineering teams, and ensuring that our systems remain resilient and scalable to support stable business...


  • Chennai, India Tata Consultancy Services Full time

    Role: Site Reliability Engineer Experience: 4 to 7 Years Locations: Chennai/Pune/Kolkata

