Site Reliability Engineer
4 days ago
Job Title: Site Reliability Engineer (SRE) Azure & AI
Work Mode: Hybrid
Notice Period: Immediate - 30 days
Location: Chennai, Mumbai, Gurugram.
Job Summary
We are looking for an experienced Site Reliability Engineer (SRE) with a strong background in Azure Cloud, AI infrastructure, and automation. The ideal candidate will have hands-on experience designing and maintaining reliable, secure, and scalable environments for AI workloads and enterprise applications. Youll collaborate with cross-functional teams to build robust CI/CD pipelines, automate deployments, and ensure seamless performance in production systems.
Key Responsibilities
- Design, build, and maintain scalable, resilient cloud infrastructure on Microsoft Azure.
- Automate provisioning and deployments using tools like Terraform, Argo, and Helm.
- Manage and optimize Azure Kubernetes Service (AKS) clusters for AI and microservices workloads.
- Support AI model hosting and serving (e.g., Hugging Face Transformers, vLLM, ) on Azure OpenAI, Azure VMs, or GPU environments.
- Build and maintain CI/CD pipelines using GitHub Actions, integrated with JFrog Artifactory.
- Monitor infrastructure health, reliability, and performance using Grafana and implement proactive measures.
- Collaborate closely with software engineering teams to align infrastructure with application needs.
- Ensure compliance with networking, data security, and infrastructure governance best practices.
- Enhance caching and data performance using Redis and related tools.
Required Skills & Technologies
Microsoft Azure Cloud Services (including Azure OpenAI)
- AI Model Hosting & Infrastructure Management
- GitHub Actions / Azure DevOps (CI/CD)
- Azure Kubernetes Service (AKS)
- Argo, Helm, Terraform, Docker
- JFrog Artifactory
- Grafana (Monitoring & Observability)
- Networking & Security
- Redis (Caching & Data Layer Optimization)
-
Site Reliability Engineer
1 week ago
Chennai, Tamil Nadu, India Elgebra Full time ₹ 6,00,000 - ₹ 18,00,000 per yearHiring: Site Reliability Engineer – 7+ YearsLocation: Bangalore / Chennai Payroll: Elgebra Client: Qincline Joining: Immediate to 15 DaysRole Overview:We are looking for an experienced Site Reliability Engineer (SRE) with over 6 years of expertise to join our team. The ideal candidate will have strong technical skills, a problem-solving mindset, and the...
-
Site Reliability Engineer
1 day ago
Chennai, India Datum Technologies Group Full timeJob Title: Site Reliability Engineer (SRE) – Azure & AIExperience: 7+ yearsWork Mode: HybridWork Location: Chennai/Mumbai/GurgaonJob Summary:We are looking for an experienced Site Reliability Engineer (SRE) with strong expertise in Microsoft Azure, AI infrastructure, and automation. The ideal candidate will have a solid background in managing cloud...
-
Site Reliability Engineer
1 day ago
Chennai, India Datum Technologies Group Full timeJob Title: Site Reliability Engineer (SRE) – Azure & AIExperience: 7+ yearsWork Mode: HybridWork Location: Chennai/Mumbai/GurgaonJob Summary:We are looking for an experienced Site Reliability Engineer (SRE) with strong expertise in Microsoft Azure, AI infrastructure, and automation. The ideal candidate will have a solid background in managing cloud...
-
Site Reliability Engineer
1 day ago
Chennai, India Datum Technologies Group Full timeJob Title: Site Reliability Engineer (SRE) – Azure & AIExperience: 7+ yearsWork Mode: HybridWork Location: Chennai/Mumbai/GurgaonJob Summary:We are looking for an experienced Site Reliability Engineer (SRE) with strong expertise in Microsoft Azure, AI infrastructure, and automation. The ideal candidate will have a solid background in managing cloud...
-
Site Reliability Engineer
7 days ago
Chennai, India Zyoin Group Full timeDescription : MoneyForward is seeking a Site Reliability Engineer (SRE) to lead the reliability, scalability, and performance of our products. This role involves making critical technical decisions, collaborating with development and platform engineering teams, and ensuring that our systems remain resilient and scalable to support stable business...
-
Site Reliability Engineer
1 week ago
chennai, India Tata Consultancy Services Full timeRole: Site Reliability Engineer Experience: 4 to 7 Years Locations: Chennai/Pune/Kolkata
-
Site Reliability Engineer
2 weeks ago
Chennai, India Tata Consultancy Services Full timeRole: Site Reliability Engineer Experience: 4 to 7 Years Locations: Chennai/Pune/Kolkata
-
Site Reliability Engineer
7 days ago
Chennai, India Tata Consultancy Services Full timeRole: Site Reliability Engineer Experience: 4 to 7 Years Locations: Chennai/Pune/Kolkata
-
Site Reliability Engineer
1 day ago
Chennai, India Datum Technologies Group Full timeJob Title: Site Reliability Engineer (SRE) – Azure & AI Experience: 7+ years Work Mode: Hybrid Work Location: Chennai/Mumbai/Gurgaon Job Summary: We are looking for an experienced Site Reliability Engineer (SRE) with strong expertise in Microsoft Azure, AI infrastructure, and automation. The ideal candidate will have a solid background in managing cloud...
-
Site Reliability Engineer
2 weeks ago
Chennai, India Elgebra Full timeRole Overview :We are seeking a highly experienced and technically proficient Site Reliability Engineer (SRE) to join our team in support of our client, Qincline. The ideal candidate will have 7 or more years of dedicated experience in Site Reliability Engineering or a closely related discipline. This pivotal role requires a strong focus on ensuring the...