Site Reliability Engineer
4 weeks ago
Job Title: Site Reliability Engineer (SRE) – Azure & AI
Experience: 7+ years
Work Mode: Hybrid
Work Location: Chennai/Mumbai/Gurgaon

Job Summary:
We are looking for an experienced Site Reliability Engineer (SRE) with strong expertise in Microsoft Azure, AI infrastructure, and automation. The ideal candidate will have a solid background in managing cloud environments using GitHub/Azure DevOps, and hands-on experience in AI model deployment and scaling. This role involves working closely with engineering teams to deliver reliable, secure, and scalable cloud infrastructure that supports AI workloads and enterprise applications.

Key Responsibilities:
- Design, build, and maintain scalable cloud infrastructure on Microsoft Azure.
- Automate infrastructure provisioning and deployment using Terraform, Argo, and Helm.
- Manage and optimize Azure Kubernetes Service (AKS) for AI and microservices workloads.
- Support AI model hosting using frameworks such as Huggingface Transformers, vLLM, or Llama.cpp on Azure OpenAI, VMs, or GPUs.
- Implement CI/CD pipelines using GitHub Actions and integrate with JFrog Artifactory.
- Monitor and maintain system performance and reliability using Grafana, ensuring proactive issue resolution.
- Collaborate with development teams to align infrastructure with application requirements.
- Enforce networking and information security best practices.
- Manage and optimize caching and data layer performance using Redis.

Required Skills & Technologies:
- Azure Cloud Services (including Azure OpenAI)
- AI Model Hosting & Infrastructure
- GitHub (CI/CD, workflows)
- Azure Kubernetes Service (AKS)
- Argo, Helm, Terraform
- Docker, JFrog, Grafana
- Networking & Security, Redis
-
Site Reliability Engineer
7 days ago
New Delhi, India Tata Consultancy Services Full time
TCS has been a great pioneer in feeding the fire of young techies like you. We are a global leader in the technology arena and there’s nothing that can stop us from growing together.
What we are looking for
Role: Site Reliability Engineering (SRE)
Experience Range: 5 – 15 Years
Location: Chennai/Pune
Candidates should come to the office for the walk-in drive (Face to...
-
Site Reliability Engineer
4 days ago
New Delhi, India Tata Consultancy Services Full time
TCS has been a great pioneer in feeding the fire of young techies like you. We are a global leader in the technology arena and there’s nothing that can stop us from growing together.
What we are looking for
Role: Site Reliability Engineering (SRE)
Experience Range: 5 – 15 Years
Location: Chennai/Pune
Candidates should come to the office for the walk-in drive (Face to...
-
Site Reliability Engineer
3 days ago
New Delhi, India Tata Consultancy Services Full time
TCS has been a great pioneer in feeding the fire of young techies like you. We are a global leader in the technology arena and there’s nothing that can stop us from growing together.
What we are looking for
Role: Site Reliability Engineering (SRE)
Experience Range: 5 – 15 Years
Location: Chennai/Pune
Candidates should come to the office for the walk-in drive (Face...
-
Site Reliability Engineer
5 days ago
New Delhi, India Relanto Full time
We’re Hiring: Site Reliability Engineer (4+ Years Experience)
Location: Bangalore (WFO only - 5 days)
We are looking for a passionate Site Reliability Engineer (SRE) to join our growing team! If you love building reliable, scalable systems and enjoy solving complex problems, this role is for you.
What You’ll Do
Ensure high availability and performance of...
-
Site Reliability Engineer
4 days ago
New Delhi, India Relanto Full time
We’re Hiring: Site Reliability Engineer (4+ Years Experience)
Location: Bangalore (WFO only - 5 days)
We are looking for a passionate Site Reliability Engineer (SRE) to join our growing team! If you love building reliable, scalable systems and enjoy solving complex problems, this role is for you.
What You’ll Do
Ensure high availability and performance of...
-
Site Reliability Engineer
4 days ago
New Delhi, India Datum Technologies Group Full time
Job Title: Site Reliability Engineer (SRE) – AWS
Experience: 8+ years
Location: Chennai / Mumbai
Work Mode: Hybrid
Key Skills: AWS, Terraform, Kubernetes, Docker, Grafana, Prometheus, Datadog
Job Summary:
We are looking for a skilled Site Reliability Engineer (SRE) with strong AWS experience and a solid background in DevOps, automation, observability, and...
-
Site Reliability Engineer
1 week ago
New Delhi, India Enterprise Minds, Inc Full time
Senior Site Reliability Engineer (GCP | Terraform | Ansible | SRE | On-Call)
We are looking for a high-impact Site Reliability Engineer (SRE) who will play a key role in ensuring the reliability, availability, and scalability of our production systems on Google Cloud Platform (GCP). If you thrive in fast-paced environments, excel in incident management, and...
-
Site Reliability Engineer
4 days ago
New Delhi, India Enterprise Minds, Inc Full time
Senior Site Reliability Engineer (GCP | Terraform | Ansible | SRE | On-Call)
We are looking for a high-impact Site Reliability Engineer (SRE) who will play a key role in ensuring the reliability, availability, and scalability of our production systems on Google Cloud Platform (GCP). If you thrive in fast-paced environments, excel in incident management, and...
-
Site Reliability Engineer
3 weeks ago
New Delhi, India Grootan Technologies Full time
About the Role
We are seeking a skilled Site Reliability Engineer (SRE) with 4–5 years of hands-on experience to join our engineering team. In this role, you will be responsible for building and maintaining reliable, scalable, and secure infrastructure to support our applications. You will leverage your expertise in automation, cloud platforms, and...
-
Site Reliability Engineer
7 days ago
New Delhi, India Insight Global Full time
Company: Insight Global
Duration: Approved for 1 year
Location: Remote (India)
Type: Contract with Insight Global Client
Compensation: 14 LPA – 20 LPA
Working Hours: Normal IST hours
Start Date: Immediate (No notice period)
About the Role
Join our Site Reliability Engineering (SRE) team as a Fullstack Developer, focused on building and maintaining highly reliable,...