
DevOps/sre
4 days ago
JUTEQ is an AI-native and cloud-native technology consulting firm helping enterprises in financial services, telecom, and healthcare build intelligent, production-grade systems. We combine the power of GenAI, cloud architecture, and automation to deliver next-generation business tools.
We’re seeking a **DevOps/Site Reliability Engineer (SRE)** with experience in **Google Cloud Platform (GCP)** to lead and evolve our AI infrastructure as we scale multi-tenant agentic systems across automotive and enterprise use cases. This is a hands-on role working at the intersection of automation, observability, and production AI system reliability.
**What You’ll Work On**
**Platform Reliability & Automation**
- Own deployment pipelines, autoscaling, and high-availability for AI microservices running on GCP (Cloud Run, GKE, App Engine)
- Design and optimize CI/CD pipelines using Cloud Build, Skaffold, GitHub Actions
- Implement intelligent autoscaling strategies based on LLM cost, latency, and throughput
- Use Infrastructure as Code (Terraform, Deployment Manager) for repeatable cloud provisioning
**Monitoring & Observability**
- Deploy monitoring and alerting across Cloud Logging, Cloud Monitoring, and custom dashboards for agent performance metrics
- Define SLOs and SLIs for key services; implement failover and rollback strategies
- Build observability into agent workflows: latency, success rate, AI token consumption, prompt drift, etc.
**Data & AI Infrastructure**
- Manage access, scaling, and resilience of data services: BigQuery, Firestore, Memorystore, Cloud Storage, Pub/Sub
- Support model integration workflows with Vertex AI and third-party LLM providers (OpenAI, Anthropic, etc.)
- Monitor and secure retrieval pipelines (RAG, embedding generation, vector DBs)
**Security & Compliance**
- Implement and maintain IAM policies, workload identity, and service-to-service authentication
- Lead incident response and postmortem analysis for production outages
- Ensure systems comply with data residency, privacy, and SOC2/GDPR requirements
**What We’re Looking For**
**Experience & Skills**
- 4+ years of DevOps or SRE experience, with at least 2+ years on GCP
- Strong understanding of GCP products including Cloud Run, GKE, Cloud Build, BigQuery, Pub/Sub, Cloud Monitoring
- Experience with CI/CD and GitOps workflows (GitHub Actions, ArgoCD, etc.) and Observability/Monitoring
- Deep knowledge of containerization, Docker, and Kubernetes
- Familiarity with AI infrastructure (LLMs, prompt evaluation, LangChain/CrewAI patterns) is a strong plus
- Experience with alerting and logging using Prometheus, Grafana, or GCP-native tools
- Proficient in scripting (Python, Bash, Go preferred)
**Bonus Points**
- Experience managing infrastructure for AI agent systems or GenAI workloads
- Familiarity with multi-tenant SaaS platforms
- Understanding of RAG pipelines, embedding generation, or agent orchestration
- Certifications: Google Professional Cloud DevOps Engineer or equivalent
**Why Join Us**
- Shape the infrastructure behind real-world AI agents used by automotive dealerships and enterprises
- Work alongside AI developers, product engineers, and solution architects
- Ship fast in a zero-to-one environment while building for scale
- Own platform-level impact across reliability, security, cost, and developer productivity
**How to Apply**
Please send:
- Your resume highlighting DevOps/SRE experience on GCP
- GitHub or portfolio links showcasing infrastructure projects or CI/CD pipelines
- (Optional) A short Loom or video describing your favorite system you’ve built or scaled
-
DevOps / SRE with Python
5 days ago
Bengaluru, Karnataka, India Bahwan Cybertek Group Full time ₹ 9,00,000 - ₹ 12,00,000 per yearWe are looking for a talented DevOps / SRE Engineer with strong Python skills to join our team at Bahwan Cybertek Group. As a DevOps / SRE Engineer, you will be responsible for maintaining and improving our software development and deployment processes, as well as ensuring the reliability and scalability of our infrastructure.Responsibilities:- Develop and...
-
SRE Devops
5 days ago
Bengaluru, Karnataka, India ITC Infotech Full timeNote : We are looking for Strong experience in Python scripting mandatory.This requires 3 days WFO mandatory at Kadubesanhalii Bangalore.Need Immediate to 15 days joiners only.JD : -We are seeking a Site Reliability Engineer (SRE) / DevOps Engineer with strong expertise in Python development and AWS cloud platforms to design, implement, and maintain...
-
SRE Devops
1 day ago
Bengaluru, Karnataka, India ITC Infotech Full timeNote : We are looking for Strong experience in Python scripting mandatory. Need Immediate to 15 days joiners only. We are seeking a Site Reliability Engineer (SRE) / DevOps Engineer with strong expertise in Python development and AWS cloud platforms to design, implement, and maintain highly available, scalable, and secure infrastructure. The ideal...
-
SRE Devops
3 weeks ago
Bengaluru, Karnataka, India ITC Infotech Full timeJob DescriptionNote : We are looking for Strong experience in Python scripting mandatory.This requires 3 days WFO mandatory at Kadubesanhalii Bangalore.Need Immediate to 15 days joiners only.JD : -We are seeking a Site Reliability Engineer (SRE) / DevOps Engineer with strong expertise in Python development and AWS cloud platforms to design, implement, and...
-
SRE Devops
2 weeks ago
Bengaluru, Karnataka, India ITC Infotech Full timeNote : We are looking for Strong experience in Python scripting mandatory. This requires 3 days WFO mandatory at Kadubesanhalii Bangalore. Need Immediate to 15 days joiners only. JD : - We are seeking a Site Reliability Engineer (SRE) / DevOps Engineer with strong expertise in Python development and AWS cloud platforms to design, implement, and maintain...
-
DevOps Sre
5 days ago
Bengaluru, Karnataka, India Tata Consultancy Services Full timeMust Have 1) Docker, Kubernetes skills ( workingexperience ) 2) Working knowledge of CI/CD pipelines. 3) Linux Administration skills. 4) AWS knowledge. 5) GitHub knowledge 6) Nginx Load Balancing. JD Good understanding of DevOps principles(CI/CD, release automation) - Good knowledge of Linux / Bash Scripting / Python / Java - Good understanding of...
-
Sre with Aiop and Dynatrace
5 days ago
Bengaluru, Karnataka, India Virtusa Full timeKnowledge & Experience: Minimum of 6 years of relevant work experience in critical production environments Hands-on experience of curating Service Level Objectives, defining Error Budgets and refining the change management lifecycle to accommodate the same Knowledge and experience with CI CD pipelines and deployment patterns like Canary Has experience...
-
Highly Skilled Devops Sre Career Opportunity
3 days ago
Bengaluru, Karnataka, India beBeeExpert Full time ₹ 1,80,00,000 - ₹ 2,00,00,000Job ProfileWe are seeking an exceptional DevOps SRE Expert to join our organization. The ideal candidate will possess a solid background in system reliability, incident response, and automation.
-
Azure Ai ml Sre
5 days ago
Bengaluru, Karnataka, India NTT DATA Full time**Req ID**: 338862 We are currently seeking a Azure AI_ML SRE to join our team in Bengaluru, Karnātaka (IN-KA), India (IN). - Provide Azure infrastructure Sustain/Operations support - Manage all activities of SRE- Site Reliability Engineer - Support for deployments/change requests requiring SRE assistance - 365 Days coverage and 1st or 2nd...
-
Senior Transformation Expert
22 hours ago
Bengaluru, Karnataka, India beBeeTransformation Full time ₹ 1,04,000 - ₹ 1,30,878Job OverviewWe are seeking a seasoned transformation expert to lead our DevOps and SRE efforts. This role will focus on delivering successful program management, aligning with business strategy, and fostering continuous improvement.Key Responsibilities:Develop and implement effective transformation strategies to drive business outcomes.Guide cross-functional...