DevOps/sre

1 day ago


Bangalore City Bengaluru Karnataka, India JUTEQ Inc Full time

JUTEQ is an AI-native and cloud-native technology consulting firm helping enterprises in financial services, telecom, and healthcare build intelligent, production-grade systems. We combine the power of GenAI, cloud architecture, and automation to deliver next-generation business tools.

We’re seeking a **DevOps/Site Reliability Engineer (SRE)** with experience in **Google Cloud Platform (GCP)** to lead and evolve our AI infrastructure as we scale multi-tenant agentic systems across automotive and enterprise use cases. This is a hands-on role working at the intersection of automation, observability, and production AI system reliability.

**What You’ll Work On**

**Platform Reliability & Automation**
- Own deployment pipelines, autoscaling, and high-availability for AI microservices running on GCP (Cloud Run, GKE, App Engine)
- Design and optimize CI/CD pipelines using Cloud Build, Skaffold, GitHub Actions
- Implement intelligent autoscaling strategies based on LLM cost, latency, and throughput
- Use Infrastructure as Code (Terraform, Deployment Manager) for repeatable cloud provisioning

**Monitoring & Observability**
- Deploy monitoring and alerting across Cloud Logging, Cloud Monitoring, and custom dashboards for agent performance metrics
- Define SLOs and SLIs for key services; implement failover and rollback strategies
- Build observability into agent workflows: latency, success rate, AI token consumption, prompt drift, etc.

**Data & AI Infrastructure**
- Manage access, scaling, and resilience of data services: BigQuery, Firestore, Memorystore, Cloud Storage, Pub/Sub
- Support model integration workflows with Vertex AI and third-party LLM providers (OpenAI, Anthropic, etc.)
- Monitor and secure retrieval pipelines (RAG, embedding generation, vector DBs)

**Security & Compliance**
- Implement and maintain IAM policies, workload identity, and service-to-service authentication
- Lead incident response and postmortem analysis for production outages
- Ensure systems comply with data residency, privacy, and SOC2/GDPR requirements

**What We’re Looking For**

**Experience & Skills**
- 4+ years of DevOps or SRE experience, with at least 2+ years on GCP
- Strong understanding of GCP products including Cloud Run, GKE, Cloud Build, BigQuery, Pub/Sub, Cloud Monitoring
- Experience with CI/CD and GitOps workflows (GitHub Actions, ArgoCD, etc.) and Observability/Monitoring
- Deep knowledge of containerization, Docker, and Kubernetes
- Familiarity with AI infrastructure (LLMs, prompt evaluation, LangChain/CrewAI patterns) is a strong plus
- Experience with alerting and logging using Prometheus, Grafana, or GCP-native tools
- Proficient in scripting (Python, Bash, Go preferred)

**Bonus Points**
- Experience managing infrastructure for AI agent systems or GenAI workloads
- Familiarity with multi-tenant SaaS platforms
- Understanding of RAG pipelines, embedding generation, or agent orchestration
- Certifications: Google Professional Cloud DevOps Engineer or equivalent

**Why Join Us**
- Shape the infrastructure behind real-world AI agents used by automotive dealerships and enterprises
- Work alongside AI developers, product engineers, and solution architects
- Ship fast in a zero-to-one environment while building for scale
- Own platform-level impact across reliability, security, cost, and developer productivity

**How to Apply**

Please send:

- Your resume highlighting DevOps/SRE experience on GCP
- GitHub or portfolio links showcasing infrastructure projects or CI/CD pipelines
- (Optional) A short Loom or video describing your favorite system you’ve built or scaled



  • bangalore, India Bahwan Cybertek Group Full time

    We are looking for a talented DevOps / SRE Engineer with strong Python skills to join our team at Bahwan Cybertek Group. As a DevOps / SRE Engineer, you will be responsible for maintaining and improving our software development and deployment processes, as well as ensuring the reliability and scalability of our infrastructure.Responsibilities:- Develop and...

  • Senior Sre Advanced

    2 weeks ago


    Bangalore, Karnataka, India EMBARKGCC SERVICES PRIVATE LIMITED Full time

    About the Role We are seeking a Site Reliability Engineer SRE - DevOps professional to design automate and maintain reliable scalable and high-performing systems The ideal candidate will contribute directly to enhancing technology capabilities and business performance through innovative sustainable DevOps and reliability practices Key Responsibilities Design...


  • Bangalore, Karnataka, India EMBARKGCC SERVICES PRIVATE LIMITED Full time

    About the Role We are seeking a Site Reliability Engineer SRE - DevOps professional to design automate and maintain reliable scalable and high-performing systems The ideal candidate will contribute directly to enhancing technology capabilities and business performance through innovative sustainable DevOps and reliability practices Key Responsibilities Design...

  • SRE (Devops)

    7 days ago


    bangalore, India Cozzera Full time

    Role: Senior SRE Devops Shifts: Night Shift Location: Remote Key Responsibilities: Manage and optimize cloud infrastructure with strong hands-on expertise in AWS , Kubernetes , and Terraform . Automate deployment pipelines and ensure high availability and scalability of services. Troubleshoot production issues and provide on-call support during night shift....

  • DevOps Sre

    1 week ago


    Bengaluru, Karnataka, India Hutech Solutions Full time

    **DevOps SRE** **Greetings From Hutech Solutions !!!** **Requirements**: - Bachelor's or Master's degree in Computer Science, Information Technology, or a related field. Advanced degrees or relevant certifications are a plus. - Proven experience in setting up and managing Observability tools like Prometheus, Grafana, Alert Manager, and Loki. - Strong...


  • Bengaluru, India Bahwan Cybertek Group Full time

    We are looking for a talented DevOps / SRE Engineer with strong Python skills to join our team at Bahwan Cybertek Group. As a DevOps / SRE Engineer, you will be responsible for maintaining and improving our software development and deployment processes, as well as ensuring the reliability and scalability of our infrastructure. Responsibilities: - Develop and...

  • SRE (Devops)

    7 days ago


    bangalore, India Cozzera Full time

    Role: Senior SRE DevopsShifts: Night Shift Location: RemoteKey Responsibilities:Manage and optimize cloud infrastructure with strong hands-on expertise in AWS, Kubernetes, and Terraform.Automate deployment pipelines and ensure high availability and scalability of services.Troubleshoot production issues and provide on-call support during night...


  • Bangalore, India Prospance Inc Full time

    SRE & DevOps Engineer (ML/AI Platform) Contract Position | Global E-Commerce Leader | Hybrid We're partnering with a leading global e-commerce company to find an exceptional SRE & DevOps Engineer to join their AI Platform Team. This is your chance to shape the future of machine learning infrastructure that powers innovation for millions of users worldwide....

  • SRE Devops Manager

    7 days ago


    bangalore, India Infinite Computer Solutions Full time

    We are looking for Site Reliability Engineering (SRE) Devops ManagerLocation: Bangalore / Hyderabad / Chennai / Noida / Pune / Visakhapatnam / GurgaonShift timing: regularCan join Immediate - 30 daysInterested candidates, Please share your profiles and below details toEmail ID: Shanmukh.Varma@infinite.comTotal experience:Relevant Experience:Current...

  • DEVOPS SRE

    3 weeks ago


    Bengaluru, India RARR Technologies Full time

    Job Description Key Responsibilities: SRE & DevOps Strategy: - Design and develop a robust SRE ecosystem following industry best practices. - Formulate SRE strategies based on emerging trends and organizational needs. - Implement best practices into local functional teams for consistent adoption. Platform & Automation: - Develop scaffolding libraries for...