SRE & MLOps Engineer (Platform Reliability & AI Operations)

3 days ago


bangalore, India Apna Full time

About Blue MachinesBlue Machines powers large-scale, real-time Voice AI and Agentic Workflows across BFSI,Healthcare, HRTech, and Global Enterprises.Role: SRE & MLOps Engineer (3–6 Years Experience)Location: Bangalore (Hybrid)What You Will Own1. Platform Uptime & Reliability- Maintain 99.9%+ uptime.- Monitor and optimize latency for voice agents.2. Observability, Monitoring & Incident Response- Build and maintain monitoring dashboards.- Configure alerts; first responder for incidents.3. MLOps & Model Provider Reliability- Monitor STT/TTS/LLM providers.- Manage failovers and latency SLAs.4. Kubernetes & Infrastructure- Manage GKE clusters, autoscaling, deployments.5. Internal Platform Tooling- Build automation around scaling, canaries, logs.6. Security & Compliance- Enforce encryption, network policies, audit support.RequirementsYou Are a Great Fit If You…- 2–5 years SRE/DevOps/MLOps experience.- Strong with Kubernetes, Prometheus, ELK, Redis, Pub/Sub.- Understand streaming, SIP, WebSockets.- Good communication and incident ownership.Preferred Skills- Experience with LLM pipelines, telephony, GPU, GCP.Why Blue Machines- Build India's most advanced Voice AI platform.- High-scale, low-latency engineering.- Work with CTO's office on reliability.



  • bangalore, India Oracle Full time

    Responsibilities Design, develop, and execute end-to-end (E2E) scenario validations that simulate real-world usage of complex AI data platform workflows (data ingestion, transformation, ML pipeline orchestration, etc.). Collaborate closely with product, engineering, and field teams to identify gaps in coverage and propose test automation strategies. Develop...


  • bangalore, India Oracle Full time

    ResponsibilitiesDesign, develop, and execute end-to-end (E2E) scenario validations that simulate real-world usage of complex AI data platform workflows (data ingestion, transformation, ML pipeline orchestration, etc.).Collaborate closely with product, engineering, and field teams to identify gaps in coverage and propose test automation strategies.Develop and...


  • bangalore, India Jade Global Full time

    Job Description Job Description Job Title: Senior Site Reliability Engineer (SRE) – Datadog Observability Experience Required: 8+ years overall in SRE and Infrastructure Operations with minimum 3+ years hands-on experience in Datadog Location: Hyderabad preferable but open for Pune and remote Job Summary: We are seeking an experienced Site Reliability...


  • bangalore district, India Oracle Full time

    Responsibilities Design, develop, and execute end-to-end (E2E) scenario validations that simulate real-world usage of complex AI data platform workflows (data ingestion, transformation, ML pipeline orchestration, etc.). Collaborate closely with product, engineering, and field teams to identify gaps in coverage and propose test automation strategies. Develop...


  • Bangalore Division, India Oracle Full time

    Responsibilities Design, develop, and execute end-to-end (E2E) scenario validations that simulate real-world usage of complex AI data platform workflows (data ingestion, transformation, ML pipeline orchestration, etc.). Collaborate closely with product, engineering, and field teams to identify gaps in coverage and propose test automation strategies. Develop...


  • bangalore, India BayOne Solutions Full time

    Job DescriptionWe are seeking a highly skilled AI Platform Engineer to design, build, and operate our next-generation AI application platform. In this role, you will work on advanced AI systems including Retrieval-Augmented Generation (RAG) pipelines, multi-model gateways, Model Context Protocol (MCP) tools, agentic workflow automations (e.g., n8n), and...


  • bangalore, India BayOne Solutions Full time

    Job Description We are seeking a highly skilled AI Platform Engineer to design, build, and operate our next-generation AI application platform . In this role, you will work on advanced AI systems including Retrieval-Augmented Generation (RAG) pipelines, multi-model gateways , Model Context Protocol (MCP) tools , agentic workflow automations (e.g., n8n), and...

  • SRE Devops Manager

    3 days ago


    bangalore, India Infinite Computer Solutions Full time

    We are looking for Site Reliability Engineering (SRE) Devops ManagerLocation: Bangalore / Hyderabad / Chennai / Noida / Pune / Visakhapatnam / GurgaonShift timing: regularCan join Immediate - 30 daysInterested candidates, Please share your profiles and below details toEmail ID: Shanmukh.Varma@infinite.comTotal experience:Relevant Experience:Current...


  • bangalore, India Oracle Full time

    Responsibilities Key Responsibilities: Design, develop, and execute end-to-end (E2E) scenario validations that simulate real-world usage of complex AI data platform workflows (data ingestion, transformation, ML pipeline orchestration, etc.). Collaborate closely with product, engineering, and field teams to identify gaps in coverage and propose test...


  • bangalore, India Oracle Full time

    ResponsibilitiesKey Responsibilities: Design, develop, and execute end-to-end (E2E) scenario validations that simulate real-world usage of complex AI data platform workflows (data ingestion, transformation, ML pipeline orchestration, etc.).Collaborate closely with product, engineering, and field teams to identify gaps in coverage and propose test automation...