Forward Deployed ML Engineer, Agents

23 hours ago


Bengaluru, Karnataka, India AION Full time
About AION

AION is building an interoperable AI cloud platform by transforming the future of high-performance computing (HPC) through its decentralized AI cloud. Purpose-built for bare-metal performance, AION democratizes access to compute and provides managed services, aiming to be an end-to-end AI lifecycle platform—taking organizations from data to deployed models using its forward-deployed engineering approach.

AI is transforming every business around the world, and the demand for compute is surging like never before. AION thrives to be the gateway for dynamic compute workloads by building integration bridges with diverse data centers around the world and re-inventing the compute stack via its state-of-the-art serverless technology. We stand at the crossroads where enterprises are finding it hard to balance AI adoption with security. At AION, we take enterprise security and compliance very seriously and are re-thinking every piece of infrastructure from hardware and network packets to API interfaces.

Led by high-pedigree founders with previous exits, AION is well-funded by major VCs with strategic global partnerships. Headquartered in the US with global presence, the company is building its initial core team in India/UK.

Who You Are

You're a hands-on AI engineer with 3-5+ years of experience building production-grade multimodal AI systems and LLM applications. Your responsibilities mirror those of a hands-on AI startup CTO—you work in small teams to own delivery of high-stakes customer projects, embedding directly at client sites to architect, build, and deploy intelligent agent solutions.

You're equally comfortable writing production code, presenting technical solutions to C-level executives, and debugging complex AI systems on factory floors or in customer data centers. You've shipped voice agents, video processing systems, or conversational AI to production. You thrive translating ambiguous business requirements into concrete technical solutions that create measurable impact.

You're comfortable working across the full AI deployment lifecycle—from use case discovery and solution architecture to multimodal agent development, MLOps pipeline implementation, and production optimization. You understand what makes agents perform well in production and how to systematically improve quality through observability and evaluation. Experience with voice AI platforms, RAG systems, and LLM orchestration frameworks is highly desirable. You bring exceptional communication skills, customer empathy, and the drive to build AI solutions that transform enterprise operations globally.

What You'll Do
Customer Engagement & Multimodal Agent Development
  • Work directly at customer sites—from factory floors to executive offices—conducting discovery workshops and technical assessments to identify high-impact AI opportunities
  • Design and architect end-to-end multimodal agent systems (voice + video + text) that leverage AION's distributed GPU infrastructure and managed services
  • Build production-grade voice AI systems using STT, TTS APIs, and LLMs deployed on AION's platform
  • Develop vision-enabled agents processing real-time video streams using computer vision pipelines on AION's infrastructure
  • Implement sophisticated multi-agent orchestration with(or similar) frameworks like LangChain or LlamaIndex—enabling tool use, memory management, and autonomous task completion
  • Rapidly prototype POCs in 2-4 weeks, coding alongside client teams to validate concepts and iterate based on feedback
  • Optimize for sub-500ms latency, natural conversation flow, turn detection, and interruption handling in real-time systems
  • Integrate agents directly into customer codebases via REST/GraphQL/WebSocket APIs and custom SDKs (Python, TypeScript)
  • Act as trusted technical advisor to customers, shaping AI strategy and guiding roadmap decisions from concept to production
Data Strategy & MLOps Infrastructure
  • Design data architectures with efficient processing pipelines and ingestion workflows for training and inference on AION's platform
  • Implement RAG systems with vector databases—optimizing embedding strategies, chunk sizes, and retrieval methods
  • Prepare and validate datasets for fine-tuning, evaluation, and synthetic data generation
  • Work with other MLEs, MLOps, SREs to carry out model deployment and productionization
Observability, Evaluation & Production Operations
  • Implement LLM and agents observability and monitoring—tracking token usage, latency, costs, and quality metrics across deployments on AION's infrastructure
  • Instrument applications to trace LLM calls, retrieval operations, agent actions, and data flows
  • Build evaluation frameworks with offline benchmarks (accuracy, relevance, safety metrics) and online monitoring (user feedback, drift detection)
Technical Skills & Experience

If you are meeting some of these requirements and feel comfortable catching up on others, we definitely recommend you to apply:

  • 3-5+ years of hands-on experience building production AI/ML systems, with 1-2+ years deploying LLM applications to production
  • Multimodal AI expertise—practical experience building voice agents, vision systems, or conversational AI serving real users
  • Strong LLM foundations—hands-on with modern foundation models including fine-tuning, prompt engineering, and evaluation methodologies
  • Agent framework proficiency—production experience with LangChain, LlamaIndex, or similar orchestration frameworks
  • Voice AI platform experience—built real-time conversational systems with production STT/TTS integration
  • Proficiency in Python (production-grade, async programming, type hints) and JavaScript/TypeScript (full-stack development)
  • RAG implementation experience—built retrieval-augmented generation systems with vector databases
  • MLOps & deployment—hands-on with Docker, Kubernetes, CI/CD pipelines, and infrastructure-as-code
  • Cloud platforms—experience with AWS, Azure, or GCP for ML workloads and infrastructure management
  • Exceptional communication—ability to explain complex AI concepts clearly to both technical and business stakeholders
  • Customer-facing experience in Solutions Architecture, Technical Account Management, or Pre-Sales Engineering is highly desirable
  • Computer vision experience—working with video processing, object detection, or vision-language models is a plus
  • Model fine-tuning—practical experience with LoRA/QLoRA, supervised fine-tuning, or RLHF workflows is a plus
  • Inference optimization—experience with vLLM, TensorRT-LLM, Triton, or model quantization techniques is desirable
  • Observability tooling—practical experience with LLM monitoring, tracing, and evaluation frameworks is a strong plus
  • Familiarity with WebRTC, real-time streaming protocols, and low-latency media processing

Why Join AION?

  • Work directly with high-pedigree founders shaping technical and product strategy.
  • Build infrastructure powering the future of AI compute globally.
  • Significant ownership and impact with equity reflective of your contributions.
  • Competitive compensation, flexible work options, and wellness benefits.

Apply Now:
If you're a machine learning engineer ready to lead MLAAS(Machine learning as a Service) architecture and scale next-generation AI infrastructure, we want to hear from you. Please share the following in the summary section:

  • Your resume highlights relevant projects and leadership experience
  • Links to products, code(Github), or demos you've built.
  • A brief note on why AION's mission excites you.


  • Bengaluru, Karnataka, India AION Full time

    About AIONAION is building an interoperable AI cloud platform by transforming the future of high-performance computing (HPC) through its decentralized AI cloud. Purpose-built for bare-metal performance, AION democratizes access to compute and provides managed services, aiming to be an end-to-end AI lifecycle platform—taking organizations from data to...


  • Bengaluru, Karnataka, India, Karnataka Adopt AI Full time

    Company DescriptionAdopt AI empowers modern applications to provide agentic experiences for end users within days. With Adopt, users can execute complex actions, automate workflows, and innovate via natural language commands. Our AI agents understand workflows and execute dynamic plans to achieve desired outcomes, enabling intuitive interaction with...


  • Bengaluru, Karnataka, India C3 AI Full time

    C3 AI (NYSE: AI), is the Enterprise AI application software company. C3 AI delivers a family of fully integrated products including the C3 Agentic AI Platform, an end-to-end platform for developing, deploying, and operating enterprise AI applications, C3 AI applications, a portfolio of industry-specific SaaS enterprise AI applications that enable the digital...


  • Bengaluru, Karnataka, India Emergent Labs Full time

    Forward Deployed AI EngineerLocation: Remote / Hybrid (Preferred: Bengaluru, India)Team: Customer EngineeringCompany: Emergent – The world's first agentic vibe-coding platformAbout EmergentEmergent is reimagining how software is built. We're the world's first agentic vibe-coding platform — built on top of Claude — that turns natural language into fully...


  • Bengaluru, Karnataka, India Sarvam AI Full time

    Forward Deployed Software EngineerLocation: Bengaluru, India  Experience: 1 - 3 years experienceCompany Overview is a pioneering generative AI startup headquartered in Bengaluru, India. We are dedicated to leading transformative research and development in the field of language technologies. With a focus on building scalable and efficient large language...


  • Bengaluru, Karnataka, India Workato Full time

    About WorkatoWorkato transforms technology complexity into business opportunity. As the leader in enterprise orchestration, Workato helps businesses globally streamline operations by connecting data, processes, applications, and experiences. Its AI-powered platform enables teams to navigate complex workflows in real-time, driving efficiency and...


  • Bengaluru, Karnataka, India Zigment Full time

    Why In today's fast-paced digital landscape, engaging and converting leads is crucial for every business. provides next-generation AI conversational solutions to help businesses grow their revenue, acquire more customers, and build a stronger brand with ease.We see a massive opportunity ahead and are committed to building a significant, lasting business....


  • Bengaluru, Karnataka, India ToolJet Full time

    Role DescriptionThis is a full-time role for a Forward Deployed Engineer at ToolJet.As a Forward Deployed Engineer, you will work closely with clients to understand their business requirements and build tailored internal applications using ToolJet platform.You'll design, develop, and implement functional solutions, ensure smooth integration with client...


  • Bengaluru, Karnataka, India Zamp Full time

    About Zamp:Mission -Zamp is not a company, we're a humanity catalyst. We're on a mission to enable people to move at the speed of thought.This decade, we're focused on building digital employees for the future of work, unlocking human creativity at a scale the world has never seen. We work with 50+ top global organizations and banks (including DoorDash,...


  • Bengaluru, Karnataka, India Giift Full time

    Role OverviewAs aForward Deployed Engineer (FDE), you'll be thetechnical bridge between Xoxoday and our enterprise clients. You'll lead solution design, deployment, and technical integration of our products into customer environments — ensuring clients unlock the full value of the Xoxoday suite.This role blendsengineering, consulting, and...