Multimodal AI Researcher

2 days ago


Vapi, Gujarat, India beBeeArtificialintelligence Full time ₹ 1,50,00,000 - ₹ 2,00,00,000

We are seeking a skilled Multimodal AI Researcher to develop and implement multimodal models for instruction following, scene grounding, and tool use across various platforms.

Key Responsibilities:
  • Pretrain and fine-tune VLMs aligning them with robotics data including video, teleoperation, and language.
  • Build perception-to-language grounding for referring expressions, affordances, and task graphs.
  • Develop interfaces to convert language intents into actionable skills and motion plans.
  • Create evaluation pipelines for instruction following, safety filters, and hallucination control.
Requirements:
  • Masters or PhD in Computer Science or relevant field.
  • 1–2+ years of experience in Computer Vision/Machine Learning.
  • Strong proficiency in PyTorch or JAX; experience with LLMs and VLMs.
  • Familiarity with multimodal datasets, distributed training, and RL/IL.
Success Metrics:
  • Success@k on language-based tasks.
  • Grounding precision and latency.
  • SIM-to-real performance retention.

This role will contribute to cutting-edge projects in Humanoids, AGVs, Cars, and Drones, focusing on language-guided manipulation, natural language tasking, gesture interpretation, and target search.

The ideal candidate should have a solid background in computer science, strong programming skills, and experience working with multimodal data. The ability to communicate complex ideas effectively is crucial.



  • Vapi, Gujarat, India beBeeMachineLearning Full time ₹ 2,00,00,000 - ₹ 2,50,00,000

    Research EngineerWe are seeking a highly skilled Research Engineer to build multimodal models for instruction following, scene grounding, and tool use across platforms. The role involves developing advanced models that bridge perception and language understanding for autonomous systems.Requirements- 12+ years of experience in Computer Vision/Machine...


  • Vapi, Gujarat, India beBeeVisionLanguage Full time ₹ 12,00,000 - ₹ 20,00,000

    Advanced Multimodal Model DeveloperJob Description:We are seeking a highly skilled developer to build multimodal models for instruction following, scene grounding, and tool use across platforms. The role involves developing advanced models that bridge perception and language understanding for autonomous systems.Key Responsibilities:Pretrain and finetune...


  • Vapi, Gujarat, India beBeeVisionLanguageModelEngineer Full time ₹ 9,00,000 - ₹ 12,00,000

    Job Title: Vision-Language Model EngineerAbout the Role:We are seeking a highly skilled Vision-Language Model (VLM) engineer to develop multimodal models for instruction following, scene grounding, and tool use across various platforms. The role involves designing advanced models that bridge perception and language understanding for autonomous systems.Key...


  • Vapi, Gujarat, India Meril Full time

    Job Title: VLM Research Engineer Location: Vapi, Gujarat Employment Type: Full-Time Overview We are seeking a highly skilled VLM Research Engineer to build multimodal (vision-language-action) models for instruction following, scene grounding, and tool use across platforms. The role involves developing advanced models that bridge perception and...


  • Vapi, Gujarat, India Meril Full time

    Job Title: VLM Research EngineerLocation: Vapi, GujaratEmployment Type: Full-TimeOverviewWe are seeking a highly skilled VLM Research Engineer to build multimodal (vision-language-action) models for instruction following, scene grounding, and tool use across platforms. The role involves developing advanced models that bridge perception and language...

  • AI Research Planner

    5 days ago


    Vapi, Gujarat, India beBeeExpertise Full time ₹ 2,50,00,000 - ₹ 3,00,00,000

    Job Title: Reinforcement Learning ExpertWe are seeking an exceptional Reinforcement Learning (RL) professional with expertise in planning and control. The role focuses on developing learning-based planners and policies (RL, imitation learning, model-based) and integrating them with classical control approaches to enable safe, efficient, and robust autonomous...


  • Vapi, Gujarat, India Meril Full time

    Job Title: Applied AI Engineer – Fresher (IIT/NIT Graduates Only)Job Type: Full-TimeExperience: 0–1 YearLocation: Vapi GujaratRole Overview:We are seeking high-potential graduates from IITs or NITs for the role of Applied AI Engineer (Fresher) to join our cutting-edge AI Engineering team. This opportunity is ideal for someone with a solid foundation in...


  • Vapi, Gujarat, India Meril Full time

    Job Title: RL Research Engineer (Planning & Control)Location: Vapi, GujaratEmployment Type: Full-TimeOverviewWe are seeking a highly skilled Reinforcement Learning (RL) Research Engineer specializing in planning and control. The role focuses on designing learning-based planners and policies (RL, imitation learning, model-based) and integrating them with...

  • Ui/ux Designer

    4 weeks ago


    Vapi, Gujarat, India Meril Full time

    Job Title: UI/UX Designer Location: Vapi, India Company: Meril (AI/ML Division)Experience: 5+ years About Meril Meril is an Artificial Intelligence and Machine Learning-driven company in the healthcare domain.We develop cutting-edge AI solutions to revolutionize healthcare, improving efficiency and patient outcomes.Job Overview We are seeking an experienced...

  • UI/UX Designer

    4 weeks ago


    Vapi, Gujarat, India Meril Full time

    Job Title: UI/UX DesignerLocation: Vapi, IndiaCompany: Meril (AI/ML Division)Experience: 5+ yearsAbout MerilMeril is an Artificial Intelligence and Machine Learning-driven company in the healthcare domain. We develop cutting-edge AI solutions to revolutionize healthcare, improving efficiency and patient outcomes.Job OverviewWe are seeking an experienced...