Multimodal Vision-Language Expert

2 days ago


Vapi, Gujarat, India beBeeVisionLanguageModelEngineer Full time ₹ 9,00,000 - ₹ 12,00,000

Job Title: Vision-Language Model Engineer

About the Role:

We are seeking a highly skilled Vision-Language Model (VLM) engineer to develop multimodal models for instruction following, scene grounding, and tool use across various platforms. The role involves designing advanced models that bridge perception and language understanding for autonomous systems.

Key Responsibilities:

  • Pretrain and fine-tune VLMs, aligning them with robotics data including video, teleoperation, and language.
  • Breathe life into perception-to-language grounding for referring expressions, affordances, and task graphs.
  • Create seamless interfaces to convert language intents into actionable skills and motion plans using Toolformers/actuators.
  • Develop comprehensive evaluation pipelines for instruction following, safety filters, and hallucination control.
  • Collaborate with cross-functional teams to integrate models into robotics platforms.

Requirements:

  • Master's or PhD in a relevant field.
  • 1–2+ years of experience in Computer Vision/Machine Learning.
  • Strong proficiency in PyTorch or JAX; experience with Large Language Models (LLMs) and VLMs.
  • Familiarity with multimodal datasets, distributed training, and Reinforcement Learning (RL)/Imitation Learning (IL).

Preferred Qualifications:

  • Experience with world models, diffusion-policy integration, and speech interfaces.
  • Familiarity with sim-to-real transfer in robotics applications.

Success Metrics:

  • Enhanced performance on language-based tasks.
  • Grounding precision and latency optimization.
  • Sim-to-real performance retention.

Domain Overview:

The VLM Research Engineer will work on developing multimodal models for various domains, including humanoids, AGVs, cars, and drones.

How to Apply:

Interested candidates can submit their resume and cover letter to us with the subject line: 'VLM Research Engineer Application'.



  • Vapi, Gujarat, India beBeeArtificialintelligence Full time ₹ 1,50,00,000 - ₹ 2,00,00,000

    We are seeking a skilled Multimodal AI Researcher to develop and implement multimodal models for instruction following, scene grounding, and tool use across various platforms.Key Responsibilities:Pretrain and fine-tune VLMs aligning them with robotics data including video, teleoperation, and language.Build perception-to-language grounding for referring...


  • Vapi, Gujarat, India beBeeMachineLearning Full time ₹ 2,00,00,000 - ₹ 2,50,00,000

    Research EngineerWe are seeking a highly skilled Research Engineer to build multimodal models for instruction following, scene grounding, and tool use across platforms. The role involves developing advanced models that bridge perception and language understanding for autonomous systems.Requirements- 12+ years of experience in Computer Vision/Machine...


  • Vapi, Gujarat, India beBeeVisionLanguage Full time ₹ 12,00,000 - ₹ 20,00,000

    Advanced Multimodal Model DeveloperJob Description:We are seeking a highly skilled developer to build multimodal models for instruction following, scene grounding, and tool use across platforms. The role involves developing advanced models that bridge perception and language understanding for autonomous systems.Key Responsibilities:Pretrain and finetune...


  • Vapi, Gujarat, India Meril Full time

    Job Title: VLM Research Engineer Location: Vapi, Gujarat Employment Type: Full-Time Overview We are seeking a highly skilled VLM Research Engineer to build multimodal (vision-language-action) models for instruction following, scene grounding, and tool use across platforms. The role involves developing advanced models that bridge perception and...


  • Vapi, Gujarat, India beBeeComputerVision Full time ₹ 1,50,00,000 - ₹ 2,00,00,000

    Key Roles:3D Computer Vision SpecialistMain Responsibilities:Develop algorithms for stereo/monocular depth and optical flow.Work on 3D reconstruction and occupancy mapping techniques.Implement kinematic pose estimation for humans and robots to enable affordance modeling and grasp synthesis.Create 3D priors to enhance prediction, planning, and vision-language...


  • Vapi, Gujarat, India Meril Full time

    Job Title: VLM Research EngineerLocation: Vapi, GujaratEmployment Type: Full-TimeOverviewWe are seeking a highly skilled VLM Research Engineer to build multimodal (vision-language-action) models for instruction following, scene grounding, and tool use across platforms. The role involves developing advanced models that bridge perception and language...


  • Vapi, Gujarat, India beBeeSlam Full time ₹ 9,00,000 - ₹ 12,00,000

    SLAM Expert WantedWe are seeking a highly skilled SLAM expert to join our team. The ideal candidate will have extensive knowledge of localization and mapping techniques, including visual-inertial, LiDAR, and multi-sensor fusion.The SLAM expert will be responsible for developing robust SLAM systems for various robotics platforms. This includes architecting...


  • Vapi, Gujarat, India beBeeAutonomous Full time ₹ 15,00,000 - ₹ 18,00,000

    SLAM Expert PositionJob Description:We are seeking a highly skilled SLAM expert responsible for localization and mapping, including visual-inertial, LiDAR, and multi-sensor fusion as well as dense and semantic mapping. The role involves developing robust SLAM systems for various robotics platforms.Key ResponsibilitiesDeveloping advanced SLAM algorithms for...


  • Vapi, Gujarat, India beBeeReinforcement Full time ₹ 1,50,00,000 - ₹ 2,50,00,000

    Job Summary:We are seeking a highly skilled Reinforcement Learning (RL) Research Engineer specializing in planning and control to design learning-based planners and policies and integrate them with classical control approaches for safe, efficient, and robust autonomous operation.Main Responsibilities:Policy Development: Develop and train policies from human...


  • Vapi, Gujarat, India beBeeProcess Full time ₹ 15,00,000 - ₹ 25,00,000

    Job SummarySkillful and experienced SAP professional required to drive strategic process management across MM/PP modules. As a Senior Executive, you will be responsible for overseeing SAP enhancement and development projects, working closely with stakeholders to define business requirements, and leading small-scale change initiatives.You will also assist in...