
VLM Research Engineer
2 days ago
Location: Vapi, Gujarat
Employment Type: Full-Time
Overview
We are seeking a highly skilled VLM Research Engineer to build multimodal (vision-language-action) models for instruction following, scene grounding, and tool use across platforms. The role involves developing advanced models that bridge perception and language understanding for autonomous systems.
Key Responsibilities
- Pretrain and fine-tune VLMs, aligning them with robotics data, including video, teleoperation, and language.
- Build perception-to-language grounding for referring expressions, affordances, and task graphs.
- Develop Toolformer/actuator interfaces to convert language intents into actionable skills and motion plans.
- Create evaluation pipelines for instruction following, safety filters, and hallucination control.
- Collaborate with cross-functional teams to integrate models into robotics platforms.
Must-Haves
- Master's or PhD in a relevant field.
- At least 1–2 years of experience in Computer Vision/Machine Learning.
- Strong proficiency in PyTorch or JAX; experience with LLMs and VLMs.
- Familiarity with multimodal datasets, distributed training, and reinforcement/imitation learning (RL/IL).
Nice-to-Haves
- Experience with world models, diffusion-policy integration, and speech interfaces.
- Familiarity with sim-to-real transfer in robotics applications.
Success Metrics
- Success@k on language-based tasks.
- Grounding precision and latency.
- Sim-to-real performance retention.
Domain Notes
Humanoids:
- Language-guided manipulation and tool use.
AGVs (Autonomous Ground Vehicles):
- Natural language tasking for warehouse operations; semantic maps.
Cars:
- Gesture and sign interpretation; driver interaction.
Drones:
- Natural language mission specification; target search and inspection.
Application Instructions
Interested candidates may apply by sending their resume and cover letter to parijat.patel@merai.co with the subject line: "VLM Research Engineer Application".