
RL Research Engineer
1 day ago
Location: Vapi, Gujarat
Employment Type: Full-Time
Overview
We are seeking a highly skilled Reinforcement Learning (RL) Research Engineer specializing in planning and control. The role focuses on designing learning-based planners and policies (RL, imitation learning, model-based) and integrating them with classical control approaches to enable safe, efficient, and robust autonomous operation across multiple domains including humanoids, AGVs, cars, and drones.
Key Responsibilities
- Develop and train policies from human demonstrations and teleoperation data.
- Implement safe reinforcement learning approaches with constraints.
- Design long-horizon planners using world models and uncertainty-aware control.
- Implement safety shields, fallback controllers, and verify-before-deploy pipelines.
- Collaborate with cross-functional teams to integrate RL policies with control systems.
- Conduct sim-to-real transfer and ensure policies generalize in real-world settings.
- Design reward functions and implement offline RL and behavioral cloning strategies.
Must-Haves
- 4–8+ years of experience in RL and control systems.
- Strong expertise in Model Predictive Control (MPC), Control Barrier Functions (CBFs), reachability analysis, or similar methods.
- Master's or PhD in Robotics, Control, AI, or a related field.
- Experience with sim-to-real transfer, reward design, offline RL, and behavioral cloning.
Nice-to-Haves
- Experience with multi-agent reinforcement learning.
- Knowledge of hierarchical options and diffusion policies.
- Familiarity with long-horizon task planning in complex environments.
Success Metrics
- Task success rate in target domains.
- Rate of human or system interventions during execution.
- Compliance with energy, jerk, and other control limits.
- Minimization of constraint violations in real-world deployment.
Domain Notes
Humanoids:
- Stable locomotion and bimanual task RL.
AGVs (Autonomous Ground Vehicles):
- Navigation in mixed human zones, traffic rule compliance, and aisle etiquette.
Cars:
- Interactive merges, handling unprotected turns, and safe navigation in dynamic traffic.
Drones:
- Wind-robust flight, safe landing and perching maneuvers.
Application Instructions
Interested candidates may apply by sending their resume and cover letter to parijat.patel@merai.co with the subject line: "RL Research Engineer (Planning & Control) Application".
-
RL Research Engineer
21 hours ago
Vapi, Gujarat, India Meril Full timeJob Title: RL Research Engineer (Planning & Control) Location: Vapi, Gujarat Employment Type: Full-Time Overview We are seeking a highly skilled Reinforcement Learning (RL) Research Engineer specializing in planning and control. The role focuses on designing learning-based planners and policies (RL, imitation learning, model-based) and integrating...
-
VLM Research Engineer
21 hours ago
Vapi, Gujarat, India Meril Full timeJob Title: VLM Research Engineer Location: Vapi, Gujarat Employment Type: Full-Time Overview We are seeking a highly skilled VLM Research Engineer to build multimodal (vision-language-action) models for instruction following, scene grounding, and tool use across platforms. The role involves developing advanced models that bridge perception and...
-
Autonomous Systems Engineer
1 day ago
Vapi, Gujarat, India beBeeResearch Full time ₹ 9,00,000 - ₹ 12,00,000Job Title: Reinforcement Learning Engineer for Autonomous Systems Location: Vapi, Gujarat Employment Type: Full-TimeOverviewWe are seeking a highly skilled Reinforcement Learning (RL) Research Engineer specializing in planning and control. The role focuses on designing learning-based planners and policies (RL, imitation learning, model-based) and integrating...
-
VLM Research Engineer
1 day ago
Vapi, Gujarat, India Meril Full timeJob Title: VLM Research EngineerLocation: Vapi, GujaratEmployment Type: Full-TimeOverviewWe are seeking a highly skilled VLM Research Engineer to build multimodal (vision-language-action) models for instruction following, scene grounding, and tool use across platforms. The role involves developing advanced models that bridge perception and language...
-
AI Research Planner
4 days ago
Vapi, Gujarat, India beBeeExpertise Full time ₹ 2,50,00,000 - ₹ 3,00,00,000Job Title: Reinforcement Learning ExpertWe are seeking an exceptional Reinforcement Learning (RL) professional with expertise in planning and control. The role focuses on developing learning-based planners and policies (RL, imitation learning, model-based) and integrating them with classical control approaches to enable safe, efficient, and robust autonomous...
-
Research Developers
14 hours ago
Vapi, Gujarat, India beBeeMachineLearning Full time ₹ 7,50,000 - ₹ 15,00,000Job Title: Research Interns MS or PhDWe are seeking skilled professionals to contribute to cutting-edge research projects in vision-language models, reinforcement learning and planning, perception, SLAM, 3D vision, and simulation.Key Responsibilities:Own a focused research question and deliver results from baselines to state-of-the-art attempts, including...
-
Research Interns MS or PhD
1 day ago
Vapi, Gujarat, India Meril Full timeJob Title: Research Interns (MS / PhD)Location: Vapi, GujaratEmployment Type: Internship (Full-Time)OverviewWe are seeking talented **Research Interns (MS/PhD)** to contribute to scoped projects across Vision-Language Models (VLM), Reinforcement Learning and Planning, Perception, SLAM, 3D Vision, and Simulation. Interns will focus on achieving publishable...
-
Research Interns MS or PhD
21 hours ago
Vapi, Gujarat, India Meril Full timeJob Title: Research Interns (MS / PhD) Location: Vapi, Gujarat Employment Type: Internship (Full-Time) Overview We are seeking talented **Research Interns (MS/PhD)** to contribute to scoped projects across Vision-Language Models (VLM), Reinforcement Learning and Planning, Perception, SLAM, 3D Vision, and Simulation. Interns will focus on achieving...
-
Multimodal Model Developer
4 days ago
Vapi, Gujarat, India beBeeMachineLearning Full time ₹ 2,00,00,000 - ₹ 2,50,00,000Research EngineerWe are seeking a highly skilled Research Engineer to build multimodal models for instruction following, scene grounding, and tool use across platforms. The role involves developing advanced models that bridge perception and language understanding for autonomous systems.Requirements- 12+ years of experience in Computer Vision/Machine...
-
Multimodal AI Researcher
1 day ago
Vapi, Gujarat, India beBeeArtificialintelligence Full time ₹ 1,50,00,000 - ₹ 2,00,00,000We are seeking a skilled Multimodal AI Researcher to develop and implement multimodal models for instruction following, scene grounding, and tool use across various platforms.Key Responsibilities:Pretrain and fine-tune VLMs aligning them with robotics data including video, teleoperation, and language.Build perception-to-language grounding for referring...