
RL Research Engineer
2 days ago
Job Title: RL Research Engineer (Planning & Control)
Location: Vapi, Gujarat
Employment Type: Full-Time
Overview
We are seeking a highly skilled Reinforcement Learning (RL) Research Engineer specializing in planning and control. The role focuses on designing learning-based planners and policies (RL, imitation learning, model-based) and integrating them with classical control approaches to enable safe, efficient, and robust autonomous operation across multiple domains including humanoids, AGVs, cars, and drones.
Key Responsibilities
- Develop and train policies from human demonstrations and teleoperation data.
- Implement safe reinforcement learning approaches with constraints.
- Design long-horizon planners using world models and uncertainty-aware control.
- Implement safety shields, fallback controllers, and verify-before-deploy pipelines.
- Collaborate with cross-functional teams to integrate RL policies with control systems.
- Conduct sim-to-real transfer and ensure policies generalize in real-world settings.
- Design reward functions and implement offline RL and behavioral cloning strategies.
Must-Haves
- 4–8+ years of experience in RL and control systems.
- Strong expertise in Model Predictive Control (MPC), Control Barrier Functions (CBFs), reachability analysis, or similar methods.
- Master's or PhD in Robotics, Control, AI, or a related field.
- Experience with sim-to-real transfer, reward design, offline RL, and behavioral cloning.
Nice-to-Haves
- Experience with multi-agent reinforcement learning.
- Knowledge of hierarchical options and diffusion policies.
- Familiarity with long-horizon task planning in complex environments.
Success Metrics
- Task success rate in target domains.
- Rate of human or system interventions during execution.
- Compliance with energy, jerk, and other control limits.
- Minimization of constraint violations in real-world deployment.
Domain Notes
Humanoids:
- Stable locomotion and bimanual task RL.
AGVs (Autonomous Ground Vehicles):
- Navigation in mixed human zones, traffic rule compliance, and aisle etiquette.
Cars:
- Interactive merges, handling unprotected turns, and safe navigation in dynamic traffic.
Drones:
- Wind-robust flight, safe landing and perching maneuvers.
Application Instructions
Interested candidates may apply by sending their resume and cover letter to with the subject line: "RL Research Engineer (Planning & Control) Application" .
-
RL Research Engineer
5 days ago
Vapi, Gujarat, India Meril Full timeJob Title: RL Research Engineer (Planning & Control) Location: Vapi, Gujarat Employment Type: Full-Time Overview We are seeking a highly skilled Reinforcement Learning (RL) Research Engineer specializing in planning and control. The role focuses on designing learning-based planners and policies (RL, imitation learning, model-based) and integrating...
-
RL Research Engineer
5 days ago
Vapi, Gujarat, India Meril Full timeJob Title: RL Research Engineer (Planning & Control)Location: Vapi, GujaratEmployment Type: Full-TimeOverviewWe are seeking a highly skilled Reinforcement Learning (RL) Research Engineer specializing in planning and control. The role focuses on designing learning-based planners and policies (RL, imitation learning, model-based) and integrating them with...
-
Advanced RL Control Systems Specialist
2 days ago
Vapi, Gujarat, India beBeeReinforcementLearningEngineer Full time ₹ 2,00,00,000 - ₹ 2,50,00,000Key Position in Reinforcement Learning EngineeringThis is a challenging role for an expert in planning and control, focusing on the application of reinforcement learning (RL) to real-world systems.
-
VLM Research Engineer
5 days ago
Vapi, Gujarat, India Meril Full timeJob Title: VLM Research Engineer Location: Vapi, Gujarat Employment Type: Full-Time Overview We are seeking a highly skilled VLM Research Engineer to build multimodal (vision-language-action) models for instruction following, scene grounding, and tool use across platforms. The role involves developing advanced models that bridge perception and...
-
VLM Research Engineer
5 days ago
Vapi, Gujarat, India Meril Full timeJob Title: VLM Research EngineerLocation: Vapi, GujaratEmployment Type: Full-TimeOverviewWe are seeking a highly skilled VLM Research Engineer to build multimodal (vision-language-action) models for instruction following, scene grounding, and tool use across platforms. The role involves developing advanced models that bridge perception and language...
-
VLM Research Engineer
3 days ago
Vapi, Gujarat, India Meril Full timeJob DescriptionJob Title: VLM Research EngineerLocation: Vapi, GujaratEmployment Type: Full-TimeOverviewWe are seeking a highly skilled VLM Research Engineer to build multimodal (vision-language-action) models for instruction following, scene grounding, and tool use across platforms. The role involves developing advanced models that bridge perception and...
-
Autonomous Systems Engineer
5 days ago
Vapi, Gujarat, India beBeeResearch Full time ₹ 9,00,000 - ₹ 12,00,000Job Title: Reinforcement Learning Engineer for Autonomous Systems Location: Vapi, Gujarat Employment Type: Full-TimeOverviewWe are seeking a highly skilled Reinforcement Learning (RL) Research Engineer specializing in planning and control. The role focuses on designing learning-based planners and policies (RL, imitation learning, model-based) and integrating...
-
VLM Research Engineer
2 days ago
Vapi, Gujarat, India Meril Full timeJob Title: VLM Research Engineer Location: Vapi, Gujarat Employment Type: Full-Time Overview We are seeking a highly skilled VLM Research Engineer to build multimodal (vision-language-action) models for instruction following, scene grounding, and tool use across platforms. The role involves developing advanced models that bridge perception and...
-
AI Research Planner
1 week ago
Vapi, Gujarat, India beBeeExpertise Full time ₹ 2,50,00,000 - ₹ 3,00,00,000Job Title: Reinforcement Learning ExpertWe are seeking an exceptional Reinforcement Learning (RL) professional with expertise in planning and control. The role focuses on developing learning-based planners and policies (RL, imitation learning, model-based) and integrating them with classical control approaches to enable safe, efficient, and robust autonomous...
-
Research Developers
4 days ago
Vapi, Gujarat, India beBeeMachineLearning Full time ₹ 7,50,000 - ₹ 15,00,000Job Title: Research Interns MS or PhDWe are seeking skilled professionals to contribute to cutting-edge research projects in vision-language models, reinforcement learning and planning, perception, SLAM, 3D vision, and simulation.Key Responsibilities:Own a focused research question and deliver results from baselines to state-of-the-art attempts, including...