RL Research Engineer

2 days ago

Vapi, Gujarat, India Meril Full time

Job Title: RL Research Engineer (Planning & Control)

Location: Vapi, Gujarat

Employment Type: Full-Time

Overview

We are seeking a highly skilled Reinforcement Learning (RL) Research Engineer specializing in planning and control. The role focuses on designing learning-based planners and policies (RL, imitation learning, model-based) and integrating them with classical control approaches to enable safe, efficient, and robust autonomous operation across multiple domains including humanoids, AGVs, cars, and drones.

Key Responsibilities

Develop and train policies from human demonstrations and teleoperation data.
Implement safe reinforcement learning approaches with constraints.
Design long-horizon planners using world models and uncertainty-aware control.
Implement safety shields, fallback controllers, and verify-before-deploy pipelines.
Collaborate with cross-functional teams to integrate RL policies with control systems.
Conduct sim-to-real transfer and ensure policies generalize in real-world settings.
Design reward functions and implement offline RL and behavioral cloning strategies.

Must-Haves

4–8+ years of experience in RL and control systems.
Strong expertise in Model Predictive Control (MPC), Control Barrier Functions (CBFs), reachability analysis, or similar methods.
Master's or PhD in Robotics, Control, AI, or a related field.
Experience with sim-to-real transfer, reward design, offline RL, and behavioral cloning.

Nice-to-Haves

Experience with multi-agent reinforcement learning.
Knowledge of hierarchical RL (e.g., the options framework) and diffusion policies.
Familiarity with long-horizon task planning in complex environments.

Success Metrics

Task success rate in target domains.
Rate of human or system interventions during execution.
Compliance with energy, jerk, and other control limits.
Minimization of constraint violations in real-world deployment.

Domain Notes

Humanoids:

- Stable locomotion and RL for bimanual tasks.

AGVs (Autonomous Ground Vehicles):

- Navigation in zones shared with humans, traffic rule compliance, and aisle etiquette.

Cars:

- Interactive merges, unprotected turns, and safe navigation in dynamic traffic.

Drones:

- Wind-robust flight, safe landing, and perching maneuvers.

Application Instructions

Interested candidates may apply by sending their resume and cover letter with the subject line: "RL Research Engineer (Planning & Control) Application".
