RL Research Engineer

1 day ago


Vapi, Gujarat, India Meril Full time
Job Title: RL Research Engineer (Planning & Control)

Location: Vapi, Gujarat

Employment Type: Full-Time

Overview

We are seeking a highly skilled Reinforcement Learning (RL) Research Engineer specializing in planning and control. The role focuses on designing learning-based planners and policies (RL, imitation learning, and model-based methods) and integrating them with classical control approaches to enable safe, efficient, and robust autonomous operation across multiple domains, including humanoids, AGVs, cars, and drones.

Key Responsibilities

- Develop and train policies from human demonstrations and teleoperation data.
- Implement safe reinforcement learning approaches with constraints.
- Design long-horizon planners using world models and uncertainty-aware control.
- Implement safety shields, fallback controllers, and verify-before-deploy pipelines.
- Collaborate with cross-functional teams to integrate RL policies with control systems.
- Conduct sim-to-real transfer and ensure policies generalize in real-world settings.
- Design reward functions and implement offline RL and behavioral cloning strategies.

Must-Haves

- 4–8+ years of experience in RL and control systems.
- Strong expertise in Model Predictive Control (MPC), Control Barrier Functions (CBFs), reachability analysis, or similar methods.
- Master's or PhD in Robotics, Control, AI, or a related field.
- Experience with sim-to-real transfer, reward design, offline RL, and behavioral cloning.

Nice-to-Haves

- Experience with multi-agent reinforcement learning.
- Knowledge of hierarchical RL (options framework) and diffusion policies.
- Familiarity with long-horizon task planning in complex environments.

Success Metrics

- Task success rate in target domains.
- Rate of human or system interventions during execution.
- Compliance with energy, jerk, and other control limits.
- Minimization of constraint violations in real-world deployment.

Domain Notes

Humanoids:

- Stable locomotion and RL for bimanual tasks.

AGVs (Autonomous Ground Vehicles):

- Navigation in mixed human zones, traffic rule compliance, and aisle etiquette.

Cars:

- Interactive merges, unprotected turns, and safe navigation in dynamic traffic.

Drones:

- Wind-robust flight, safe landing, and perching maneuvers.

Application Instructions

Interested candidates may apply by sending their resume and cover letter to parijat.patel@merai.co with the subject line: "RL Research Engineer (Planning & Control) Application".