AI Research Planner

4 days ago


Vapi, Gujarat, India beBeeExpertise Full time ₹ 2,50,00,000 - ₹ 3,00,00,000
Job Title: Reinforcement Learning Expert

We are seeking an exceptional Reinforcement Learning (RL) professional with expertise in planning and control. The role focuses on developing learning-based planners and policies (RL, imitation learning, model-based) and integrating them with classical control approaches to enable safe, efficient, and robust autonomous operation across multiple domains including humanoids, AGVs, cars, and drones.

Key Responsibilities:
  • Develop and train policies from human demonstrations and teleoperation data.
  • Implement safe reinforcement learning approaches with constraints.
  • Design long-horizon planners using world models and uncertainty-aware control.
  • Implement safety shields, fallback controllers, and verify-before-deploy pipelines.
  • Collaborate with cross-functional teams to integrate RL policies with control systems.
  • Conduct sim-to-real transfer and ensure policies generalize in real-world settings.
  • Design reward functions and implement offline RL and behavioral cloning strategies.
Required Skills and Qualifications:
  • 48+ years of experience in RL and control systems.
  • Strong expertise in Model Predictive Control (MPC), Control Barrier Functions (CBFs), reachability analysis, or similar methods.
  • Masters or PhD in Robotics, Control, AI, or a related field.
  • Experience with sim-to-real transfer, reward design, offline RL, and behavioral cloning.
Benefits:
  • Prominent roles in cutting-edge projects.
  • Unlimited opportunities for growth and development.
  • An engaging work environment that fosters collaboration and innovation.
Others:
  • Task success rate in target domains.
  • Rate of human or system interventions during execution.
  • Compliance with energy, jerk, and other control limits.
  • Minimization of constraint violations in real-world deployment.


  • Vapi, Gujarat, India Meril Full time

    Job Title: RL Research Engineer (Planning & Control) Location: Vapi, Gujarat Employment Type: Full-Time Overview We are seeking a highly skilled Reinforcement Learning (RL) Research Engineer specializing in planning and control. The role focuses on designing learning-based planners and policies (RL, imitation learning, model-based) and integrating...


  • Vapi, Gujarat, India Meril Full time

    Job Title: RL Research Engineer (Planning & Control)Location: Vapi, GujaratEmployment Type: Full-TimeOverviewWe are seeking a highly skilled Reinforcement Learning (RL) Research Engineer specializing in planning and control. The role focuses on designing learning-based planners and policies (RL, imitation learning, model-based) and integrating them with...


  • Vapi, Gujarat, India beBeeArtificialintelligence Full time ₹ 1,50,00,000 - ₹ 2,00,00,000

    We are seeking a skilled Multimodal AI Researcher to develop and implement multimodal models for instruction following, scene grounding, and tool use across various platforms.Key Responsibilities:Pretrain and fine-tune VLMs aligning them with robotics data including video, teleoperation, and language.Build perception-to-language grounding for referring...


  • Vapi, Gujarat, India beBeeReinforcement Full time ₹ 1,50,00,000 - ₹ 2,50,00,000

    Job Summary:We are seeking a highly skilled Reinforcement Learning (RL) Research Engineer specializing in planning and control to design learning-based planners and policies and integrate them with classical control approaches for safe, efficient, and robust autonomous operation.Main Responsibilities:Policy Development: Develop and train policies from human...


  • Vapi, Gujarat, India Meril Full time

    Job Title: Applied AI Engineer – Fresher (IIT/NIT Graduates Only)Job Type: Full-TimeExperience: 0–1 YearLocation: Vapi GujaratRole Overview:We are seeking high-potential graduates from IITs or NITs for the role of Applied AI Engineer (Fresher) to join our cutting-edge AI Engineering team. This opportunity is ideal for someone with a solid foundation in...


  • Vapi, Gujarat, India beBeeResearch Full time ₹ 9,00,000 - ₹ 12,00,000

    Job Title: Reinforcement Learning Engineer for Autonomous Systems Location: Vapi, Gujarat Employment Type: Full-TimeOverviewWe are seeking a highly skilled Reinforcement Learning (RL) Research Engineer specializing in planning and control. The role focuses on designing learning-based planners and policies (RL, imitation learning, model-based) and integrating...

  • RL Specialist

    1 day ago


    Vapi, Gujarat, India beBeeArtificial Full time ₹ 15,00,000 - ₹ 20,00,000

    Reinforcement Learning SpecialistWe are seeking a highly skilled Reinforcement Learning specialist with expertise in planning and control.Key Responsibilities:Develop and train policies from human demonstrations and teleoperation data.Implement safe reinforcement learning approaches with constraints.Design long-horizon planners using world models and...

  • Ui/ux Designer

    4 weeks ago


    Vapi, Gujarat, India Meril Full time

    Job Title: UI/UX Designer Location: Vapi, India Company: Meril (AI/ML Division)Experience: 5+ years About Meril Meril is an Artificial Intelligence and Machine Learning-driven company in the healthcare domain.We develop cutting-edge AI solutions to revolutionize healthcare, improving efficiency and patient outcomes.Job Overview We are seeking an experienced...

  • UI/UX Designer

    4 weeks ago


    Vapi, Gujarat, India Meril Full time

    Job Title: UI/UX DesignerLocation: Vapi, IndiaCompany: Meril (AI/ML Division)Experience: 5+ yearsAbout MerilMeril is an Artificial Intelligence and Machine Learning-driven company in the healthcare domain. We develop cutting-edge AI solutions to revolutionize healthcare, improving efficiency and patient outcomes.Job OverviewWe are seeking an experienced...


  • Vapi, Gujarat, India Meril Full time

    Job Description:• We are seeking a highly skilled and motivated Structural Biologist to join our drugdiscovery programs. The ideal candidate will bring deep expertise inmacromolecular structure determination and modeling, with a focus on elucidatingprotein structures, analyzing ligand binding, and driving rational drug design incollaboration with...