RL Research Engineer

4 days ago


New Delhi, India Meril Full time

Job Title: RL Research Engineer (Planning & Control)
Location: Vapi, Gujarat
Employment Type: Full-Time

Overview

We are seeking a highly skilled Reinforcement Learning (RL) Research Engineer specializing in planning and control. The role focuses on designing learning-based planners and policies (RL, imitation learning, model-based) and integrating them with classical control approaches to enable safe, efficient, and robust autonomous operation across multiple domains including humanoids, AGVs, cars, and drones.

Key Responsibilities

- Develop and train policies from human demonstrations and teleoperation data.
- Implement safe reinforcement learning approaches with constraints.
- Design long-horizon planners using world models and uncertainty-aware control.
- Implement safety shields, fallback controllers, and verify-before-deploy pipelines.
- Collaborate with cross-functional teams to integrate RL policies with control systems.
- Conduct sim-to-real transfer and ensure policies generalize in real-world settings.
- Design reward functions and implement offline RL and behavioral cloning strategies.

Must-Haves

- 4–8+ years of experience in RL and control systems.
- Strong expertise in Model Predictive Control (MPC), Control Barrier Functions (CBFs), reachability analysis, or similar methods.
- Master’s or PhD in Robotics, Control, AI, or a related field.
- Experience with sim-to-real transfer, reward design, offline RL, and behavioral cloning.

Nice-to-Haves

- Experience with multi-agent reinforcement learning.
- Knowledge of hierarchical options and diffusion policies.
- Familiarity with long-horizon task planning in complex environments.

Success Metrics

- Task success rate in target domains.
- Rate of human or system interventions during execution.
- Compliance with energy, jerk, and other control limits.
- Minimization of constraint violations in real-world deployment.

Domain Notes

- Humanoids: Stable locomotion and bimanual task RL.
- AGVs (Autonomous Ground Vehicles): Navigation in mixed human zones, traffic rule compliance, and aisle etiquette.
- Cars: Interactive merges, handling unprotected turns, and safe navigation in dynamic traffic.
- Drones: Wind-robust flight, safe landing and perching maneuvers.

Application Instructions

Interested candidates may apply by sending their resume and cover letter to parijat.patel@merai.co with the subject line: “RL Research Engineer (Planning & Control) Application”.



  • Delhi, India Pebble Full time

    Role Description: This is a full-time remote role for an AI Research Engineer specializing in Reinforcement Learning (RL). The AI Research Engineer will be responsible for developing and implementing state-of-the-art RL algorithms, collaborating on research projects, and integrating these algorithms into existing systems. Qualifications: Full-stack engineering,...


  • New Delhi, India AES Trading & Consultants Full time

    We are a proprietary trading firm specializing in HFT/MFT strategies, seeking a Quantitative Researcher skilled in Python, C++, and AI (ML & RL) to build and deploy alpha-generating models. This role offers significant autonomy, direct impact on PnL, and access to world-class data and infrastructure. Responsibilities: Research and prototype strategies using...


  • New Delhi, India Kalyani Group Full time

    Reinforcement Learning & Deep Learning for Robotic Arms. Location: Bharat Forge, Mundhwa, Pune. Job Type: Full-time. Experience Level: 3-6 Years. Industry: AI-Driven Robotics, Neural Network-Based Manipulation, Autonomous Dexterity Systems. Job Overview: We are looking for a highly technical Robotics Simulation Engineer specializing in reinforcement learning (RL)...


  • Delhi, India AES Trading & Consultants Full time

    We are a proprietary trading firm specializing in HFT/MFT strategies, seeking a Quantitative Researcher skilled in Python, C++, and AI (ML & RL) to build and deploy alpha-generating models. This role offers significant autonomy, direct impact on PnL, and access to world-class data and infrastructure. Responsibilities: Research and prototype strategies using...


  • Delhi, India AES Trading & Consultants Full time

    We are a proprietary trading firm specializing in HFT/MFT strategies, seeking a Quantitative Researcher skilled in Python, C++, and AI (ML & RL) to build and deploy alpha-generating models. This role offers significant autonomy, direct impact on PnL, and access to world-class data and infrastructure. Responsibilities: Research and prototype strategies using ML/RL and...