AI Research Engineer, RL

1 week ago


Delhi, India Pebble Full time

Role DescriptionThis is a full-time remote role for an AI Research Engineer specializing in Reinforcement Learning (RL). The AI Research Engineer will be responsible for developing and implementing state-of-the-art RL algorithms, collaborating on research projects, and integrating these algorithms into existing systems.QualificationsFull-stack engineering, from data engineering to model architecture, RL and deployment.Experience with performance engineering and identifying bottlenecks in RL training.Experience with tuning reward functions, hyper-parameters and exploration strategies to solve complex tasks with deep RL.8+ years of Python programming experience.Nice to haveAdvanced degree (MS or PhD) in Computer Science or related fieldExperience with k8s, docker, GPU Performance / systems engineering and model inference.



  • Delhi, India Pebble Full time

    Role DescriptionThis is a full-time remote role for an AI Research Engineer specializing in Reinforcement Learning (RL). The AI Research Engineer will be responsible for developing and implementing state-of-the-art RL algorithms, collaborating on research projects, and integrating these algorithms into existing systems.Qualifications- Full-stack engineering,...


  • Delhi, India AES Trading & Consultants Full time

    We are a proprietary trading firm specializing in HFT/MFT strategies, seeking a Quantitative Researcher skilled in Python, C++, AI (ML, & RL) to build and deploy alpha-generating models. This role offers significant autonomy, direct impact on PnL, and access to world-class data and infrastructure.ResponsibilitiesResearch and prototype strategies using ...


  • Delhi, India AES Trading & Consultants Full time

    We are a proprietary trading firm specializing in HFT/MFT strategies, seeking a Quantitative Researcher skilled in Python, C++, AI (ML, & RL) to build and deploy alpha-generating models. This role offers significant autonomy, direct impact on Pn L, and access to world-class data and infrastructure.Responsibilities- Research and prototype strategies using...


  • Delhi, India AES Trading & Consultants Full time

    We are a proprietary trading firm specializing in HFT/MFT strategies, seeking a Quantitative Researcher skilled in Python, C++, AI (ML, & RL) to build and deploy alpha-generating models. This role offers significant autonomy, direct impact on Pn L, and access to world-class data and infrastructure.Responsibilities- Research and prototype strategies using...


  • Delhi, India AES Trading & Consultants Full time

    We are a proprietary trading firm specializing in HFT/MFT strategies, seeking a Quantitative Researcher skilled in Python, C++, AI (ML, & RL) to build and deploy alpha-generating models. This role offers significant autonomy, direct impact on PnL, and access to world-class data and infrastructure.Responsibilities- Research and prototype strategies using...


  • Delhi, India AES Trading & Consultants Full time

    We are a proprietary trading firm specializing in HFT/MFT strategies, seeking aQuantitative Researcherskilled inPython, C++, AI (ML, & RL)to build and deploy alpha-generating models. This role offers significant autonomy, direct impact on PnL, and access to world-class data and infrastructure.ResponsibilitiesResearch and prototype strategies usingML/RL and...


  • Delhi, India Kalyani Group Full time

    Reinforcement Learning & Deep Learning for Robotic ArmsLocation: Bharat Forge, Mundhwa, PuneJob Type: Full-timeExperience Level: 3-6 YearsIndustry: AI-Driven Robotics, Neural Network-Based Manipulation, Autonomous Dexterity SystemsJob OverviewWe are looking for a highly technical Robotics Simulation Engineer specializing in reinforcement learning (RL) and...


  • New Delhi, India Kalyani Group Full time

    Reinforcement Learning & Deep Learning for Robotic ArmsLocation: Bharat Forge, Mundhwa, PuneJob Type: Full-timeExperience Level: 3-6 YearsIndustry: AI-Driven Robotics, Neural Network-Based Manipulation, Autonomous Dexterity SystemsJob OverviewWe are looking for a highly technical Robotics Simulation Engineer specializing in reinforcement learning (RL) and...


  • Delhi, India Jewelbyte Full time

    We're building the cursor for jewelry production.AtJewelbyte , we’re tackling a frontier problem: training AI systems to design and produce jewelry CAD files with the skill and creativity of expert human designers. This means combining AI, reinforcement learning, and 3D geometry to create production-ready jewelry designs at scale. It’s a bold challenge...


  • Delhi, India Jewelbyte Full time

    We're building the cursor for jewelry production.At Jewelbyte, we’re tackling a frontier problem: training AI systems to design and produce jewelry CAD files with the skill and creativity of expert human designers. This means combining AI, reinforcement learning, and 3D geometry to create production-ready jewelry designs at scale. It’s a bold challenge...