LLM Model training for Coding

2 weeks ago


Delhi, India DeepLLMData Full time

Company Description

DeepLLMData specializes in providing high-quality human data to power advanced machine learning models, including Coding SFT and RLHF. The company brings together experts in STEM fields, Mathematics, and domain-specific areas to develop and enhance AI models. DeepLLMData also offers image and video annotation services as well as data provisioning solutions to support cutting-edge AI and machine learning advancements. Our mission is to support innovation by enabling AI models to achieve superior performance across industries.

Role Description

This is a contract-based remote role for a professional with 3+ years of experience in Full Stack development using JavaScript/TypeScript and expertise in training LLMs for coding. The role involves preparing, refining, and delivering code data to train large language models (LLMs). Daily responsibilities include developing high-quality coding examples, collaborating with domain experts, validating coding outputs, and ensuring alignment with the evolving needs of the models.

Qualifications

  • Strong knowledge and experience in Full Stack development, including JavaScript/TypeScript, with a minimum of 3 years of professional experience.
  • Proficiency in back-end and front-end development; experience with frameworks like Node.js, React, or Angular is preferred.
  • Proficiency in creating, annotating, and testing code-specific datasets for training LLMs.
  • Experience with REST APIs, database integration, debugging, and version control tools (e.g. Git).
  • Excellent problem-solving abilities and communication skills conducive to remote collaboration.
  • Prior experience with model training, coding supervision, supervised fine-tuning, or reinforcement learning is beneficial.
  • A degree in Computer Science, Software Engineering, or a related field is preferred.



  • Delhi, India DeepLLMData Full time

    Company Description DeepLLMData specializes in advancing frontier models by leveraging expert human data. The organization focuses on coding techniques like Supervised Fine-Tuning (SFT) and Reinforcement Learning with Human Feedback (RLHF). In addition, DeepLLMData collaborates with specialists in STEM fields, applied mathematics, and domain-specific areas....


  • Delhi, India DeepLLMData Full time

    Company Description DeepLLMData specializes in providing expert human-generated data to power advanced frontier models. Our services include coding-specific supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF). We are committed to fostering innovation in various domains through our teams of experts in STEM, Mathematics, and...


  • Delhi, India DeepLLMData Full time

    Company Description DeepLLMData specializes in empowering advanced AI models with expertly curated human data. Our services include training data for Coding SFT (Supervised Fine-Tuning), RLHF (Reinforcement Learning with Human Feedback), and various STEM-related domains. With a focus on Python programming, we draw on skilled professionals, including PhD-level...


  • New Delhi, India DeepLLMData Full time

    Company Description DeepLLMData specializes in providing expert human-generated data to power advanced frontier models. Our services include coding-specific supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF). We are committed to fostering innovation in various domains through our teams of experts in STEM, Mathematics, and...


  • Delhi, India DeepLLMData Full time

    Company Description DeepLLMData specializes in driving advancements in AI by providing expertly curated human data to power frontier models. With expertise in Coding SFT (Supervised Fine-Tuning) and Reinforcement Learning with Human Feedback (RLHF), the company collaborates with domain-specific experts across STEM, Mathematics, PhD fields, and more....

