LLM Model training for Coding

2 weeks ago


Delhi, India DeepLLMData Full time

Company DescriptionDeepLLMData specializes in advancing frontier models by leveraging expert human data. The organization focuses on coding techniques like Supervised Fine-Tuning (SFT) and Reinforcement Learning with Human Feedback (RLHF). In addition, DeepLLMData collaborates with specialists in STEM fields, applied mathematics, and domain-specific areas. The company also provides high-quality image and video annotation services, along with data provisioning. DeepLLMData supports innovation with a mission to enhance model training and performance through expert collaboration.Role DescriptionThis contract-based remote role focuses on training large language models (LLMs) for the Java programming language. The candidate will curate datasets, design tasks for model fine-tuning, and execute training processes for LLMs targeting Java programming.Qualifications- Proficiency and hands-on experience in Java programming with a minimum of 3 years experience- Experience in LLM training methods such as supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF)- Knowledge and expertise in creating annotated coding-specific datasets and task definitions for LLMs- Strong understanding of software development principles, debugging, and code optimization- Excellent communication skills to collaborate remotely with diverse technical teams effectively- B. Tech Computer Engg or ECE.



  • Delhi, India DeepLLMData Full time

    Company Description DeepLLMData specializes in providing expert human-generated data to power advanced frontier models. Our services include coding-specific supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF). We are committed to fostering innovation in various domains through our teams of experts in STEM, Mathematics, and...


  • Delhi, India DeepLLMData Full time

    Company DescriptionDeepLLMData specializes in empowering advanced AI models with expertly curated human data. Our services include training data for Coding SFT (Supervised Fine-Tuning), RLHF (Reinforcement Learning with Human Feedback), and various STEM-related domains. With a focus on Python programming, we draw on skilled professionals, including PhD-level...


  • New Delhi, India DeepLLMData Full time

    Company Description DeepLLMData specializes in providing expert human-generated data to power advanced frontier models. Our services include coding-specific supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF). We are committed to fostering innovation in various domains through our teams of experts in STEM, Mathematics, and...


  • Delhi, India DeepLLMData Full time

    Company Description DeepLLMData specializes in providing high-quality human data to power advanced machine learning models, including Coding SFT and RLHF. The company brings together experts in STEM fields, Mathematics, and domain-specific areas to develop and enhance AI models. DeepLLMData also offers image and video annotation services as well as data...


  • New Delhi, India DeepLLMData Full time

    Company Description DeepLLMData specializes in providing expert human-generated data to power advanced frontier models. Our services include coding-specific supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF). We are committed to fostering innovation in various domains through our teams of experts in STEM, Mathematics, and...


  • New Delhi, India DeepLLMData Full time

    Company DescriptionDeepLLMData specializes in providing expert human-generated data to power advanced frontier models. Our services include coding-specific supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF). We are committed to fostering innovation in various domains through our teams of experts in STEM, Mathematics, and...


  • New Delhi, India DeepLLMData Full time

    Company DescriptionDeepLLMData specializes in providing expert human-generated data to power advanced frontier models. Our services include coding-specific supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF). We are committed to fostering innovation in various domains through our teams of experts in STEM, Mathematics, and...


  • Delhi, India DeepLLMData Full time

    Company DescriptionDeepLLMData specializes in driving advancements in AI by providing expertly curated human data to power frontier models. With expertise in Coding SFT (Supervised Fine-Tuning) and Reinforcement Learning with Human Feedback (RLHF), the company collaborates with domain-specific experts across STEM, Mathematics, PhD fields, and more....


  • Delhi, India DeepLLMData Full time

    Company Description DeepLLMData specializes in driving advancements in AI by providing expertly curated human data to power frontier models. With expertise in Coding SFT (Supervised Fine-Tuning) and Reinforcement Learning with Human Feedback (RLHF), the company collaborates with domain-specific experts across STEM, Mathematics, PhD fields, and more....


  • Delhi, India DeepLLMData Full time

    Company DescriptionDeep LLMData specializes in providing expert human-generated data to power advanced frontier models. Our services include coding-specific supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF). We are committed to fostering innovation in various domains through our teams of experts in STEM, Mathematics, and...