LLM Model training for Coding

2 weeks ago


Delhi, India DeepLLMData Full time

Company Description DeepLLMData specializes in providing expert human data to power frontier models, including Coding SFT and RLHF models. Our team includes STEM professionals, PhD holders, mathematicians, and experts in domain-specific fields. We also offer image and video annotation, along with data provisioning for cutting-edge AI and machine learning projects. Our focus is on delivering high-quality, customized data solutions that meet the needs of the most advanced models. Role Description This is a remote, contract role for an experienced professional in training large language models (LLMs) for coding in Multiple languages: A. Bash/Shell, B. Rust, C. SQL. The primary responsibilities include creating and executing strategies to train, fine-tune, and evaluate advanced coding models. The role involves preprocessing data, developing training pipelines, debugging issues, and collaborating with a multidisciplinary team of AI researchers and developers to refine model performance. Qualifications Proficiency in Bash/Shell scripting/Rust/SQL programming languages Experience in building, fine-tuning, and training large language models for specific use cases Strong debugging, troubleshooting, and problem-solving skills in AI/ML model development At least 3 years of professional experience with any of the above languages relevant to LLM training, coding, or software development Relevant degree in Computer Science, Machine Learning, or a similar field; advanced degrees (MS/PhD) preferred Experience with Reinforcement Learning from Human Feedback (RLHF) is considered a plus Ability to work independently in a fast-paced remote work environment



  • Delhi, India DeepLLMData Full time

    Company DescriptionDeepLLMData specializes in advancing frontier models by leveraging expert human data. The organization focuses on coding techniques like Supervised Fine-Tuning (SFT) and Reinforcement Learning with Human Feedback (RLHF). In addition, DeepLLMData collaborates with specialists in STEM fields, applied mathematics, and domain-specific areas....


  • Delhi, India DeepLLMData Full time

    Company Description DeepLLMData specializes in providing expert human-generated data to power advanced frontier models. Our services include coding-specific supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF). We are committed to fostering innovation in various domains through our teams of experts in STEM, Mathematics, and...


  • Delhi, India DeepLLMData Full time

    Company DescriptionDeepLLMData specializes in empowering advanced AI models with expertly curated human data. Our services include training data for Coding SFT (Supervised Fine-Tuning), RLHF (Reinforcement Learning with Human Feedback), and various STEM-related domains. With a focus on Python programming, we draw on skilled professionals, including PhD-level...


  • New Delhi, India DeepLLMData Full time

    Company Description DeepLLMData specializes in providing expert human-generated data to power advanced frontier models. Our services include coding-specific supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF). We are committed to fostering innovation in various domains through our teams of experts in STEM, Mathematics, and...


  • Delhi, India DeepLLMData Full time

    Company Description DeepLLMData specializes in providing high-quality human data to power advanced machine learning models, including Coding SFT and RLHF. The company brings together experts in STEM fields, Mathematics, and domain-specific areas to develop and enhance AI models. DeepLLMData also offers image and video annotation services as well as data...


  • New Delhi, India DeepLLMData Full time

    Company Description DeepLLMData specializes in providing expert human-generated data to power advanced frontier models. Our services include coding-specific supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF). We are committed to fostering innovation in various domains through our teams of experts in STEM, Mathematics, and...


  • New Delhi, India DeepLLMData Full time

    Company DescriptionDeepLLMData specializes in providing expert human-generated data to power advanced frontier models. Our services include coding-specific supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF). We are committed to fostering innovation in various domains through our teams of experts in STEM, Mathematics, and...


  • New Delhi, India DeepLLMData Full time

    Company DescriptionDeepLLMData specializes in providing expert human-generated data to power advanced frontier models. Our services include coding-specific supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF). We are committed to fostering innovation in various domains through our teams of experts in STEM, Mathematics, and...


  • Delhi, India DeepLLMData Full time

    Company DescriptionDeepLLMData specializes in driving advancements in AI by providing expertly curated human data to power frontier models. With expertise in Coding SFT (Supervised Fine-Tuning) and Reinforcement Learning with Human Feedback (RLHF), the company collaborates with domain-specific experts across STEM, Mathematics, PhD fields, and more....


  • Delhi, India DeepLLMData Full time

    Company Description DeepLLMData specializes in driving advancements in AI by providing expertly curated human data to power frontier models. With expertise in Coding SFT (Supervised Fine-Tuning) and Reinforcement Learning with Human Feedback (RLHF), the company collaborates with domain-specific experts across STEM, Mathematics, PhD fields, and more....