LLM Model training for Coding

2 weeks ago


bangalore, India DeepLLMData Full time

Company DescriptionDeepLLMData specializes in advancing frontier models by leveraging expert human data. The organization focuses on coding techniques like Supervised Fine-Tuning (SFT) and Reinforcement Learning with Human Feedback (RLHF). In addition, DeepLLMData collaborates with specialists in STEM fields, applied mathematics, and domain-specific areas. The company also provides high-quality image and video annotation services, along with data provisioning. DeepLLMData supports innovation with a mission to enhance model training and performance through expert collaboration.Role DescriptionThis contract-based remote role focuses on training large language models (LLMs) for the Java programming language. The candidate will curate datasets, design tasks for model fine-tuning, and execute training processes for LLMs targeting Java programming.QualificationsProficiency and hands-on experience in Java programming with a minimum of 3 years experienceExperience in LLM training methods such as supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF)Knowledge and expertise in creating annotated coding-specific datasets and task definitions for LLMsStrong understanding of software development principles, debugging, and code optimizationExcellent communication skills to collaborate remotely with diverse technical teams effectivelyB. Tech Computer Engg or ECE.



  • bangalore, India DeepLLMData Full time

    Company Description DeepLLMData specializes in advancing frontier models by leveraging expert human data. The organization focuses on coding techniques like Supervised Fine-Tuning (SFT) and Reinforcement Learning with Human Feedback (RLHF). In addition, DeepLLMData collaborates with specialists in STEM fields, applied mathematics, and domain-specific areas....


  • bangalore, India DeepLLMData Full time

    Company Description DeepLLMData specializes in advancing frontier models by leveraging expert human data. The organization focuses on coding techniques like Supervised Fine-Tuning (SFT) and Reinforcement Learning with Human Feedback (RLHF). In addition, DeepLLMData collaborates with specialists in STEM fields, applied mathematics, and domain-specific areas....


  • bangalore, India DeepLLMData Full time

    Company Description DeepLLMData specializes in providing expert human-generated data to power advanced frontier models. Our services include coding-specific supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF). We are committed to fostering innovation in various domains through our teams of experts in STEM, Mathematics, and...


  • bangalore, India DeepLLMData Full time

    Company Description DeepLLMData specializes in providing expert human-generated data to power advanced frontier models. Our services include coding-specific supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF). We are committed to fostering innovation in various domains through our teams of experts in STEM, Mathematics, and...


  • bangalore, India DeepLLMData Full time

    Company Description DeepLLMData specializes in providing expert human-generated data to power advanced frontier models. Our services include coding-specific supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF). We are committed to fostering innovation in various domains through our teams of experts in STEM, Mathematics, and...


  • bangalore, India DeepLLMData Full time

    Company Description DeepLLMData specializes in empowering advanced AI models with expertly curated human data. Our services include training data for Coding SFT (Supervised Fine-Tuning), RLHF (Reinforcement Learning with Human Feedback), and various STEM-related domains. With a focus on Python programming, we draw on skilled professionals, including...


  • bangalore, India DeepLLMData Full time

    Company DescriptionDeepLLMData specializes in empowering advanced AI models with expertly curated human data. Our services include training data for Coding SFT (Supervised Fine-Tuning), RLHF (Reinforcement Learning with Human Feedback), and various STEM-related domains. With a focus on Python programming, we draw on skilled professionals, including PhD-level...


  • bangalore, India DeepLLMData Full time

    Company Description DeepLLMData specializes in providing high-quality human data to power advanced machine learning models, including Coding SFT and RLHF. The company brings together experts in STEM fields, Mathematics, and domain-specific areas to develop and enhance AI models. DeepLLMData also offers image and video annotation services as well as data...


  • bangalore, India DeepLLMData Full time

    Company Description DeepLLMData specializes in providing high-quality human data to power advanced machine learning models, including Coding SFT and RLHF. The company brings together experts in STEM fields, Mathematics, and domain-specific areas to develop and enhance AI models. DeepLLMData also offers image and video annotation services as well as data...


  • bangalore, India DeepLLMData Full time

    Company DescriptionDeepLLMData specializes in driving advancements in AI by providing expertly curated human data to power frontier models. With expertise in Coding SFT (Supervised Fine-Tuning) and Reinforcement Learning with Human Feedback (RLHF), the company collaborates with domain-specific experts across STEM, Mathematics, PhD fields, and more....