LLM Model training for Coding

2 weeks ago


Bangalore, India DeepLLMData Full time

Company Description

DeepLLMData specializes in empowering advanced AI models with expertly curated human data. Our services include training data for Coding SFT (Supervised Fine-Tuning), RLHF (Reinforcement Learning from Human Feedback), and various STEM-related domains. With a focus on Python programming, we draw on skilled professionals, including PhD-level experts in mathematics and domain-specific areas. Additionally, we provide top-tier image and video annotators alongside specialized data for multimedia applications, driving innovation in the AI industry.

Role Description

This is a remote contract role for a Python-focused LLM (Large Language Model) Trainer with extensive coding experience. The role involves curating and producing high-quality training data for LLMs, fine-tuning models using state-of-the-art techniques, and applying Python expertise to build and improve datasets.

Qualifications

  • Proficiency in Python programming, with 3+ years of experience
  • Demonstrated skills in machine learning, AI, and LLM fine-tuning and training
  • Strong background in coding, data curation, and reinforcement learning techniques
  • Ability to collaborate within a remote team and to work autonomously
  • Experience with data annotation tools, particularly for coding and multimedia projects
  • B.Tech in Computers or ECE



  • Bangalore, India DeepLLMData Full time

    Company Description DeepLLMData specializes in advancing frontier models by leveraging expert human data. The organization focuses on coding techniques like Supervised Fine-Tuning (SFT) and Reinforcement Learning from Human Feedback (RLHF). In addition, DeepLLMData collaborates with specialists in STEM fields, applied mathematics, and domain-specific areas....


  • Bangalore, India DeepLLMData Full time

    Company Description DeepLLMData specializes in providing expert human-generated data to power advanced frontier models. Our services include coding-specific supervised fine-tuning (SFT) and reinforcement learning from human feedback (RLHF). We are committed to fostering innovation in various domains through our teams of experts in STEM, Mathematics, and...


  • Bangalore, India DeepLLMData Full time

    Company Description DeepLLMData specializes in empowering advanced AI models with expertly curated human data. Our services include training data for Coding SFT (Supervised Fine-Tuning), RLHF (Reinforcement Learning from Human Feedback), and various STEM-related domains. With a focus on Python programming, we draw on skilled professionals, including...


  • Bangalore, India DeepLLMData Full time

    Company Description DeepLLMData specializes in providing high-quality human data to power advanced machine learning models, including Coding SFT and RLHF. The company brings together experts in STEM fields, Mathematics, and domain-specific areas to develop and enhance AI models. DeepLLMData also offers image and video annotation services as well as data...


  • Bangalore, India DeepLLMData Full time

    Company Description DeepLLMData specializes in driving advancements in AI by providing expertly curated human data to power frontier models. With expertise in Coding SFT (Supervised Fine-Tuning) and Reinforcement Learning from Human Feedback (RLHF), the company collaborates with domain-specific experts across STEM, Mathematics, PhD fields, and more....