Technical Language Model Evaluator

1 week ago


jaipur, India beBeeEvaluator Full time

About This RoleIn this role, you will be responsible for evaluating language models (LLMs) to ensure their outputs are accurate and coherent. The LLMs are used in technical areas such as programming and algorithms.Key responsibilities include assessing AI responses, developing prompts, and providing structured feedback. You will also work on ensuring annotation consistency, data integrity, and high-quality labeling across assigned rating tasks.This is an exciting opportunity to bridge your academic expertise with cutting-edge LLM development. You will have the chance to work on creating AI that understands, explains, and reasons about code like a human expert.You will be working closely with our team to evaluate LLM outputs, assess AI responses, develop prompts, and provide structured feedback. Your expertise will bring precision and technical depth to training advanced AI.Required Skills and QualificationsEvaluating LLM outputs for accuracy and coherenceAssessing AI responses for logical consistency and technical accuracyDeveloping effective prompts for LLMsProviding structured feedback on AI responsesWe are looking for someone who has strong analytical and communication skills. If you have experience working with LLMs or AI, we would love to hear from you.This is a remote position, and you will have the flexibility to work from anywhere. However, you must be able to work independently and manage your time effectively.



  • jaipur, India beBeeContentEvaluator Full time

    Job Description:We're seeking highly skilled professionals to evaluate AI-generated responses in various Indic languages and identify toxic or harmful content. This role involves comparing model outputs, assessing performance across multiple datasets, and classifying the type and severity of toxicity.This is an excellent opportunity for individuals with...


  • jaipur, India beBeeContentEvaluator Full time

    Job OpportunityWe are seeking an exceptional individual to evaluate AI model outputs in Marathi, identifying and mitigating toxic or harmful content.About the Role:Evaluate AI model outputs in Marathi with high accuracyFlag and address toxic, harmful, or hate-based content promptlyAnalyze model responses and provide comprehensive performance...

  • Senior AI Evaluator

    2 weeks ago


    jaipur, India beBeeEthics Full time

    Job OpportunityWe are seeking a skilled professional to assess and ensure the ethical development and deployment of AI models across multiple platforms.This role involves evaluating traditional ML models, Large Language Models (LLMs), and Generative AI systems with a focus on fairness, transparency, privacy, security, and accountability in diverse...

  • Data Modeler

    3 days ago


    jaipur, India beBeeMachineLearning Full time

    We are seeking a skilled AI Model Developer with 3–8 years of experience in designing and deploying machine learning models. The ideal candidate has a strong foundation in classification, anomaly detection, and time-series modeling.The responsibilities include developing, training, and evaluating ML models for tasks such as data analysis, predictive...

  • AI Evaluator

    1 week ago


    jaipur, India beBeeFinance Full time

    Unlock the Potential of AI with a Career as a STEM Rater About the RoleThis is an exciting opportunity to join our team of experts in shaping the future of artificial intelligence. As a STEM Rater, you will play a crucial role in evaluating AI-generated responses for accuracy, clarity, and pedagogical effectiveness. ResponsibilitiesCreate high-quality...


  • Jaipur, India DeepLLMData Full time

    Company DescriptionDeepLLMData specializes in driving advancements in AI by providing expertly curated human data to power frontier models. With expertise in Coding SFT (Supervised Fine-Tuning) and Reinforcement Learning with Human Feedback (RLHF), the company collaborates with domain-specific experts across STEM, Mathematics, PhD fields, and more....


  • jaipur, India beBeeLinguistics Full time

    Job DescriptionWe are seeking a skilled Prompt Engineer to join our team. In this role, you will work with large language models (LLMs) to automate data labeling, classification, localization, and annotation tasks.You will programmatically use LLMs to automate tasks and develop data pipelines for model integration.Additionally, you will ensure that solutions...


  • jaipur, India beBeeHindi Full time

    Job Opportunity: Hindi Language SpecialistWe are seeking a highly skilled Hindi language specialist to collaborate with our AI platform and enhance its comprehension of Hindi language prompts.The selected candidate will be responsible for reviewing, evaluating, and refining AI-generated content based on Hindi inputs for accuracy, relevance, and...


  • jaipur, India beBeeLLMOps Full time

    Job Opportunity:We are seeking a highly skilled professional to design, implement, and scale our Large Language Model operations infrastructure.Lead the development of scalable infrastructure for training, fine-tuning, deployment, and inference of LLMs.Establish best practices for model deployment, monitoring, drift detection, and lifecycle...

  • AI Model Assessor

    3 days ago


    jaipur, India beBeeContentEvaluator Full time

    Job Overview: We are seeking highly skilled AI Content Evaluators with proficiency in Indian languages to assess and review AI-generated responses.Key Responsibilities:Evaluate AI-generated outputs across various Indic languages.Flag harmful, toxic, or hate-based content, including subtle context-dependent cases.Compare and score AI model responses according...