Evaluating the Trustworthiness of Advanced Language Models
2 days ago
AI Assurance Expert
In this pivotal role, you will be instrumental in ensuring the trustworthiness and performance of cutting-edge Large Language Models (LLMs), sub-models, and multi-agent systems through comprehensive evaluation strategies.
This involves simulating real-world scenarios and edge cases using Python scripting and synthetic data generation, collaborating closely with Data Science and Engineering teams, and maintaining detailed documentation and prompt libraries.
Key Responsibilities:
Designing and executing evaluation strategies to validate LLM performance.
Performing manual and automated testing to ensure model reliability.
Developing synthetic test data to simulate real-world scenarios.
Ensuring boundary condition coverage for robustness.
Preparing and maintaining prompt libraries for efficient testing.
Evaluating multi-model architectures for optimal performance.
Applying and interpreting evaluation metrics to inform development decisions.
Documenting test plans and evaluation reports for transparency and reproducibility.
Requirements:
5+ years of experience in quality assurance and AI/ML evaluation.
Strong hands-on experience with LLM evaluation techniques.
Proficiency in Python programming language.
Deep understanding of AI model architecture and its implications.
Familiarity with prompt engineering principles.
Experience with AI/ML testing frameworks and tools.
Solid grasp of evaluation metrics and their application.
Excellent analytical, documentation, and communication skills.
Prior experience in QA for AI/ML products is a plus.
Benefits:
This role offers a unique opportunity to contribute to the development of cutting-edge AI technologies and work closely with cross-functional teams. As an AI Assurance Expert, you will have the chance to grow professionally and personally in a dynamic and supportive environment.
-
Advanced Coding Model Developer
1 week ago
Dindigul, India beBeeLanguageModeler Full time
Job Description:
This position entails developing and refining large language models for coding in various programming languages, including Bash, Shell, Rust, and SQL.
The primary responsibilities include crafting and implementing strategies to train, fine-tune, and evaluate advanced coding models. This involves collaborating with AI researchers and developers...
-
Telugu Language Content Evaluator
3 days ago
Dindigul, India beBeeEvaluator Full time
About the Role
We seek skilled professionals to evaluate, review, and compare AI-generated responses in Telugu. This role requires a strong command of the Telugu language, the ability to identify harmful or toxic content, and deep cultural understanding for high-quality model performance.
Evaluate AI-generated outputs specifically in Telugu (both native script and...
-
Advanced AI Model Developer
1 week ago
Dindigul, India beBeeGenerativeai Full time
Job Opportunity for Advanced AI Model Developer
About the Role:
We seek a skilled professional to lead our Generative AI team in defining technical direction and best practices.
Identify high-impact use cases for AI-driven transformation, execute them, and integrate data for intelligent workflows.
Design, build, and deploy advanced generative and large...
-
Coding Models Trainer
4 days ago
Dindigul, India beBeeLlmTraining Full time
Advanced Coding Model Training
We are seeking an experienced professional to train, fine-tune, and evaluate advanced coding models using Bash/Shell/Rust/SQL.
Responsibilities include preprocessing data, developing training pipelines, debugging issues, and collaborating with AI researchers and developers to refine model performance.
Requirements
Proficiency in...
-
Advanced AI Model Developer
4 days ago
Dindigul, India beBeeEngineering Full time
As a Lead AI Engineer, you will be responsible for spearheading the development and implementation of advanced AI and machine learning models.
This leadership role involves guiding a team of engineers to successfully deploy projects that leverage AI/ML technologies to solve complex problems.
The ideal candidate should have hands-on expertise in NLP, Computer...
-
Content Evaluator – Malayalam
2 days ago
Dindigul, India beBeecontent Full time
Job Title: Content Evaluator – Malayalam
We are seeking a skilled Content Evaluator to evaluate AI-generated content in the Malayalam language. This role is ideal for individuals who possess strong proficiency in both Malayalam and English.
The successful candidate will be responsible for evaluating AI-generated outputs in Malayalam,...
-
Language Expertise Seeker
2 weeks ago
Dindigul, India beBeeTechnical Full time
Job Title: Language Specialist
Job Description: We are seeking a skilled Language Specialist to evaluate, annotate, and provide structured feedback on AI-generated content produced by Large Language Models (LLMs).
Required Skills and Qualifications:
Evaluate LLM outputs for correctness, coherence, and relevance in technical areas such as programming,...
-
Bengali Language Content Expert
4 days ago
Dindigul, India beBeeContentEvaluation Full time
Content Evaluation Specialist Job Description
Evaluate AI model outputs in Bengali, identifying toxic or harmful content and assessing model performance across multiple datasets.
Classify toxicity into hate speech, harassment, abusive language, etc.
Provide brief explanations for flagged items where required.
Key Qualifications:
Proficient in English and...
-
English Language Evaluation Specialist
2 weeks ago
Dindigul, India beBeeEvaluator Full time
Job Opportunity
We are seeking skilled professionals to evaluate academic content for our clients.
Evaluate essays, dissertation papers, and open-ended responses reliably as per established scoring guidelines.
Apply client-provided training and guidelines accurately while grading.
Maintain consistency, objectivity, and high-quality scoring standards in all...
-
Advanced Machine Learning Model Developer
2 weeks ago
Dindigul, India beBeeMachineLearning Full time
A highly skilled professional in machine learning and deep learning is required to develop advanced algorithms and models for industry-specific problems.
Key responsibilities include collecting and analyzing large datasets, developing machine learning models using Python and SQL, training and mentoring junior team members, and designing and implementing...