AI and Multimodal Model Specialist

5 days ago


India beBeeArtificial Full time ₹ 5,00,000 - ₹ 8,00,000
Job Description

We are seeking a highly skilled Research Engineer to join our team in developing cutting-edge AI solutions that integrate various data modalities and enhance interactive systems.

The ideal candidate will be passionate about AI and Natural Language Processing (NLP) and have the ability to develop and deploy machine learning models in practical environments, with a focus on Text-to-Speech (TTS) applications and agentic workflows.

  • Research & Development:
  • Develop state-of-the-art m-LLMs and generative AI models that process and integrate multiple data modalities, including text, audio, and visual inputs.
  • Design and conduct experiments to test new algorithms, architectures, and fine-tuning techniques for TTS applications and agentic workflows.
  • Stay updated with the latest academic and industry advancements in m-LLMs, TTS, and AI agents to inform ongoing projects.
Required Skills and Qualifications
  • Educational Background:
  • Pursuing an advanced degree (Master's or Ph.D.) in Computer Science, Machine Learning, NLP, or a closely related field.
  • Technical Expertise:
  • Experience or coursework in large language models, deep learning, NLP, and TTS technologies.
  • Proficiency in programming languages such as Python, with experience in frameworks like TensorFlow or PyTorch.
  • Understanding of algorithm design, data structures, and model optimization techniques relevant to m-LLMs and TTS systems.
Benefits
  • Collaboration Opportunities:
  • Work closely with cross-functional teams to translate research insights into tangible products that utilize m-LLMs and TTS technologies within agentic workflows.
  • Professional Growth:
  • Engage in a culture of learning and innovation, contributing to team knowledge sharing on m-LLMs, TTS, and AI agents.


  • India beBeeResearch Full time ₹ 40,00,000 - ₹ 50,00,000

    Research Scientist - Multimodal Large Language ModelsWe are seeking a highly motivated Research Scientist with expertise in developing and deploying cutting-edge multimodal large language models (m-LLMs) for interactive systems.The ideal candidate will have a strong background in natural language processing, deep learning, and TTS technologies. They will...


  • India beBeeResearch Full time US$ 1,20,000 - US$ 1,50,000

    Job OpportunityWe are seeking a skilled Research Engineer to join our team in developing cutting-edge AI solutions that integrate various data modalities and enhance interactive systems.This role offers a unique chance to engage in research and development of state-of-the-art Multimodal Large Language Models (m-LLMs) and generative AI models that process and...


  • India beBeeMultimodal Full time ₹ 1,78,10,000 - ₹ 2,19,40,000

    Senior Speech TechnologistWe are looking for a skilled Senior Speech Technologist to join our team. In this strategic role, you will have significant ownership on technical direction and drive innovation in scaling data processing, engineering efficiency and ease of AOAI customization, and generative AI.About the RoleYou will develop advanced customization...


  • India beBeeSoftware Full time US$ 1,80,000 - US$ 2,50,000

    Unlock the Future of Human-Machine Interaction As a Principal Software Engineer, you will play a pivotal role in shaping the future of human-machine interaction by leading the design and development of advanced infrastructure and tooling for customising multilingual speech models, AOAI systems, and multimodal generative AI.The technologies you will work on...


  • India beBeeAIENGINEER Full time ₹ 1,50,00,000 - ₹ 2,00,00,000

    Research Opportunity:We are seeking a highly skilled Research Engineer to engage in cutting-edge research and development of AI solutions that integrate various data modalities.Main Responsibilities:Multimodal Research:Develop state-of-the-art LARGE LANGUAGE MODELS and generative AI models that process and integrate multiple data modalities, including text,...

  • AI Model Architect

    2 weeks ago


    India beBeeMachineLearning Full time ₹ 1,50,00,000 - ₹ 2,00,00,000

    At the forefront of machine learning innovation, we're seeking a visionary scientist to push the boundaries of AI model architectures and training techniques.The ideal candidate will have a deep understanding of large language models, multimodal models, and image/video generation. Proficiency in Python, deep learning frameworks, and distributed training...


  • India Workana Full time

    About this project We are seeking a skilled ai developer to build an advanced multimodal ai assistant. This assistant should be capable of interacting through voice, gesture, image, and text inputs, providing a versatile user experience. The core functionality will involve integrating various technologies to process these different input modalities and...


  • India beBeeAIENGINEER Full time ₹ 15,00,000 - ₹ 25,00,000

    Transforming the Future of Video Creation with AI">Job Description: We're building a revolutionary product focused on generative AI video.We're looking for an experienced AI Engineer to join our team and take it to the next level. Your primary responsibility will be to optimize and fine-tune generative AI models for video and audio.">Key...


  • India Level AI Full time

    As a Computer Vision Engineering Intern, you will be responsible for the Neural network workload analysis and modeling AI accelerators and testing. **Roles and Responsibilities**: - As a Computer Vision Engineer, you’ll be working on several areas like Visual Recognition, Segmentation, Feature Extraction/Representation Learning. - You’ll be responsible...


  • India beBeeResearch Full time ₹ 1,00,00,000 - ₹ 2,50,00,000

    Job Description:As a Research Engineer in Artificial Intelligence and Natural Language Processing, you will be responsible for developing innovative solutions that integrate various data modalities. This includes text, audio, and visual inputs.You will assist in designing and conducting experiments to test new algorithms, architectures, and fine-tuning...