Multimodal AI Research Specialist

2 days ago


India beBeeResearch Full time US$ 1,20,000 - US$ 1,50,000

Job Opportunity

We are seeking a skilled Research Engineer to join our team in developing cutting-edge AI solutions that integrate various data modalities and enhance interactive systems.

This role offers a unique chance to engage in research and development of state-of-the-art Multimodal Large Language Models (m-LLMs) and generative AI models that process and integrate multiple data modalities, including text, audio, and visual inputs.

Key Responsibilities:

  • Develop novel m-LLMs and generative AI models for complex NLP tasks with a focus on TTS and multimodal integration.
  • Design and conduct experiments to test new algorithms, architectures, and fine-tuning techniques for TTS applications and agentic workflows.
  • Stay updated with the latest advancements in m-LLMs, TTS, and AI agents to inform ongoing projects.

Model Design & Optimization:

  • Participate in developing, fine-tuning, and evaluating AI models for real-world production systems, ensuring seamless interaction between components in agentic workflows.
  • Optimize models for scalability, efficiency, and deployment in production systems, focusing on naturalness and responsiveness of TTS systems.
  • Experiment with novel methods in model training, domain adaptation, and performance evaluation to enhance TTS systems.

Collaboration & Learning:

  • Work closely with cross-functional teams to translate research insights into tangible products that utilize m-LLMs and TTS technologies within agentic workflows.
  • Engage in a culture of learning and innovation, contributing to team knowledge sharing on m-LLMs, TTS, and AI agents.
  • Collaborate with external research communities, potentially contributing to conferences and publications in the fields of m-LLMs, TTS, and agentic AI systems.

Requirements:

  • Educational Background: Master's or Ph.D. in Computer Science, Machine Learning, NLP, or a closely related field.
  • Technical Expertise: Experience or coursework in large language models, deep learning, NLP, and TTS technologies.
  • Proficiency in programming languages such as Python, with experience in frameworks like TensorFlow or PyTorch.
  • Understanding of algorithm design, data structures, and model optimization techniques relevant to m-LLMs and TTS systems.

Preferred Qualifications:

  • Advanced Research Experience: Progress toward a Ph.D. in a relevant discipline with a record of publications or patents.
  • Experience in pioneering research in AI, with familiarity in m-LLM frameworks and toolkits.
  • Specialized Skills: Experience with TTS systems, including speech synthesis and voice conversion technologies.


  • India beBeeResearch Full time ₹ 40,00,000 - ₹ 50,00,000

    Research Scientist - Multimodal Large Language ModelsWe are seeking a highly motivated Research Scientist with expertise in developing and deploying cutting-edge multimodal large language models (m-LLMs) for interactive systems.The ideal candidate will have a strong background in natural language processing, deep learning, and TTS technologies. They will...


  • India beBeeMultimodal Full time ₹ 1,78,10,000 - ₹ 2,19,40,000

    Senior Speech TechnologistWe are looking for a skilled Senior Speech Technologist to join our team. In this strategic role, you will have significant ownership on technical direction and drive innovation in scaling data processing, engineering efficiency and ease of AOAI customization, and generative AI.About the RoleYou will develop advanced customization...

  • AI Research Engineer

    7 hours ago


    India FlashIntel Full time

    Role Overview FlashIntel is seeking a dedicated and innovative Research Engineer with a focus on Multimodal Large Language Models (m-LLMs), Text-to-Speech (TTS) technologies, and agentic workflows. This position offers a unique opportunity to engage in cutting-edge research and development of AI solutions that integrate various data modalities and enhance...


  • India beBeeResearch Full time ₹ 1,00,00,000 - ₹ 2,50,00,000

    Job Description:As a Research Engineer in Artificial Intelligence and Natural Language Processing, you will be responsible for developing innovative solutions that integrate various data modalities. This includes text, audio, and visual inputs.You will assist in designing and conducting experiments to test new algorithms, architectures, and fine-tuning...


  • India beBeeSoftware Full time US$ 1,80,000 - US$ 2,50,000

    Unlock the Future of Human-Machine Interaction As a Principal Software Engineer, you will play a pivotal role in shaping the future of human-machine interaction by leading the design and development of advanced infrastructure and tooling for customising multilingual speech models, AOAI systems, and multimodal generative AI.The technologies you will work on...


  • India beBeeArtificial Full time ₹ 5,00,000 - ₹ 8,00,000

    Job DescriptionWe are seeking a highly skilled Research Engineer to join our team in developing cutting-edge AI solutions that integrate various data modalities and enhance interactive systems.The ideal candidate will be passionate about AI and Natural Language Processing (NLP) and have the ability to develop and deploy machine learning models in practical...


  • India Narayana Nethralaya Foundation Full time US$ 90,000 - US$ 1,20,000 per year

    Our team is dedicated to transforming patient outcomes by building advanced diagnostic tools and decision-support systems using cutting-edge machine learning and deep learning technologies. Our team collaborates with top-tier medical institutions and researchers to develop robust solutions for medical imaging, including radiology, pathology, and multimodal...


  • India Workana Full time

    About this project We are seeking a skilled ai developer to build an advanced multimodal ai assistant. This assistant should be capable of interacting through voice, gesture, image, and text inputs, providing a versatile user experience. The core functionality will involve integrating various technologies to process these different input modalities and...

  • Research Analyst

    1 week ago


    India Scry AI Full time ₹ 9,00,000 - ₹ 12,00,000 per year

    Position: Research Analyst (Subject Matter Expert – BFSI)Location: India (Remote)Employment Type: Full-TimeSchedule: Monday to Friday, Day ShiftExperience: 5 YearsCompany DescriptionScry AI is a leading innovator in AI-powered financial intelligence platforms tailored for Banking, Financial Services, and Insurance (BFSI) organizations. Our Collatio...


  • India Level AI Full time

    As a Computer Vision Engineering Intern, you will be responsible for the Neural network workload analysis and modeling AI accelerators and testing. **Roles and Responsibilities**: - As a Computer Vision Engineer, you’ll be working on several areas like Visual Recognition, Segmentation, Feature Extraction/Representation Learning. - You’ll be responsible...