LLM Researcher

2 weeks ago


malappuram, India Attentions AI Full time

Greetings From Attentions.ai


Job Description for LLM Researcher (Audio)


Position Title: LLM Researcher (Audio)

Location: Pune, India

Department: Research and Development

Employment Type: Full-time


Company Overview:

Attentions is a cutting-edge generative AI company dedicated to developing innovative products across all modalities, including text, vision, audio, and more. Our mission is to continuously push the boundaries of AI through ongoing research and development, ensuring we deliver market-leading solutions that transform digital interactions. We pride ourselves on fostering a collaborative and forward-thinking environment, committed to ethical AI practices and excellence in everything we do.


Role Summary:


We are seeking a highly skilled Research Scientist to join our team in developing and enhancing the capabilities of Whisper, an open-source speech recognition model. This role focuses on dual channel translation and transcription, dairization, language detection, voice sentiment analysis, and the development of comprehensive quality metrics. The ideal candidate will contribute to both theoretical advancements and practical implementations in these areas.


Responsibilities:


  • Conduct innovative research and develop algorithms for speech recognition, translation, and transcription using the Whisper model.
  • Enhance capabilities for dual-channel audio processing, improving accuracy and efficiency.
  • Develop and refine techniques for speaker dairization to distinguish and manage multiple speakers within audio streams.
  • Implement and optimize algorithms for automatic language detection to enhance model versatility across different languages.
  • Research and apply methods for analyzing voice sentiment, aiming to extract emotional states from speech.
  • Design and implement rigorous quality metrics to evaluate the performance and reliability of the model in various scenarios.
  • Collaborate with cross-functional teams to integrate these technologies into broader applications and products.
  • Publish research findings in top-tier journals and conferences and contribute to the academic and open-source communities.


Qualifications:


  • B.Tech in Computer Science, Electrical Engineering, Computational Linguistics, or a related field.
  • Proven experience in machine learning and natural language processing, particularly in speech recognition and audio analysis.
  • Strong programming skills in Python, and familiarity with machine learning frameworks such as TensorFlow or PyTorch.
  • Experience with audio signal processing and familiarity with tools like Librosa or SoX.
  • Knowledge of advanced statistical techniques and deep learning methodologies applicable to speech and audio processing.
  • Excellent problem-solving skills, ability to conduct independent research, and readiness to challenge the status quo.
  • Strong communication skills, both verbal and written, with the ability to present complex technical details to a non-technical audience.


Preferred Skills:


  • Previous experience working with the Whisper model or other similar speech recognition technologies.
  • Contributions to open-source projects or a strong publication record in relevant fields.
  • Experience in handling dual-channel audio data and real-time speech processing systems.


  • Lead Data Scientist

    6 days ago


    Malappuram, India Turing Full time

    Data Science Lead/ManagerAbout Turing:We are a pioneering organization at the forefront of the AI and Machine Learning industry, specializing in training Large Language Models (LLMs) through sophisticated Software Engineering, Data Science, and Machine Learning techniques. Our work encompasses a range of technical tasks aimed at enhancing the capabilities of...


  • Malappuram, India Supereps.ai Full time

    Supereps.ai is an innovative retail intelligence platform, dedicated to transforming the retail industry. Our advanced AI-driven solution captures and analyses in-store conversations, providing actionable insights to help retailers maximise their revenue potential. By leveraging state-of-the-art speech recognition and language models, we empower retail...