Speech Recognition Intern

3 weeks ago


Mount Abu, India Sony Research India Full time

Sony Research India is seeking a dynamic and motivated Speech Recognition Intern to join our innovative research team. As a Speech Recognition Intern, you will have the opportunity to work on cutting-edge projects in the field of speech recognition technologies. This internship is designed for individuals passionate about advancing their skills and knowledge in speech recognition, speech activity detection, speaker diarization, machine learning, and artificial intelligence.


Key Responsibilities:

  • Research and Development: Collaborate with our research team to design, implement, and evaluate state-of-the-art speech recognition algorithms and speaker diarization algorithms including models.
  • Algorithm Optimization: Work on optimizing existing speech recognition algorithms and speaker diarization algorithms for enhanced accuracy, speed, and efficiency.
  • Stay Current: Stay updated of the latest developments in the field of speech recognition, speaker diarization and contribute insights to enhance the teams knowledge base.


Work Location:

  • Remote


Duration of the paid Internship:

  • This paid internship will be for a period of 6 months starting January first week of 2025
  • 9:00 to 18:00 (Monday to Friday)


Qualifications:

Currently pursuing/completed Masters in (Research) or Ph.D. in deep learning/machine learning with hands-on experience on Transformer models with an applications audio/speech.


Must Have Skills:

  • Strong programming skills in Python, shell scripting, PERL
  • Hands-on deep learning, machine learning (Pytorch, Tensorflow)
  • Sound knowledge on speech technologies


Good to have skills

  • Expertise in Pytorch
  • Prior experience in development of Indian Languages ASR ( Automatic Speech Recognition) and speaker diarization.

  • CUX Designer

    2 weeks ago


    Mount Abu, India Skit.ai Full time

    About usSkit.ai is the leading conversational Voice AI platform in the accounts and receivables (ARM) industry, enabling collection agencies to streamline and accelerate revenue recovery. Skit.ai's Compliant, Configurable, and Easy-to-deploy Conversational Voice AI platform is enabling enterprises to automate nearly one million consumer conversations...