Advanced Speech Recognition Developer for Kaizen Voiz

5 days ago


India Kaizen Voiz Full time
Job Description

We are seeking a highly skilled Advanced Speech Recognition Developer to join Kaizen Voiz. The ideal candidate will be at the forefront of developing and deploying advanced speech recognition systems integrated with NLP, NLU, Generative AI, and LLMs to create innovative voice-driven solutions.

About the Role

  • ASR System Development: Design and optimize state-of-the-art Automatic Speech Recognition (ASR) systems using frameworks like Kaldi, Whisper, NeMo, or wav2vec.
  • Language Model Fine-Tuning: Fine-tune and deploy Large Language Models (LLMs) such as Open AI's GPT, BERT, or T5 for speech-to-text and text generation applications.
  • NLP/NLU Integration: Develop pipelines for understanding natural language inputs, enhancing accuracy for tasks like intent recognition, entity extraction, and sentiment analysis.
  • Generative AI Applications: Implement Generative AI techniques for conversational AI systems, voice assistants, and real-time transcription services.
  • End-to-End Pipelines: Build and optimize scalable and real-time speech-to-text pipelines using tools like TensorFlow, PyTorch, or ONNX.
  • Acoustic Modeling: Work on advanced acoustic modeling techniques, including self-supervised learning for low-resource languages.
  • Cross-Functional Collaboration: Collaborate with data scientists, linguists, and product teams to deliver high-quality, user-centric solutions.
  • Performance Optimization: Continuously enhance system accuracy and latency while maintaining privacy and robustness.
  • Cloud Deployment: Deploy speech recognition systems on cloud platforms like AWS, Azure, or Google Cloud using services such as AWS Transcribe or Azure Cognitive Services.

Requirements

  • Strong programming expertise in Python, C++, or Java.
  • Proficiency in ML/DL frameworks such as TensorFlow, PyTorch, or Scikit-learn.
  • Hands-on experience with ASR tools like Kaldi, Whisper, or wav2vec 2.0.
  • In-depth knowledge of NLP/NLU frameworks: Hugging Face Transformers, SpaCy, or NLTK.
  • Familiarity with Generative AI technologies and tools like LangChain, Haystack, or Prompt Engineering for LLMs.
  • Experience in training and fine-tuning deep learning models for both speech and text processing.
  • Strong knowledge of signal processing, acoustic features, and feature extraction techniques (e.g., MFCC, PLP, spectrograms).
  • Understanding of real-time streaming technologies and APIs (e.g., Flask, WebSockets).

Preferred Qualifications

  • MS or PhD in Computer Science, AI, ML, or related fields.
  • Experience working on low-resource language ASR systems.
  • Familiarity with privacy-preserving AI techniques.
  • Proven track record of deploying production-ready models in cloud environments.

Compensation

The estimated annual salary for this position is around $150,000 - $200,000 based on industry standards and location. Additionally, we offer a comprehensive benefits package, including medical, dental, and vision insurance, 401(k) matching, and generous paid time off.



  • india Kaizen Voiz Full time

    Speech Recognition Engineer Job Description We are seeking a highly skilled Speech Recognition Engineer to join our team. The ideal candidate will be at the forefront of developing and deploying advanced speech recognition systems integrated with NLP, NLU, Generative AI, and LLMs to create innovative voice-driven solutions. Key Responsibilities: ASR...


  • india Kaizen Voiz Full time

    Speech Recognition EngineerJob DescriptionWe are seeking a highly skilled Speech Recognition Engineer to join our team. The ideal candidate will be at the forefront of developing and deploying advanced speech recognition systems integrated with NLP, NLU, Generative AI, and LLMs to create innovative voice-driven solutions.Key Responsibilities:ASR Development:...


  • india Kaizen Voiz Full time

    Speech Recognition Engineer Job Description We are seeking a highly skilled Speech Recognition Engineer to join our team. The ideal candidate will be at the forefront of developing and deploying advanced speech recognition systems integrated with NLP, NLU, Generative AI, and LLMs to create innovative voice-driven solutions. Key Responsibilities: ASR...


  • india Kaizen Voiz Full time

    The ideal candidate will be a creative and analytical thinker. They will be able to conduct insightful market research to establish a marketing strategy that will effectively reach the target audience. They should be comfortable evaluating the marketing process, and work to critique and improve its outcomes.    Responsibilities 1. Strong fundamentals in...


  • india Kaizen Voiz Full time

    The ideal candidate will be a creative and analytical thinker. They will be able to conduct insightful market research to establish a marketing strategy that will effectively reach the target audience. They should be comfortable evaluating the marketing process, and work to critique and improve its outcomes.  Responsibilities1. Strong fundamentals in...


  • India Kaizen Voiz Full time

    Speech Recognition Engineer Job Description We are seeking a highly skilled Speech Recognition Engineer to join our team. The ideal candidate will be at the forefront of developing and deploying advanced speech recognition systems integrated with NLP, NLU, Generative AI, and LLMs to create innovative voice-driven solutions. Key Responsibilities: ...


  • india Sony Research India Full time

    Sony Research India is seeking a dynamic and motivated Speech Recognition Intern to join our innovative research team. As a Speech Recognition Intern, you will have the opportunity to work on cutting-edge projects in the field of speech recognition technologies. This internship is designed for individuals passionate about advancing their skills and knowledge...


  • india Sony Research India Full time

    Sony Research India is seeking a dynamic and motivated Speech Recognition Intern to join our innovative research team. As a Speech Recognition Intern, you will have the opportunity to work on cutting-edge projects in the field of speech recognition technologies. This internship is designed for individuals passionate about advancing their skills and knowledge...


  • India gnani.ai Full time

    We are seeking a talented Speech Analytics Engineer with 4+ years of experience to join our Conversational AI team. You will play a pivotal role in enhancing our AI-powered conversational systems by analyzing vast amounts of voice and text data. Your expertise in speech recognition, natural language processing, and machine learning will be instrumental in...


  • india gnani.ai Full time

    We are seeking a talented Speech Analytics Engineer with 4+ years of experience to join our Conversational AI team. You will play a pivotal role in enhancing our AI-powered conversational systems by analyzing vast amounts of voice and text data. Your expertise in speech recognition, natural language processing, and machine learning will be instrumental in...


  • india Murf AI Full time

    Revolutionize Speech Technology with Murf.AIThe Speech team at Murf.AI is seeking a passionate Research Scientist Intern to join us in building the future of voice creation. We empower users to craft high-quality voiceovers quickly and effortlessly, shattering communication barriers worldwide.Make Your Mark:Conduct cutting-edge research in machine learning...


  • india Kaizen Empire Full time

    About Us Kaizen Empire is a dynamic and rapidly expanding eCommerce company headquartered in the United States. We specialize in designing, manufacturing, and selling an extensive range of children’s toys on Amazon US and through our own online store. With a diverse portfolio of over 100 products, we have successfully achieved a revenue of over $80...


  • India Kaizen Adventours Full time

    About UsKaizen Adventours in Gurugram is a leading travel company dedicated to providing enriching experiences by facilitating hassle-free bookings and planning. We believe in the transformative power of travel, building relationships, and gaining life-changing experiences. Our goal is to make travel a way of life for our customers.Job OverviewThis is a...


  • India Ridgehead Software Full time

    Company Overview: Ridgehead Software is a leading company in ssoftware development and integration for the BPO and call center community,and is committed to leveraging cutting-edge technology to enhance our operations and customer experience. We are currently seeking an experienced Offshore Development Resource for Speech Analytics with expertise in the...


  • India Ridgehead Software Full time

    Company Overview: Ridgehead Software is a leading company in ssoftware development and integration for the BPO and call center community,and is  committed to leveraging cutting-edge technology to enhance our operations and customer experience. We are currently seeking an experienced Offshore Development Resource for Speech Analytics with expertise in...


  • India Ridgehead Software Full time

    Company Overview: Ridgehead Software is a leading company in software development and integration for the BPO and call center community. We are committed to leveraging cutting-edge technology to enhance our operations and customer experience.Job Summary: We are seeking an experienced Senior Speech Analytics Solutions Developer with expertise in the Nexidia...


  • india Murf AI Full time

    Revolutionize Speech Technology with Murf.AI The Speech team at Murf.AI is seeking a passionate Research Scientist Intern to join us in building the future of voice creation. We empower users to craft high-quality voiceovers quickly and effortlessly, shattering communication barriers worldwide. Make Your Mark: Conduct cutting-edge research in machine...


  • India Shaip Full time € 30,000

    Job Description: Are you fluent in Tibetan and looking for a flexible, remote work opportunity? Shaip is seeking freelancers to contribute to our Tibetan Speech Collection Project, aimed at enhancing AI and voice recognition technologies. By participating, you'll help create accurate and diverse speech datasets essential for advancing these...


  • India Teach and Learn Child Development and Rehab Center Full time

    Company Description Teach and Learn Child Development and Rehab Center in Hyderabad is dedicated to empowering individuals to reach their full potential. We offer a range of services including Speech Therapy, Occupational Therapy, Behavioral Therapy, School Readiness, and Psychology. Role Description This is a full-time on-site role for a Speech...


  • india Teach and Learn Child Development and Rehab Center Full time

    Company Description Teach and Learn Child Development and Rehab Center in Hyderabad is dedicated to empowering individuals to reach their full potential. We offer a range of services including Speech Therapy, Occupational Therapy, Behavioral Therapy, School Readiness, and Psychology. Role Description This is a full-time on-site role for a Speech Language...