Lead Research Scientist, Speech and Audio Foundation Models

2 weeks ago


Delhi, Delhi, India Krutrim Full time
Lead Research Scientist, Speech and Audio Foundation Models

Location:

Bangalore (India), Singapore and Palo Alto (CA, US)
Type of Job:

Full-time

About Krutrim:
is building AI computing for the future. Our envisioned AI computing stack encompasses the AI computing infrastructure, AI Cloud, multilingual and multimodal foundational models, and AI-powered end applications. We are India's first AI unicorn and built the first foundation model from the country.
Our AI stack is empowering consumers, startups, enterprises and scientists across India and the world to build their end AI applications or AI models. While we are building foundational models across text, voice, and vision relevant to our focus markets, we are also developing AI training and inference platforms that enable AI research and development across industry domains. The platforms being built by Krutrim have the potential to impact millions of lives in India, across income and education strata, and across languages.
The team at Krutrim represents a convergence of talent across AI research, Applied AI, Cloud Engineering, and semiconductor design. Our teams operate from three locations: Bangalore, Singapore & San Francisco.
Job Description:
We are seeking a highly skilled and experienced Senior Research Lead for Speech, Audio, and Conversational AI to join our innovative team. In this role, you will spearhead the research and development of cutting-edge technologies in speech processing, text-to-speech (TTS), audio analysis, and real-time conversational AI. You will push the boundaries of what's possible in automatic speech recognition (ASR), speaker identification, diarization, speech synthesis, and audio generation. Working closely with a team of talented engineers and researchers, you'll design, implement, and optimize state-of-the-art systems that contribute to creating more natural, human-like, and high-quality speech and audio solutions for a variety of applications.
Key Responsibilities:
Bring the state of the art in Audio/Speech and Large Language Models to develop advanced Audio Language Models and Speech Language Models.
Research, architect, and deploy new generative AI methods such as autoregressive models, causal models, and diffusion models
Design and implement low-latency end-to-end models with multilingual speech/audio as both input and output.
Conduct experiments to evaluate and improve the performance of these models, focusing on accuracy, naturalness, efficiency, and real-time capabilities across multiple languages.
Stay at the forefront of advancements in speech processing, audio analysis, and large language models, integrating new techniques into our foundation models.
Collaborate with cross-functional teams to integrate these foundation models into Krutrim's AI stack and products.
Publish research findings in top-tier conferences and journals such as INTERSPEECH, ICASSP, ICLR, ICML, NeurIPS, and IEEE/ACM Transactions on Audio, Speech, and Language Processing.
Mentor and guide junior researchers and engineers, fostering a collaborative and innovative team environment.
Drive the adoption of best practices in model development, including rigorous testing, documentation, and ethical considerations in multilingual AI.
Qualifications:
Ph.D. with 5+ years or MS with 8+ years of experience in Computer Science, Electrical Engineering, or a related field with a focus on speech processing, audio analysis, and machine learning.
Train or finetune speech / audio models for representation (like, W2V-BERT, SONAR, AST), generation (like, Hi-Fi GAN, VQ-GAN, AudioLDM), Conformers, multilingual multitask models (like, SeamlessM4T).
Expertise with Audio Language Models like AudioPALM, Moshi and Seamless M4T
Proven track record of developing and applying novel neural network architectures such as Transformers, Mixture of Experts, Diffusion Models, and State Space Machines (MAMBA, SAMBA).
Extensive experience in developing and optimizing models for low-latency, real-time applications.
Strong background in multilingual speech recognition and synthesis, with an understanding of the challenges specific to different language families.
Proficiency in deep learning frameworks (e.g., TensorFlow, PyTorch) and experience deploying large-scale speech and audio models.
Demonstrated expertise in high-performance computing with proficiency in Python, C/C++, CUDA, and kernel-level programming for AI applications.
Experience with audio signal processing techniques and their application in end-to-end neural models.
Strong track record of publications in top AI conferences and journals, particularly in the areas of speech, audio, and language models.
Excellent communication skills, with the ability to explain complex technical concepts to both technical and non-technical audiences.
Passion for pushing the boundaries of what's possible in speech and audio AI, with a focus on practical, real-world applications.

Join Krutrim to shape the future of AI and make a significant impact on 100s of millions of lives across India and the world. If you're passionate about pushing the boundaries of AI and want to work with a team at the forefront of innovation, we want to hear from you


  • Delhi, Delhi, India Krutrim Full time

    Lead Generative AI Engineer / Scientist - Large-Scale AI ModelsLocation:Bangalore (India)Type of Job:Full-timeAbout Krutrim:is building AI computing for the future. Our envisioned AI computing stack encompasses the AI computing infrastructure, AI Cloud, multilingual and multimodal foundational models, and AI-powered end applications. We are India's first AI...


  • Delhi, Delhi, India Krutrim Full time

    Senior Distributed Training Research Engineer (Frontier LLMs)Location:Bangalore (India)Type of Job:Full-timeAbout Krutrim:is building AI computing for the future. Our envisioned AI computing stack encompasses the AI computing infrastructure, AI Cloud, multilingual and multimodal foundational models, and AI-powered end applications. We are India's first AI...


  • Delhi, Delhi, India Krutrim Full time

    Multimodal and Vision AI Research Engineer / ScientistLocation:Bangalore (India)Type of Job:Full-timeAbout Krutrim:is building AI computing for the future. Our envisioned AI computing stack encompasses AI infrastructure, AI Cloud, multilingual and multimodal foundational models, and AI-powered applications. As India's first AI unicorn, we built the country's...


  • Delhi, Delhi, India GLOWTOUCH TECHNOLOGIES PVT. LTD. Full time

    Title of the Position : Data Scientist - Machine LearningDesignation : Senior Data ScientistWork Location : RemoteBusiness Unit : Engineering (Lightbird) Shifts : General (candidate should be flexible) Who We Are : unifyCX is an emerging Global Business Process Outsourcing company with a strong presence in the U.S., Colombia, Dominican Republic, India,...


  • Delhi, Delhi, India GLOWTOUCH TECHNOLOGIES PVT. LTD. Full time

    Title of the Position : Data Scientist - Machine LearningDesignation : Senior Data ScientistWork Location : RemoteBusiness Unit : Engineering (Lightbird) Shifts : General (candidate should be flexible) Who We Are : unifyCX is an emerging Global Business Process Outsourcing company with a strong presence in the U.S., Colombia, Dominican Republic, India,...

  • AI Research Scientist

    2 weeks ago


    Delhi, Delhi, India ShoppinPal Full time

    About Shoppin' :Shoppin' is an AI - powered visual fashion search engine - if Google's search exhaustiveness and Pinterest's social DNA were to have a baby, it'd be us. Today, gen-z shopping is super trend and intent-led, where they know exactly what they want to look for. our multi modal search engine allows you to discover fashion with personalised...

  • AI Research Scientist

    3 weeks ago


    Delhi, Delhi, India ShoppinPal Full time

    About Shoppin' :Shoppin' is an AI - powered visual fashion search engine - if Google's search exhaustiveness and Pinterest's social DNA were to have a baby, it'd be us. Today, gen-z shopping is super trend and intent-led, where they know exactly what they want to look for. our multi modal search engine allows you to discover fashion with personalised...


  • Delhi, Delhi, India NicheHR LLP Full time

    Senior Software Engineer Audio DSP JOB DESCRIPTIONWe are Looking for a Senior Software Engineer Audio DSP for one of clients the role is completely remote Title Senior Software Engineer Audio DSP Industry ITExperience 10 years of experience Job Profile As a Senior Software Engineer specializing in Audio and Digital Signal Processing for Voice AI...


  • Delhi, Delhi, India Krutrim Full time

    Principal Research Scientist, AI Alignment (Reinforcement Learning, Red Teaming, Explainability)Location:Bangalore (India)About Us:is building AI computing for the future. Our envisioned AI computing stack encompasses the AI computing infrastructure, AI Cloud, multilingual and multimodal foundational models, and AI-powered end applications. We are India's...


  • Delhi, Delhi, India Futures And Careers Full time

    Hi All..Hiring for Advanced Technical Trainer for our Client.Experience: 10 years to 20 yearsJob Title: Advanced Technical Trainer – Microsoft AIJob Summary: We are seeking a passionate and highly skilled Advanced Technical Trainer to create and deliver training programs for Microsoft AI technologies. The ideal candidate will possess a deep understanding...

  • Data Scientist

    2 weeks ago


    Delhi, Delhi, India Pearlcon Technologies Full time

    Role: Data Scientist / Machine Learning EngineerSalary Range: 14 LPAExperience Required: 5+ YearsNotice Period: Immediate to 15 daysMode of Work-RemotePreferred Qualifications- Hands-on experience with ML Ops tools (MLflow, Kubeflow). - Familiarity with vector search (FAISS, Milvus) for retrieval-based ML models. - Experience with audio, video, or text...

  • Data Scientist

    2 weeks ago


    Delhi, Delhi, India SMARTWORK IT SERVICES Full time

    Job Title: Data Scientist. Location: Remote. Notice Period: Immediate. Experience Required: 5-10 Years. Job Description. About the Role. We are seeking a Data Scientist to contribute to the development of advanced AI-driven applications. The role involves designing, building, and optimizing machine learning models for personalized recommendations, content...

  • Data Scientist

    1 week ago


    Delhi, Delhi, India SMARTWORK IT SERVICES Full time

    Job Title: Data Scientist. Location: Remote. Notice Period: Immediate. Experience Required: 5-10 Years. Job Description. About the Role. We are seeking a Data Scientist to contribute to the development of advanced AI-driven applications. The role involves designing, building, and optimizing machine learning models for personalized recommendations, content...


  • Delhi, Delhi, India SaveLIFE Foundation Full time

    Job Description :Are you passionate about making a positive impact on road safety? Do you thrive in a collaborative team environment built on mutual respect and integrity?Are you an experienced professional with a strong drive to research and contribute to enhancing safety across the country? If so, we have an exciting opportunity for you to join our team at...


  • Delhi, Delhi, India SaveLIFE Foundation Full time

    Job Description :Are you passionate about making a positive impact on road safety? Do you thrive in a collaborative team environment built on mutual respect and integrity?Are you an experienced professional with a strong drive to research and contribute to enhancing safety across the country? If so, we have an exciting opportunity for you to join our team at...


  • Delhi, Delhi, India Qrata Full time

    Position SummaryWe are seeking a talented AI Research Scientist with expertise in deep learning to join our dynamic team. The ideal candidate will have a strong background in artificial intelligence, machine learning, and deep learning, with a focus on developing, deploying, and optimizing generative models. You will work closely with our cross-functional...

  • Data Scientist

    2 weeks ago


    Delhi, Delhi, India Pearlcon Technologies Full time

    Role: Data Scientist / Machine Learning EngineerSalary Range: 14 LPAExperience Required: 5+ YearsNotice Period: Immediate to 15 daysMode of Work-RemotePreferred Qualification sHands-on experience with ML Ops tools (MLflow, Kubeflow ).Familiarity with vector search (FAISS, Milvus ) for retrieval-based ML models.Experience with audio, video, or text...


  • Delhi, Delhi, India PharmSight Research and Analytics Full time

    About PharmSight Research and AnalyticsPharmSight Research and Analytics is a leading innovator in bio-pharma analytics, providing cutting-edge AI-powered solutions that transform product research, market intelligence, and healthcare decision-making. Our mission is to improve patient outcomes and drive advancements in the pharmaceutical industry through the...


  • Delhi, Delhi, India NBI Biosciences Full time

    We are seeking a Lead Scientist in Drug Development to join our team at NBI Biosciences. As a key member of our research and development team, you will play a critical role in advancing our pharmaceutical science programs.The ideal candidate will have a PhD degree in a relevant field, such as Biotechnology, Biochemistry, or Chemistry, and at least 8-12 years...


  • Delhi, Delhi, India The Blessed Ones Full time

    Company DescriptionThe Blessed Ones is a team of professionals dedicated to helping kids with special needs lead happier and more independent lives. Our mission is to create a future where these children feel safe, secure, and included. Located in Pune, we are a group of doctors, therapists, and educators who are also proud parents of children with special...