Senior Audio ML Engineer

13 hours ago


Chennai, Tamil Nadu, India a1dae599-2f6a-4efa-8047-b08ceea3ac8f Full time ₹ 15,00,000 - ₹ 25,00,000 per year

ROLE OVERVIEW

We are seeking a highly skilled Senior Audio ML Engineer who can develop, optimize, and deploy advanced speech processing models across distributed GPU clusters. You will drive audio ML initiatives, architect scalable training pipelines for speech generation and recognition systems, and ensure production-ready deployment of state-of-the-art audio models including TTS, ASR, voice cloning, and speech translation systems.

KEY ROLES

  • Design, develop, and enhance speech processing models including TTS, ASR, Speaker Diarization, Source Separation, and Speech-to-Speech Translation systems for production use cases.
  • Architect and optimize distributed training pipelines across GPU clusters for large-scale speech model training using advanced parallelization strategies.
  • Fine-tune and customize speech foundation models using proprietary audio datasets, advanced training techniques, and comprehensive evaluation frameworks.
  • Develop state-of-the-art voice cloning systems with zero-shot capabilities, emotion control, accent flexibility, pitch variation, and cross-lingual expressivity.
  • Design high-performance inference pipelines for speech models using TensorRT, ONNX, quantization, streaming, and GPU optimization techniques.
  • Ensure all speech models are production-grade—robust, scalable, monitored, and integrated into real-time audio processing systems.
  • Research and evaluate cutting-edge architectures in speech synthesis, recognition, and multimodal audio-visual systems.
  • Collaborate with the audio ML team to drive technical excellence and knowledge sharing across speech processing initiatives.

RESPONSIBLITES

  • Architect end-to-end speech processing systems including distributed training, model serving, real-time inference, and continuous model improvements.
  • Work with infrastructure teams to optimize GPU cluster utilization, implement efficient data loading pipelines, and manage large-scale audio dataset processing.
  • Build comprehensive model evaluation frameworks—WER, MOS scores, speaker similarity metrics, latency benchmarks, and audio quality assessments.
  • Drive experimentation with novel architectures including neural vocoders, diffusion-based TTS, transformer variants, and multimodal speech systems.
  • Collaborate cross-functionally with product, backend, audio engineering, and DevOps teams to deliver end-to-end speech AI features.
  • Implement robust training monitoring, experiment tracking, and model versioning systems for reproducible speech model development.
  • Handle domain-shifted conditions, multilingual datasets, and challenging acoustic environments in production deployments.
  • Contribute to team knowledge sharing through technical documentation, code reviews, and best practices in distributed speech model training.

REQUIRED QUALIFICATIONS

  • 3-5+ years of experience in audio/speech machine learning, deep learning for speech processing, or audio signal processing systems.
  • Proven expertise with state-of-the-art speech frameworks including ASR models (Whisper, Conformer), TTS systems (VITS, FastSpeech, Tacotron), and voice cloning architectures.
  • Hands-on experience with distributed training across GPU clusters using PyTorch DDP, DeepSpeed, FairScale, or similar frameworks.
  • Strong knowledge of audio processing libraries (librosa, torchaudio, SpeechBrain) and speech-specific data pipelines.
  • Expert-level experience with model optimization for speech (TensorRT, ONNX Runtime, quantization) and real-time audio inference systems.
  • Solid understanding of GPU cluster management, CUDA optimization, mixed precision training, and large-scale audio data handling.
  • Experience with Speaker Diarization, Source Separation, Noise Cancellation, and Speech-to-Speech Translation systems.
  • Strong technical communication skills and ability to work collaboratively in cross-functional teams.
  • Master's or PhD in Electrical Engineering, Computer Science, or related field with specialization in Speech Processing or Audio ML.

NOTE - We accept international applicants also.



  • Chennai, Tamil Nadu, India Logitech Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Logitech is the Sweet Spot for people who want their actions to have a positive global impact while having the flexibility to do it in their own way.Role Overview:You will join Logitech's Hardware Audio DSP and ML Product team to develop real-time Audio ML solutions that redefine customer audio experiences. The role requires a strong foundation in Audio...

  • Senior ML Engineer

    2 weeks ago


    Chennai, Tamil Nadu, India Ford Motor Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    DescriptionData Platform Engineering  Team is responsible for Building Data platfrom for enterprise  The ML enablement team has a combination of data engineers, machine learning engineers (MLEs). MLEs would be supposed to implement sophisticated machine learning models that can transform DPE. Responsibilities• Build Deep learning models to understand...

  • Senior AI Engineer

    2 days ago


    Chennai, Tamil Nadu, India Sampoorna Consultants Pvt. Ltd Full time ₹ 8,00,000 - ₹ 24,00,000 per year

    Senior AI Engineer - J49722Bachelors or Masters degree in Computer Science, Engineering, or related field.5-8 years of experience in AI/ML engineering, with at least 2 years focused on GenAI and LLMs.Proven experience deploying agentic AI systems in production environments.Strong understanding of NLP, deep learning, and multi-modal AI (text, image,...

  • Audio DSP engineer

    2 weeks ago


    Chennai, Tamil Nadu, India Logitech Full time ₹ 4,00,000 - ₹ 12,00,000 per year

    Logitech is the Sweet Spot for people who want their actions to have a positive global impact while having the flexibility to do it in their own way.Job DescriptionThe Role:In this role you will be part of the Logitech Hardware Audio DSP and ML team supporting with DSP firmware development and working in the Audio laboratory collecting data and running...

  • Audio DSP engineer

    2 weeks ago


    Chennai, Tamil Nadu, India Logitech Full time ₹ 5,00,000 - ₹ 15,00,000 per year

    Logitech is the Sweet Spot for people who want their actions to have a positive global impact while having the flexibility to do it in their own way.Job DescriptionThe Role:In this role you will be part of the Logitech Hardware Audio DSP and ML team supporting with DSP firmware development and working in the Audio laboratory collecting data and running...

  • AI/ML - Lead

    7 days ago


    Chennai, Tamil Nadu, India ACL Digital Full time ₹ 20,00,000 - ₹ 25,00,000 per year

    Responsibilities100% hands-on Experience is a must in development and deployment of production-level AI models at enterprise scale. (Build Vs Buy Decision Maker)Drive innovation in AI/ML applications across various business domains and modalities (vision, language, audio).Knowledge of Best practices in AI/ML, MLOps, DevOps, and CI/CD for AI/ML...


  • Chennai, Tamil Nadu, India Ducima Analytics Full time ₹ 15,00,000 - ₹ 25,00,000 per year

    Hi allImmediate Hiring Role - AI/ML Engineer (Expert in Gen AI)Experience - 1 to 2 years and 4 to 7yearsMandatory - Excellent English Communication SkillsWork Location - Chennai- Work from Office (Candidates from Chennai are Preferred)Connect me at -Share your CV at Call to check your...


  • Chennai, Tamil Nadu, India Logitech Full time ₹ 6,00,000 - ₹ 18,00,000 per year

    Logitech is the Sweet Spot for people who want their actions to have a positive global impact while having the flexibility to do it in their own way.About Us:At Logitech, we are committed to delivering exceptional audio experiences through innovative products. We are seeking a dedicated and skilled Audio Quality Assurance Engineer to join our development...

  • Android Audio DSP SE

    20 hours ago


    Chennai, Tamil Nadu, India Aptiv Full time ₹ 10,00,000 - ₹ 25,00,000 per year

    Help shape the future of mobility.Imagine a world with zero vehicle accidents, zero vehicle emissions, and wireless vehicle connectivity all around us. Every day, we move closer to making that world a reality. Aptiv's passionate team of engineers and developers creates advanced safety systems, high-performance electrification solutions and data connectivity...


  • Chennai, Tamil Nadu, India Logitech Full time ₹ 6,00,000 - ₹ 18,00,000 per year

    Logitech is the Sweet Spot for people who want their actions to have a positive global impact while having the flexibility to do it in their own way.About Us: At Logitech, we are committed to delivering exceptional audio experiences through innovative products. We are seeking a dedicated and skilled Audio Quality Assurance Engineer to join our development...