Principal Machine Learning Engineer

4 weeks ago

Bangalore Division, India Mulya Technologies Full time

Principal Machine Learning Engineer - Multimodal AI & Inference Bangalore Founded in 2023,by Industry veterans HQ in California,US We are revolutionizing sustainable AI compute through intuitive software with composable silicon Overview: You will design, optimize, and deploy large multimodal models (language, vision, audio, video) to run efficiently on a compact, high-performance AI appliance capable of supporting 100B+ parameter models at real-time speeds. Your mission is to deliver state-of-the-art multimodal inference locally through advanced model optimization, quantization, and system-level integration. Key Responsibilities: 1. Model Integration & Porting Optimize large-scale foundation models (e.g., Llama, gpt-oss, Whisper, HiDream, Qwen, Wan etc) for on-device inference. Adapt pre-trained models for multimodal tasks (text, image, audio, video, or cross-modal reasoning). Ensure seamless interoperability between modalities — e.g., enabling the system to “see, hear, and talk” naturally. 2. Model Optimization for Edge Hardware Quantize and compress large models (4-bit or mixed precision) while maintaining high accuracy and low latency. Implement and benchmark inference runtimes using frameworks like Llama.cpp, Ollama, vLLM, ONNX etc. Collaborate with hardware engineers to co-design model architectures optimized for the appliance’s compute fabric. 3. Inference Pipeline Development Build and maintain scalable, high-throughput inference pipelines capable of handling concurrent multimodal requests (text, audio, image, video). Implement token streaming, caching, and scheduling strategies for real-time responses. Develop APIs for low-latency local inference accessible via a web interface. 4. Evaluation & Benchmarking Profile and benchmark performance (throughput, latency, energy efficiency) of deployed models. Run regression tests to validate numerical accuracy after quantization or pruning. Define KPIs for multimodal model performance under real-world usage. 5. Research & Prototyping Investigate emerging multimodal architectures and lightweight model variants for local deployment. Prototype hybrid models that combine LLMs, diffusion models, and ASR/TTS pipelines for advanced multimodal applications. Stay current on state-of-the-art inference frameworks, compression techniques, and multimodal learning trends. Required Qualifications: Strong background in deep learning and model deployment, with hands-on experience in PyTorch and/or TensorFlow. Expertise in model optimization — quantization, pruning, distillation, or mixed-precision inference. Practical knowledge of inference engines (vLLM, llama.cpp, ONNX Runtime or similar). Experience deploying large models locally or on edge devices with limited memory/compute constraints. Familiarity with multimodal model architectures — e.g., CLIP, Flamingo, LLaVA, or AudioGPT-style systems. Strong software engineering skills (Python, C++, CUDA) and experience integrating models into production systems. Understanding of GPU/accelerator utilization, memory bandwidth optimization, and distributed inference. Preferred Qualifications: experience-10+ years Experience with model-parallel or tensor-parallel inference at scale. Contributions to open-source inference frameworks or model serving systems. Familiarity with hardware-aware training or co-optimization of neural networks and hardware. Background in speech, vision, or multimodal ML research. Track record of deploying models that run entirely offline or on embedded/edge systems. Contact: Uday Mulya Technologies muday_bhaskar@yahoo.com "Mining The Knowledge Community"

Machine Learning Engineer

7 days ago

bangalore, India Machine Learning Plus Full time

Key responsibilities:1. Design, develop, and implement machine learning/python models and algorithms for production systems.2. Build and maintain scalable ML training and inference pipelines.3. Preprocess and analyze large datasets, including feature engineering and data validation.4. Deploy machine learning models into production environments and ensure...
Machine Learning Engineer

6 days ago

bangalore, India Machine Learning Plus Full time

Key responsibilities: 1. Design, develop, and implement machine learning/python models and algorithms for production systems. 2. Build and maintain scalable ML training and inference pipelines. 3. Preprocess and analyze large datasets, including feature engineering and data validation. 4. Deploy machine learning models into production environments and ensure...
Machine Learning Engineer

2 weeks ago

bangalore, India Adastra Full time

Job Description: Machine Learning Ops Engineer Job Summary We are seeking a highly experienced Principal MLOps Engineer with 5–10 years of industry experience to lead the design, deployment, and optimization of machine learning infrastructure. This role requires deep expertise in Kubernetes (K8s), cloud-native technologies, and scalable ML systems. The...
Principal Machine Learning Engineer

5 days ago

bangalore, India Mulya Technologies Full time

Principal Machine Learning Engineer - Multimodal AI & InferenceBangaloreFounded in 2023,by Industry veterans HQ in California,USWe are revolutionizing sustainable AI compute through intuitive software with composable silicon Overview:You will design, optimize, and deploy large multimodal models (language, vision, audio, video) to run efficiently on a...
Machine Learning Associate

1 week ago

bangalore, India Machine Learning Plus Full time

Key responsibilities:1. Build, test, and optimize high-performance Python systems used in large-scale AI and data pipelines.2. Design and maintain modular, clean, and production-ready Python codebases.3. Collaborate with cross-functional teams (data, infra, and research engineers) to deliver reliable backend components.4. Write efficient, maintainable, and...
Machine Learning Associate

7 days ago

bangalore, India Machine Learning Plus Full time

Key responsibilities: 1. Build, test, and optimize high-performance Python systems used in large-scale AI and data pipelines. 2. Design and maintain modular, clean, and production-ready Python codebases. 3. Collaborate with cross-functional teams (data, infra, and research engineers) to deliver reliable backend components. 4. Write efficient, maintainable,...
Machine Learning Engineer

2 days ago

bangalore, India Spydra Full time

Job Summary: We are seeking a talented and motivated Machine Learning Engineer to join our team. The ideal candidate will have a strong background in machine learning algorithms, data analysis, and software development. You will be responsible for designing, developing, and deploying machine learning models and systems that drive our products and...
Machine Learning Engineer

3 days ago

bangalore, India Spydra Full time

Job Summary: We are seeking a talented and motivated Machine Learning Engineer to join our team. The ideal candidate will have a strong background in machine learning algorithms, data analysis, and software development. You will be responsible for designing, developing, and deploying machine learning models and systems that drive our products and services....
Senior Machine Learning Engineer

3 weeks ago

Bangalore Division, India NAZZTEC Full time

🚀 We’re Hiring: Senior ML Engineer (Gen AI, Machine Learning) 📌 Mandatory Skills: Graph ML, Agentic AI 🌍 Location: Remote 🧠 Domain: Generative AI | Machine Learning | Distributed Systems 🔹 Overview Are you energized by the idea of innovating with Generative AI ? Do you want to build next-gen AI-driven products with real-world global impact ?...
Machine Learning Engineer

7 days ago

bangalore, India Capgemini Full time

AIML Engineer Your Role Must have experience with Machine Learning Model Development Expert Level Proficiency in Data Handling (SQL) Hands-on with Model Engineering and Improvement Strong experience in Model Deployment and Productionlization Your Profile 5-14 years of experience in developing and implementing machine learning, Deep Learning, NLP models...

Americas

Europe

Asia / Oceania

Africa

Principal Machine Learning Engineer