Principal Machine Learning Engineer
4 days ago
Principal Machine Learning Engineer - Multimodal AI & InferenceBangaloreFounded in 2023,by Industry veterans HQ in California,US- We are revolutionizing sustainable AI compute through intuitive software with composable siliconOverview:You will design, optimize, and deploy large multimodal models (language, vision, audio, video) to run efficiently on a compact, high-performance AI appliance capable of supporting 100B+ parameter models at real-time speeds. Your mission is to deliver state-of-the-art multimodal inference locally through advanced model optimization, quantization, and system-level integration.Key Responsibilities:1. Model Integration & Porting- Optimize large-scale foundation models (e.g., Llama, gpt-oss, Whisper, HiDream, Qwen, Wan etc) for on-device inference.- Adapt pre-trained models for multimodal tasks (text, image, audio, video, or cross-modal reasoning).- Ensure seamless interoperability between modalities — e.g., enabling the system to "see, hear, and talk" naturally.2. Model Optimization for Edge Hardware- Quantize and compress large models (4-bit or mixed precision) while maintaining high accuracy and low latency.- Implement and benchmark inference runtimes using frameworks like Llama.cpp, Ollama, vLLM, ONNX etc.- Collaborate with hardware engineers to co-design model architectures optimized for the appliance's compute fabric.3. Inference Pipeline Development- Build and maintain scalable, high-throughput inference pipelines capable of handling concurrent multimodal requests (text, audio, image, video).- Implement token streaming, caching, and scheduling strategies for real-time responses.- Develop APIs for low-latency local inference accessible via a web interface.4. Evaluation & Benchmarking- Profile and benchmark performance (throughput, latency, energy efficiency) of deployed models.- Run regression tests to validate numerical accuracy after quantization or pruning.- Define KPIs for multimodal model performance under real-world usage.5. Research & Prototyping- Investigate emerging multimodal architectures and lightweight model variants for local deployment.- Prototype hybrid models that combine LLMs, diffusion models, and ASR/TTS pipelines for advanced multimodal applications.- Stay current on state-of-the-art inference frameworks, compression techniques, and multimodal learning trends.Required Qualifications:- Strong background in deep learning and model deployment, with hands-on experience in PyTorch and/or TensorFlow.- Expertise in model optimization — quantization, pruning, distillation, or mixed-precision inference.- Practical knowledge of inference engines (vLLM, llama.cpp, ONNX Runtime or similar).- Experience deploying large models locally or on edge devices with limited memory/compute constraints.- Familiarity with multimodal model architectures — e.g., CLIP, Flamingo, LLaVA, or AudioGPT-style systems.- Strong software engineering skills (Python, C++, CUDA) and experience integrating models into production systems.- Understanding of GPU/accelerator utilization, memory bandwidth optimization, and distributed inference.Preferred Qualifications:experience-10+ years- Experience with model-parallel or tensor-parallel inference at scale.- Contributions to open-source inference frameworks or model serving systems.- Familiarity with hardware-aware training or co-optimization of neural networks and hardware.- Background in speech, vision, or multimodal ML research.- Track record of deploying models that run entirely offline or on embedded/edge systems.Contact:UdayMulya Technologiesmuday_bhaskar@yahoo.com"Mining The Knowledge Community"
-
Principal Scientist, Machine Learning
2 weeks ago
Bangalore, India Nykaa Full timePrincipal Machine Learning Scientist - Search Location: About the Team: Join Nykaa's Data Science team as a Principal Machine Learning Scientist, where you'll play a pivotal role in driving advancements in search relevance and ranking across our platforms. In this role, you will analyze data, develop machine learning models, and enhance search algorithms,...
-
Machine Learning Engineer
1 week ago
Bangalore, India Capgemini Full timeAIML Engineer Your Role Must have experience with Machine Learning Model Development Expert Level Proficiency in Data Handling (SQL) Hands-on with Model Engineering and Improvement Strong experience in Model Deployment and Productionlization Your Profile 5-14 years of experience in developing and implementing machine learning, Deep Learning, NLP models...
-
Machine Learning Engineer
1 day ago
bangalore, India Capgemini Full timeAIML EngineerYour RoleMust have experience with Machine Learning Model DevelopmentExpert Level Proficiency in Data Handling (SQL)Hands-on with Model Engineering and ImprovementStrong experience in Model Deployment and ProductionlizationYour Profile5-14 years of experience in developing and implementing machine learning, Deep Learning, NLP models across...
-
Machine Learning Engineer
1 week ago
bangalore, India S3B Global Full timeJob Requirement – Machine Learning EngineerJob Title: Machine Learning Engineer Location: Bangalore, India (Hybrid Onsite – Local candidates only) Interview Mode: Video Duration: 12+ Months (Contract to Hire)Role OverviewWe are seeking a highly skilled Machine Learning Engineer with hands-on experience designing, building, and optimizing ML models in...
-
Machine Learning Engineer
1 week ago
Bangalore, India S3B Global Full timeJob Requirement – Machine Learning Engineer Job Title: Machine Learning Engineer Location: Bangalore, India (Hybrid Onsite – Local candidates only ) Interview Mode: Video Duration: 12+ Months (Contract to Hire) Role Overview We are seeking a highly skilled Machine Learning Engineer with hands-on experience designing, building, and optimizing ML models in...
-
Machine Learning Engineer
5 days ago
bangalore, India beBeeMachineLearning Full timeMachine Learning Engineer PositionWe are seeking a skilled Machine Learning Engineer to join our dynamic team. In this role, you will work on cutting-edge machine learning projects, leveraging large datasets to develop innovative solutions that drive business insights and improve decision-making processes.Key Responsibilities:Develop and implement machine...
-
Machine Learning Engineer
5 days ago
bangalore, India Jumio Full timeRole Purpose: The Machine Learning Engineer III will play a critical role in advancing Jumio's Biometric Verification team's mission to develop and enhance state-of-the-art solutions for liveness detection. This role is essential for ensuring the highest standards of security and user verification through the application of advanced machine learning and...
-
Machine Learning Engineer
4 weeks ago
bangalore, India V3 Staffing Full timeWe are hiring passionate Machine Learning Engineers and ML Leads for one of our product-based clients , driving innovation in AI, data engineering, and MLOps. If you have hands-on experience in ML model development, cloud platforms, and Generative AI tools , this opportunity is for you. Responsibilities: Build and maintain scalable data & ML pipelines using...
-
Machine Learning Engineer
2 weeks ago
bangalore, India Aqilea (formerly Soltia) Full timeWe are a consulting company with a bunch of technology-interested and happy people We love technology, we love design and we love quality. Our diversity makes us unique and creates an inclusive and welcoming workplace where each individual is highly valued.With us, each individual is her/himself and respects others for who they are and we believe that when a...
-
Senior Machine Learning Engineer-AI, ML
6 days ago
bangalore, India Dell Full timeSoftware Senior Principal EngineerThe Software Engineering team delivers next-generation application enhancements and new products for a changing world. Working at the cutting edge, we design and develop software for platforms, peripherals, applications and diagnostics — all with the most advanced technologies, tools, software engineering methodologies and...