Machine Learning Engineer

2 weeks ago


Bengaluru, Karnataka, India Sarvam AI Full time ₹ 12,00,000 - ₹ 24,00,000 per year
Company Overview

is a pioneering generative AI startup headquartered in Bengaluru, India. We are dedicated to leading transformative research and development in the field of language technologies. With a focus on building scalable and efficient Large Language Models (LLMs) that support a wide range of languages, particularly Indic languages, aims to reimagine human-computer interaction and build novel AI-driven solutions. Join us as we push the boundaries of AI to create more inclusive and intelligent language processing tools for diverse communities worldwide.

Job Summary

We are looking for an experienced Machine Learning Engineer specializing in model inference and optimization to join our team. This role focuses on improving the efficiency and scalability of LLMs in production, including model deployment, quantization, and inference acceleration. The ideal candidate will have 2-3 years of experience working with ML frameworks such as PyTorch or TensorFlow, a deep understanding of neural network architectures, and a strong interest in LLM inference optimization.

Key Responsibilities
  • Research and implement model optimization techniques for LLMs, including quantization, pruning, distillation, and efficient fine-tuning.

  • Develop and optimize LLM inference pipelines to improve latency and efficiency across CPU/GPU/TPU environments.

  • Benchmark and profile models to identify performance bottlenecks and implement solutions for inference acceleration.

  • Deploy scalable and efficient LLM inference solutions on cloud and on-prem infrastructures.

  • Work with cross-functional teams to integrate optimized models into production systems.

  • Stay up-to-date with the latest advancements in LLM inference, distributed computing, and AI hardware accelerators.

  • Maintain and improve code quality, documentation, and experiment tracking for continuous development.

Must-Have Qualifications
  • Experience: 2-3 years in ML engineering, with a focus on model inference and optimization.

  • Education: Bachelor's or Master's degree in Computer Science, AI/ML, Data Science, or a related field.

  • ML Frameworks: Proficiency in PyTorch or TensorFlow for model training and deployment.

  • Model Optimization: Hands-on experience with quantization (INT8, FP16), pruning, and knowledge distillation.

  • Inference Acceleration: Experience with ONNX, TensorRT, DeepSpeed, or Hugging Face Optimum for optimizing inference workloads.

  • Cloud & Deployment: Experience deploying ML models on AWS, Azure, or GCP using cloud-native ML tools.

  • Profiling & Benchmarking: Familiarity with NVIDIA Nsight, PyTorch Profiler, or TensorBoard for analyzing model performance.

  • Problem-Solving: Strong analytical skills to troubleshoot ML model efficiency and deployment challenges.

Preferred Qualifications
  • Experience with distributed training and inference frameworks (e.g., vLLM, DeepSpeed, FSDP).

  • Understanding of GPU/TPU optimizations, CUDA programming, or low-level ML hardware acceleration.

  • Familiarity with edge and offline model deployment strategies.

Contributions to open-source projects related to ML inference, LLMs, or optimization.



  • Bengaluru, Karnataka, India TCP Corps Full time ₹ 12,00,000 - ₹ 24,00,000 per year

    Responsibilities: • Develop, deploy, and maintain machine learning models using AWS Sagemaker and MLFlow. • Implement end-to-end ML pipelines, from data ingestion to model deployment. • Optimize model performance and scalability. • Collaborate with data scientists to transition models from development to production. • Implement Data Science...


  • Bengaluru, Karnataka, India NatWest Group Full time ₹ 15,00,000 - ₹ 25,00,000 per year

    Machine Learning Engineer Join us as a Machine Learning EngineerWe're looking for someone to deploy, automate, maintain and monitor machine learning models and algorithms to make sure they work effectively in a production environment Day-to-day, you'll collaborate with colleagues to design and develop state-of-the-art machine learning products which...


  • Bengaluru, Karnataka, India Apple Full time ₹ 20,00,000 - ₹ 25,00,000 per year

    Imagine what you could do here. At Apple, new ideas have a way of becoming phenomenal products, services, and customer experiences very quickly. Every single day, people do amazing things at Apple. Do you want to impact the future of Manufacturing here at Apple through cutting edge ML techniques? This position involves a wide variety of skills, innovation,...


  • Bengaluru, Karnataka, India Catalyst IQ Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Description : Years of Experience : YearsJob Title : Machine Learning Engineer (Python Coding with ML Experience)Location : Bangalore (5 days WFO) NO WFH allowed at the moment.Job Summary : We are seeking a highly skilled and versatile Machine Learning Engineer who embodies the rare combination of a strong software engineer and ML exposure with experience...


  • Bengaluru, Karnataka, India Jumio Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Role Purpose: The Machine Learning Engineer III will play a critical role in advancing Jumio's Biometric Verification team's mission to develop and enhance state-of-the-art solutions for liveness detection. This role is essential for ensuring the highest standards of security and user verification through the application of advanced machine learning and...


  • Bengaluru, Karnataka, India Jumio Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Role Purpose:The Machine Learning Engineer III will play a critical role in advancing Jumio's Biometric Verification team's mission to develop and enhance state-of-the-art solutions for liveness detection. This role is essential for ensuring the highest standards of security and user verification through the application of advanced machine learning and deep...


  • Bengaluru, Karnataka, India Huntsmen and Barons Full time ₹ 27,00,000 - ₹ 34,00,000 per year

    BAND: B3Years of Experience: YearsJob Title: Machine Learning Engineer (Python Coding with ML Experience)Location: Bangalore (5 Days Working from KODATHI ODC)NO WFH allowed at the moment.Job Summary: We are seeking a highly skilled and versatile Machine Learning Engineer who embodies the rare combination of a strong software engineer and ML exposure with...


  • Bengaluru, Karnataka, India Catalysts HR Full time ₹ 25,00,000 - ₹ 35,00,000 per year

    Required Qualifications: Education: Master's degree in computer science, Machine Learning, Data Science,Electrical Engineering, or a related quantitative field. Experience: 5+ years of professional experience in Machine Learning Engineering,Software Engineering with a strong ML focus, or a similar role. Must have Programming Skills: Expert-level...


  • Bengaluru, Karnataka, India Angel and Genie Full time ₹ 5,00,000 - ₹ 15,00,000 per year

    As a Machine Learning Engineer (Azure Databricks), your role will involve: - Leading machine learning projects and taking ownership of the development and optimization of algorithms. - Preparing and transforming datasets for analysis and model training. - Evaluating model performance and ensuring successful deployment in production environments. -...


  • Bengaluru, Karnataka, India FxConsulting Full time ₹ 20,00,000 - ₹ 25,00,000 per year

    We are hiring for the position ofSenior Machine Learning Engineerfor a leadingE-commerce organizationbased inBangalore.Experience:4–6 yearsIf you have strong expertise inMachine Learning, Time Series Forecasting, Deep Learning, andLLMs, we would love to connect with youWhat would you be doing/ Expected from this role?• Collaborate with cross-functional...