
ML Inference Platform Intern
3 days ago
AION is building the next-generation AI cloud platform, transforming the future of high-performance computing (HPC) through its decentralized AI cloud. Purpose-built for bare-metal performance, AION democratizes access to compute power for AI training, fine-tuning, inference, data labeling, and the full-stack AI/ML lifecycle.
Led by high-pedigree founders with previous exits, AION is well-funded by major VCs and has strategic global partnerships. Headquartered in the US with a global presence, the company is building its initial core team across India, London and SF.
Who You Are
You're an ML systems engineer who's passionate about building high-performance inference infrastructure. You don't need to be an expert in everything - this field is evolving too rapidly for that - but you have strong fundamentals and the curiosity to dive deep into optimization challenges. You thrive in early-stage environments where you'll learn cutting-edge techniques while building production systems. You think systematically about performance bottlenecks and are excited to push the boundaries of what's possible in AI infrastructure.
Requirements
Key Responsibilities
- Learn and implement ML inference optimization techniques including KV-cache management, dynamic batching, and quantization under mentorship.
- Contribute to GPU optimization projects using CUDA with hands-on learning of Triton kernel development and performance tuning.
- Build model benchmarking and evaluation frameworks to assess performance across different models and optimization strategies.
- Research and experiment with trending open-source models (DeepSeek R1, Qwen 3, Llama variants) to understand optimization opportunities.
- Implement cost-performance analysis tools to understand tradeoffs between speed, quality, and resource usage.
- Explore agent system implementations and multi-step reasoning workflows for future platform capabilities.
- Document what you learn and create technical guides for internal team knowledge sharing and customer education.
- High-agency individual with a strong willingness to experiment and learn with the team.
- Previous internships or projects in ML infrastructure, contributions using PyTorch/ML frameworks, competitive programming achievements, research experience in ML systems, familiarity with agent systems or reasoning techniques.
- Strong coding and implementation skills in Python and C++ with demonstrated ability to write performant, production-quality code.
- Experience reading and contributing to large codebases with proof of open-source contributions (GitHub profile required).
- Proof of technical work through projects like Google Summer of Code, hackathon wins, competitive programming, or significant open-source contributions.
- Working knowledge of deep learning fundamentals including neural networks, transformers, and basic training/inference concepts.
- Basic understanding of PyTorch including model development and tensor operations.
- Fundamental knowledge of GPU computing or strong willingness to learn CUDA programming.
- Working knowledge of at least one inference framework (vLLM, TensorRT-LLM, Hugging Face) through coursework or personal projects.
- Understanding of distributed systems concepts and performance optimization principles.
- Join the ground floor of a mission-driven AI startup revolutionizing compute infrastructure.
- Learn from world-class engineers and gain hands-on experience with cutting-edge inference optimization techniques.
- Work with a high-caliber, globally distributed team backed by major VCs.
- Significant learning and growth opportunity in one of the fastest-moving areas of AI infrastructure.
- Competitive internship compensation with potential for full-time conversion.
- Fast-paced, flexible work environment with room for ownership and impact.