AI intern
2 weeks ago
Job Description: AI Agent & LLM Systems Intern (Fine-tuning, Evaluation, MoE, KV Caching)
Position: AI Agent Development Intern (LLM Systems)
Location: Hybrid (Office + Remote)
Duration: 6 months
Department: AI / Research Engineering
About the Role
We are seeking a highly motivated AI Agent & LLM Systems Intern with a strong interest in LLM internals, fine-tuning, model evaluation, and systems-level optimization. This work directly feeds into our upcoming platform in the wealthtech space.
This role goes well beyond basic AI usage — you will work on understanding, modifying, and optimizing model behavior, including:
- Fine-tuning open-source LLMs
- Evaluating models on relevance, accuracy, ranking, and safety metrics
- Deep-dive systems topics: Mixture-of-Experts (MoE), KV caching, attention optimizations, and runtime efficiency
- Integrating optimized models into multi-agent workflows used in production
This internship is ideal for candidates who want hands-on R&D experience in LLM engineering, model mechanics, and AI agent infrastructure.
Key Responsibilities
1. LLM Fine-Tuning & Adaptation
- Assist in fine-tuning LLMs using LoRA, QLoRA, DPO, and supervised pipelines under strict GPU budgets.
- Create domain-specific agents for finance, data extraction, and research workflows.
- Prepare training datasets, including cleaning, filtering, and alignment steps.
2. Model Evaluation & Benchmarking
- Evaluate model performance using:
  - Accuracy / Precision / Recall
  - F1-Score
  - ROUGE
  - NDCG (ranking relevance)
  - Domain-specific metrics
- Compare models (Q&A models, MoE architectures, small vs. large models).
- Design evaluation scripts that run on GPUs and track improvements.
3. Understanding LLM Internals (Under-the-hood Systems Work)
Hands-on exposure to topics such as:
- KV caching and inference-time acceleration
- Mixture-of-Experts (MoE) routing and efficiency
- Transformer architecture internals (attention, feed-forward, positional encodings)
- GPU memory optimization for fine-tuning
- Quantization (INT8, INT4, GPTQ, AWQ)
- Understanding tokenization, context window management, and attention scaling
4. Agent & Pipeline Integration
- Integrate tuned models into multi-agent systems using LangChain / custom orchestration.
- Create tools and APIs for agent workflows such as data extraction, reasoning, and ranking.
- Optimize inference pipelines for latency, batching, and caching.
5. Research Engineering Support
- Read and summarize research on MoE, KV caching, and training optimization.
- Experiment with model variants (SFT, DPO, RLHF-lite).
- Document experiments, lessons, and hyperparameter choices thoroughly.
Required Skills
Technical Skills
- Strong Python skills (PyTorch experience preferred).
- Understanding of Transformers and attention.
- Basic knowledge of LLM fine-tuning (LoRA/QLoRA).
- Experience with datasets, data loaders, and training loops.
- Familiarity with GPUs, CUDA basics, or model quantization.
- Ability to analyze model outputs and compute evaluation metrics.
Preferred Skills (Bonus)
- Experience with MoE models or study of MoE architectures
- Understanding of KV cache mechanics or inference acceleration
- Experience with FAISS/Chroma vector stores
- Familiarity with multi-agent frameworks or LangChain
- Exposure to training dashboards (Weights & Biases, TensorBoard)
Soft Skills
- Strong analytical mindset and curiosity about how models work under the hood
- Ability to read research papers and convert ideas into code
- Clear documentation and structured thinking
- Comfortable working in fast, iterative development cycles
What You Will Learn
- Real-world LLM fine-tuning workflows on NVIDIA GPUs
- Implementing KV cache optimizations in agents
- Working with MoE architectures and understanding load balancing
- Benchmarking models at scale
- Deploying optimized LLMs inside production agent systems
- Building evaluation datasets across 100k+ samples
- Understanding how model internals map to performance and accuracy
Why Join Us?
- Direct mentorship in:
  - LLM engineering
  - World models
  - Model optimization
- Work impacts real AI products in finance and knowledge automation
- Possible path to a full-time Research Engineer or Agent Developer role
Job Type: Internship
Contract length: 6 months
Pay: From ₹5,000.00 per month
Application Question(s):
- Provide a summary of the AI agentic frameworks you have used, the topics you have worked on, and the topics you have self-studied.
Experience:
- Agent development: 1 year (Required)
Work Location: Remote