Senior Deep Learning Engineer
1 week ago
Join Nanonets to push the boundaries of what's possible with deep learning. We're not just implementing models – we're setting new benchmarks in document AI, with our open-source models achieving nearly 1 million downloads on Hugging Face and recognition from global AI leaders.
Backed by $40M+ in total funding, including our recent $29M Series B from Accel alongside Elevation Capital and Y Combinator, we're scaling our deep learning capabilities to serve enterprise clients including Toyota and Boston Scientific. You'll work on challenging problems at the intersection of computer vision, NLP, and generative AI.
What You'll Build
Core Technical Challenges:
- Train & Fine-tune SOTA Architectures: Adapt and optimize transformer-based models, vision-language models, and custom architectures for document understanding at scale
- Production ML Infrastructure: Design high-performance serving systems handling millions of requests daily using frameworks like TorchServe, Triton Inference Server, and vLLM
- Agentic AI Systems: Build reasoning-capable OCR that goes beyond extraction – models that understand context, chain operations, and provide confidence-grounded outputs
- Optimization at Scale: Implement quantization, distillation, and hardware acceleration techniques to achieve fast inference while maintaining accuracy
- Multi-modal Innovation: Tackle alignment challenges between vision and language models, reduce hallucinations, and improve cross-modal understanding using techniques like RLHF and PEFT
- Design distributed training pipelines for models with billions of parameters using PyTorch FSDP/DeepSpeed
- Build comprehensive evaluation frameworks benchmarking against GPT-4V, Claude, and specialized document AI models
- Implement A/B testing infrastructure for gradual model rollouts in production
- Create reproducible training pipelines with experiment tracking
- Optimize inference costs through dynamic batching, model pruning, and selective computation
We're on a mission to hire the very best and are committed to creating exceptional employee experiences where everyone is respected and has access to equal opportunity.
Technical Requirements
Must-Have:
- 4+ years of hands-on deep learning experience with production deployments
- Strong PyTorch expertise – ability to implement custom architectures, loss functions, and training loops from scratch
- Experience with distributed training and large-scale model optimization
- Proven track record of taking models from research to production
- Solid understanding of transformer architectures, attention mechanisms, and modern training techniques
- B.E./B.Tech from top-tier engineering colleges
- Experience with model serving frameworks (TorchServe, Triton, Ray Serve, vLLM)
- Knowledge of efficient inference techniques (ONNX, TensorRT, quantization)
- Contributions to open-source ML projects
- Experience with vision-language models and document understanding
- Familiarity with LLM fine-tuning techniques (LoRA, QLoRA, PEFT)
- Proven Impact: Our models are approaching 1 million downloads – your work will have global reach
- Real Scale: Your models will process millions of documents daily for Fortune 500 companies
- Well-Funded Innovation: $40M+ in funding means significant GPU resources and freedom to experiment
- Open Source Leadership: Publish your work and contribute to models already trusted by nearly a million developers
- Research-Driven Culture: Regular paper reading sessions and collaboration with the research community
- Rapid Growth: Strong financial backing and Series B momentum mean ambitious projects and fast career progression
- Nanonets-OCR model: ~1 million downloads on Hugging Face – one of the most adopted document AI models globally
- Launched an industry-first Automation Benchmark defining new standards for AI reliability
- Published research recognized by leading AI researchers
- Built agentic OCR systems that reason and adapt, not just extract
- Secured $40M+ in total funding from Accel, Elevation Capital, and Y Combinator
-
Deep Learning Engineer
2 days ago
Bengaluru, Karnataka, India Aavadhi Ai Full time ₹ 5,00,000 - ₹ 25,00,000 per year
Role & responsibilities: The primary responsibilities of the AI Developer would be: Identifying, evaluating and training the right Deep Learning models required for the Traffic Violations or Security Surveillance Use-Cases. Work on Streaming and Ingestion Use-Cases. Should be able to work on Data Engineering in order to perform Pre-processing & Data Handling...
-
Senior Deep Learning Engineer
5 days ago
Bengaluru, Karnataka, India Nanonets Full time ₹ 20,00,000 - ₹ 25,00,000 per year
Location: Bangalore (Hybrid) | $40M+ Funded | Building State-of-the-Art AI
Join Nanonets to push the boundaries of what's possible with deep learning. We're not just implementing models – we're setting new benchmarks in document AI, with our open-source models achieving nearly 1 million downloads on Hugging Face and recognition from global AI leaders. Backed...
-
Junior Deep Learning Engineer
2 weeks ago
Bengaluru, Karnataka, India Nanonets Full time ₹ 15,00,000 - ₹ 25,00,000 per year
Location: Bangalore (Hybrid)
Nanonets has a vision to help computers see the world starting with reading and understanding documents. Machine Learning (ML) is no longer a futuristic concept; it's a present-day powerhouse transforming the business landscape. Nanonets is at the forefront of this transformation, offering innovative ML solutions designed to make...
-
Bengaluru, Karnataka, India Qualcomm Full time ₹ 12,00,000 - ₹ 36,00,000 per year
Company: Qualcomm India Private Limited
Job Area: Engineering Group, Engineering Group > Systems Engineering
General Summary: Design machine frameworks for Adreno GPU accelerator for computer vision and generative AI needs. Good understanding of network architectures, how a deep learning framework optimizes it to efficiently run on target accelerator. OpenCL (or...
-
Junior Deep Learning Engineer
2 weeks ago
Bengaluru, Karnataka, India Nanonets Full time ₹ 8,00,000 - ₹ 24,00,000 per year
Nanonets has a vision to help computers see the world starting with reading and understanding documents. Machine Learning (ML) is no longer a futuristic concept; it's a present-day powerhouse transforming the business landscape. Nanonets is at the forefront of this transformation, offering innovative ML solutions designed to make document-related processes...
-
Bengaluru, Karnataka, India Ctruh Full time ₹ 12,00,000 - ₹ 36,00,000 per year
We're seeking an experienced engineer with expertise in computer vision, deep learning, and AI to drive impactful solutions.
Key Responsibilities
Algorithm Development: Design and optimize computer vision and deep learning algorithms for 3D applications.
Model Deployment: Set up an end-to-end Deep Learning pipeline for data ingestion, preparation, model training,...
-
Bengaluru, Karnataka, India b116c656-2f20-4dcf-8012-b205e222dc6a Full time ₹ 12,00,000 - ₹ 36,00,000 per year
About The Role
The MLE/SMLE is tasked with enhancing interactions across user gaming experiences and systems. This involves designing, coding, training, documenting, deploying, and evaluating large-scale machine learning systems in a cost-effective manner. We seek individuals capable of creating exceptional products and experiences for millions of users within...
-
Intern Deep Learning Computer Vision
7 days ago
Bengaluru, Karnataka, India de506a80-ae10-4ec2-8b8d-af460ab36056 Full time ₹ 60,000 per year
We are seeking highly motivated, curious, and technically strong interns who are passionate about advancing the field of Artificial Intelligence. As a Deep Learning / Computer Vision Intern, you will work at the intersection of AI research and real-world product innovation, contributing to cutting-edge solutions in image and video intelligence. What You'll...
-
Bengaluru, Karnataka, India Nanonets Full time ₹ 12,00,000 - ₹ 36,00,000 per year
Description
Join Nanonets to push the boundaries of what's possible with deep learning. We're not just implementing models - we're setting new benchmarks in document AI, with our open-source models achieving nearly 1 million downloads on Hugging Face and recognition from global AI leaders. Backed by $40M+ in total funding including our recent $29M Series B...
-
Senior Machine Learning Engineer
5 days ago
Bengaluru, Karnataka, India Tech Beam Designs Inc Full time ₹ 8,00,000 - ₹ 24,00,000 per year
Senior Machine Learning Engineer
Location: Bangalore (Onsite) (In-person)
Experience: 6+ years
Employment Type: Full-time
About the Role: We're seeking a Senior Machine Learning Engineer who thrives on solving complex, high-impact problems. You'll lead the design, experimentation, and deployment of models that drive real-world impact, from demand forecasting and ETA...