Senior Deep Learning Engineer
1 week ago
Join Nanonets to push the boundaries of what's possible with deep learning. We're not just implementing models – we're setting new benchmarks in document AI, with our open-source models achievingnearly 1 million downloads onHugging Faceand recognition from global AI leaders.Backed by$40M+ in total fundingincluding our recent $29M Series B from Accel, alongside Elevation Capital and Y Combinator, we're scaling our deep learning capabilities to serve enterprise clients including Toyota, Boston Scientific, and Bill.com. You'll work on genuinely challenging problems at the intersection of computer vision, NLP, and generative AI.Here's a quick 1-minuteintro video .What You'll BuildCore Technical Challenges:Train & Fine-tune SOTA Architectures : Adapt and optimize transformer-based models, vision-language models, and custom architectures for document understanding at scaleProduction ML Infrastructure : Design high-performance serving systems handling millions of requests daily using frameworks like TorchServe, Triton Inference Server, and vLLMAgentic AI Systems : Build reasoning-capable OCR that goes beyond extraction – models that understand context, chain operations, and provide confidence-grounded outputsOptimization at Scale : Implement quantization, distillation, and hardware acceleration techniques to achieve fast inference while maintaining accuracyMulti-modal Innovation : Tackle alignment challenges between vision and language models, reduce hallucinations, and improve cross-modal understanding using techniques like RLHF and PEFTEngineering Responsibilities:Design distributed training pipelines for models with billions of parameters using PyTorch FSDP/DeepSpeedBuild comprehensive evaluation frameworks benchmarking against GPT-4V, Claude, and specialized document AI modelsImplement A/B testing infrastructure for gradual model rollouts in productionCreate reproducible training pipelines with experiment trackingOptimize inference costs through dynamic batching, model pruning, and selective computationWe’re on a mission to hire the very best and are committed to creating exceptional employee experiences where everyone is respected and has access to equal opportunity.Technical RequirementsMust-Have:4+ years of hands-on deep learning experience with production deployments.Strong PyTorch expertise – ability to implement custom architectures, loss functions, and training loops from scratch.Experience with distributed training and large-scale model optimizationProven track record of taking models from research to productionSolid understanding of transformer architectures, attention mechanisms, and modern training techniques.B.E./B.Tech from top-tier engineering collegesHighly Valued:Experience with model serving frameworks (TorchServe, Triton, Ray Serve, vLLM)Knowledge of efficient inference techniques (ONNX, TensorRT, quantization)Contributions to open-source ML projectsExperience with vision-language models and document understandingFamiliarity with LLM fine-tuning techniques (LoRA, QLoRA, PEFT)Why This Role is ExceptionalProven Impact : Our models approaching1 million downloads– your work will have global reachReal Scale : Your models will process millions of documents daily for Fortune 500 companiesWell-Funded Innovation : $40M+ in funding means significant GPU resources and freedom to experimentOpen Source Leadership : Publish your work and contribute to models already trusted by nearly a million developersResearch-Driven Culture : Regular paper reading sessions, collaboration with research communityRapid Growth : Strong financial backing and Series B momentum mean ambitious projects and fast career progressionOur Recent AchievementsNanonets-OCR model: ~1 million downloads on Hugging Face– one of the most adopted document AI models globallyLaunched industry-first Automation Benchmark defining new standards for AI reliabilityPublished research recognized by leading AI researchersBuilt agentic OCR systems that reason and adapt, not just extract
-
Deep Learning Engineer
4 weeks ago
Delhi, India MAHTO Full timeCompany DescriptionMAHTO is a platform that connects contractors and homeowners with skilled blue-collar workers, primarily from the home building industry. MAHTO also offers Full Stack Home Building Services under the brand "mine", dedicated to delivering exceptional projects across residential, commercial, and industrial sectors. Committed to quality,...
-
Deep Learning Engineer
4 weeks ago
Delhi, India MAHTO Full timeCompany DescriptionMAHTO is a platform that connects contractors and homeowners with skilled blue-collar workers, primarily from the home building industry. MAHTO also offers Full Stack Home Building Services under the brand "mine", dedicated to delivering exceptional projects across residential, commercial, and industrial sectors. Committed to quality,...
-
Deep learning engineer
4 weeks ago
Delhi, India MAHTO Full timeCompany DescriptionMAHTO is a platform that connects contractors and homeowners with skilled blue-collar workers, primarily from the home building industry. MAHTO also offers Full Stack Home Building Services under the brand "mine", dedicated to delivering exceptional projects across residential, commercial, and industrial sectors. Committed to quality,...
-
Deep Learning Engineer
15 hours ago
New Delhi, India Ignitarium Full timeThis is a full-time, on-site role for a Deep Learning Engineer based in Chennai. The Deep Learning Engineer will focus on designing, developing, and optimizing deep learning models for diverse applications. Responsibilities include researching novel deep learning methodologies, implementing algorithms, fine-tuning models, and deploying solutions on embedded...
-
Deep Learning Engineer
4 weeks ago
New Delhi, India MAHTO Full timeCompany DescriptionMAHTO is a platform that connects contractors and homeowners with skilled blue-collar workers, primarily from the home building industry. MAHTO also offers Full Stack Home Building Services under the brand "mine", dedicated to delivering exceptional projects across residential, commercial, and industrial sectors. Committed to quality,...
-
Deep Learning Engineer
3 days ago
Delhi, India Tomorrow World Technology (TWT) Full timePosition: Deep Learning Engineer – Computer Vision & Autonomy Engagement Type: Remote Location: Remote Budget: 1.50 LPM + GST EXP 7-9+ YOE Position Overview: An experienced Deep Learning Engineer specializing in Computer Vision, Sensor Fusion, and Multimodal AI to advance R&D; in autonomous aerial systems and geospatial intelligence, working with...
-
Senior Deep Learning Engineer
1 week ago
Delhi, India Nanonets Full timeJoin Nanonets to push the boundaries of what's possible with deep learning. We're not just implementing models – we're setting new benchmarks in document AI, with our open-source models achieving nearly 1 million downloads on Hugging Face and recognition from global AI leaders.Backed by $40M+ in total funding including our recent $29M Series B from...
-
Delhi, India HCLTech Full timeHCLTech is looking for a highly talented and self- motivated Technical Architect Machine learning/Deep Learning to join in advancing the technological world through innovation and creativityRole: Technical architects (Machine Learning/Deep Learning)Location :Noida/BangaloreMode of Work :HybridExp:12- 15 yrsJob Description:Skills : Python, C++ , Pandas,...
-
Delhi, India HCLTech Full timeHCLTech is looking for a highly talented and self- motivated Technical Architect Machine learning/Deep Learning to join in advancing the technological world through innovation and creativityRole: Technical architects (Machine Learning/Deep Learning)Location :Noida/BangaloreMode of Work :HybridExp:12- 15 yrsJob Description:Skills : Python, C++ , Pandas,...
-
Deep Learning
2 weeks ago
Delhi, India Wynploy Full timeJob Title Deep Learning Computer Vision Engineer Company Confidential Location On-site Delhi Employment Type Full-Time Experience 3-5 Years Role Overview We are seeking a Deep Learning Computer Vision Engineer to design develop and deploy AI-powered solutions You will be responsible for building real-time computer vision pipelines implementing...