Computer Vision

2 days ago


New Delhi, India doAZ Full time

Computer Vision & Multimodal LLMIntern (Drawing Change Analysis Agent)About DoazDoaz turns fragmented industrial knowledge into instant, actionable insight. We build LLM- and Vision-AI solutions for construction, heavy industry, and finance—helping teams convert drawings, specs, and regulations into real-time decisions. We’re expanding our GeoAI programs (incl. joint work with POSCO E&C) and launching drawing-change detection services that compare plan versions, detect deltas, and explain design impacts. Why You’ll Love Working HereShip real things: Your models and tools can reach production pilots in weeks. Mentorship, not bureaucracy: Learn directly from senior CV/LLM engineers and domain SMEs. Global crew: 30 teammates across KR/ PK/ IN ; English-first collaboration. Tech playground: YOLO/RT-DETR, Gemma-VL/Qwen-VL/LLaVA, PaddleOCR, LayoutLMv3, Triton—hands-on. Role OverviewAs a CV & Multimodal LLMIntern, you’ll support the end-to-end development of a version-aware drawing-diff engine (PDF/DWG raster & vector), symbol/text extraction, and change-impact narratives powered by RAG/LLM. You’ll prototype, evaluate, and iterate with fast feedback from real engineering users. What You’ll Do (Intern Scope)Drawing Change Analysis (CV): assist in rasterization, layer parsing, vector geometry ops; train/evaluate detectors (YOLOv8/RT-DETR/SAM); implement geometry-aware post-processing (IoU/topology/snapping). Document & Layout Understanding: combine OCR (PaddleOCR/Tesseract) with layout models (DocFormer/LayoutLMv3/Donut); normalize to structured JSON; help with version-aware entity tracking (gridlines, BH IDs, coordinates). GeoAI & LLM/RAG: set up retrieval (BM25 + vector with reranking); ground LLM answers with citations and clickable evidence; draft change-impact summaries with rule prompts + LLM verification. Productization Basics: package prototypes as FastAPI services or notebooks; write READMEs; contribute datasets, labeling guides, and simple A/B or ablation tests. Minimum QualificationsBS/MS student or recent graduate in CS/EE/CE/Geoinformatics/Civil (or similar). Solid Python (3.x); foundations in DS/algorithms, linear algebra, probability. Coursework/projects in CV and/or document AI (detection, segmentation, OCR, layout). Familiar with PyTorch or TensorFlow; Git, Linux, Jupyter. Clear written English; high learning velocity and ownership. Nice to HaveHands-on with YOLO/RT-DETR/Detectron2/SAM; PaddleOCR/Tesseract; LayoutLMv3/Donut. Exposure to VLMs (Gemma-VL, Qwen-VL, LLaVA), CLIP, rerankers. Experience with engineering drawings/CAD/PDF toolchains. Basic FastAPI, Docker, ONNX/TensorRT/Triton. Frontend (TypeScript/React) for quick review UIs. Internship Details & BenefitsType/Duration:Paid internship — 4 months (full-time preferred). Compensation (India):Stipend prorated from 6 LPA(INR600,000 annualized), paid monthly ( ≈ INR 50,000/monthduring the internship). For candidates outside India, compensation will bebenchmarked to local market equivalents. Conversion:High performers will receive a full-time offerupon successful completion of the 4-month internship. Perks: Mentorship, cloud/GPU credits, real production impact. Hiring Process (fast)Intro call (15–20 min). 48-hour mini task: simple drawing diff or OCR/layout extraction + short README (clarity > polish). Tech chat (45–60 min): approach, trade-offs, evaluation. Founder chat on culture & goals. Offer. How to ApplyEmaildoaz@doaz.ai with subject[CV/LLM Intern – Your Name]and include: Résumé/CV (highlight courses/projects; metrics if available). GitHub or demo links (CV/doc-AI/RAG preferred). Availability (start date, weekly hours). (Optional) A one-page diagram of your “Drawing Revision → Detection → Evidence → LLM Narrative” pipeline. Ready to learn fast and turn messy drawings into trusted intelligence? Join Doaz and build with us.



  • New Delhi, India Jyodha innovations private limited Full time

    This is a contract, remote role for a Computer Vision Engineer. The Computer Vision Engineer will be responsible for developing and implementing computer vision algorithms and working with pattern recognition. The responsibilities include implementing production-grade algorithms for facial feature detection under varied lighting, semantic segmentation and...


  • New Delhi, India Green HR Solutions Full time

    Hiring for a USA based multinational Software CompanyWe are seeking a talented Computer Vision Engineer to join our team and develop innovative solutions using cutting-edge AI and image processing technologies. The ideal candidate will have strong experience in computer vision, machine learning, and deep learning frameworks to build real-world applications...


  • New Delhi, India Omnipresent Robot Tech Full time

    Position Title: Computer Vision Engineer – Drone-Based SolutionsAbout Us:Omnipresent Robot Tech Pvt. Ltd. is an innovative startup pushing the boundaries of robotics, drones, and space tech. We recently contributed to ISRO’s Chandrayaan-3 missionby developing the perception and navigation module for the Pragyaan rover. Currently, we are developing...


  • New Delhi, India Omnipresent Robot Tech Full time

    Position Title: Computer Vision Engineer – Drone-Based SolutionsAbout Us: Omnipresent Robot Tech Pvt. Ltd. is an innovative startup pushing the boundaries of robotics, drones, and space tech. We recently contributed to ISRO’s Chandrayaan-3 missionby developing the perception and navigation module for the Pragyaan rover. Currently, we are developing...

  • Computer Vision

    3 weeks ago


    New Delhi, India TekPillar® Full time

    Job Title:Computer Vision / Machine Vision Engineer Experience:5 to 8 Years Location:Manesar, GurgaonNotice Period: Immediate to 30 DaysKey Responsibilities Develop machine vision systems forinspection, measurement, and quality control . Configure vision software and hardware ( cameras, lighting, lenses ) to meet project requirements. Integrate vision...


  • New Delhi, India Eternal Robotics Full time

    About Eternal Robotics – HyperviseHypervise is the flagship industrial AI platform of Eternal Robotics, built to transform real-time quality inspection and automation through advanced computer vision and edge AI. Our clients span automotive, apparel, pharmaceuticals, and more — industries where speed, accuracy, and reliability are mission-critical.Role...


  • New Delhi, India ACL Digital Full time

    Proficiency in basis of CVML (Computer Vision Machine Learning), Python and GUI Building, Exposure to LLM, Mechanical Engineering basics.


  • New Delhi, India Eternal Robotics Full time

    About Eternal Robotics – HyperviseHypervise is the flagship industrial AI platform of Eternal Robotics, built to transform real-time quality inspection and automation through advanced computer vision and edge AI. Our clients span automotive, apparel, pharmaceuticals, and more — industries where speed, accuracy, and reliability are mission-critical.Role...


  • New Delhi, India Whatjobs IN C2 Full time

    About Eternal Robotics – Hypervise Hypervise is the flagship industrial AI platform of Eternal Robotics, built to transform real-time quality inspection and automation through advanced computer vision and edge AI. Our clients span automotive, apparel, pharmaceuticals, and more — industries where speed, accuracy, and reliability are mission-critical. Role...


  • New Delhi, India IIT Mandi iHub and HCi Foundation Full time

    IIT Mandi iHub and HCi Foundation (iHub) is a section 8 company established under the National Mission on Interdisciplinary Cyber- Physical Systems (NM-ICPS). The focus area of IIT Mandi iHub is “Human-Computer Interaction.” Role We are looking for candidates to contribute to two of our flagship projects in deep-tech areas of computer vision, AI/ML, and...