Vision-Language Models and Generative AI

3 days ago


bangalore district, India Bosch Global Software Technologies Full time

Job Description Roles & Responsibilities: Conduct deep research in: Vision-Language and Multimodal AI for perception and semantic grounding Cross-modal representation learning for real-world sensor fusion (camera, lidar, radar, text) Multimodal generative models for scene prediction, intent inference, or simulation Efficient model architectures for edge deployment in automotive and factory systems Evaluation methods for explain ability, alignment, and safety of VLMs in mission-critical applications Spin newer research directions and drive AI research programs for autonomous driving, ADAS, and Industry 4.0 applications. Create new collaborations within and outside of Bosch in relevant domains. Contribute to Bosch’s internal knowledge base, open research assets, and patent portfolio. Lead internal research clusters or thematic initiatives across autonomous systems or industrial AI. Mentor and guide research associates, interns, and young scientists. Qualifications Educational qualification: Ph.D. in Computer Science / Machine Learning / AI / Computer Vision or equivalent Experience: 8+ years (post PhD) in AI related to Vision and Language modalities, excellent exposure and hands on research in GenAI, VLMs, Multimodal AI, or Applied AI Research. Mandatory/requires Skills: Deep expertise in: Vision-Language Models (CLIP, Flamingo, Kosmos, BLIP, GIT) and multimodal transformers Open- and closed-source LLMs (e.g., LLaMA, GPT, Claude, Gemini) with visual grounding extensions Contrastive learning, cross-modal fusion, and structured generative outputs (e.g., scene graphs) PyTorch, HuggingFace, OpenCLIP, and deep learning stack for computer vision Evaluation on ADAS/mobility benchmarks (e.g., nuScenes, BDD100k) and industrial datasets Strong track record of publications in relevant AI/ML/vision venues Demonstrated capability to lead independent research programs Familiarity with multi-agent architectures, RLHF, and goal-conditioned VLMs for autonomous agents Preferred Skills: Hands-on experience with: Perception stacks for ADAS, SLAM, or autonomous robots Vision pipeline tools (MMDetection, Detectron2, YOLOv8) and video understanding models Semantic segmentation, depth estimation, 3D vision, and temporal models Industrial datasets and tasks: defect detection, visual inspection, operator assistance Lightweight or compressed VLMs for embedded hardware (e.g., in vehicle ECUs or factory edge) Knowledge of reinforcement learning or planning in embodied AI context Strong academic or industry research collaborations Understanding of Bosch domains and workflows in mobility and manufacturing



  • bangalore, India Bosch Global Software Technologies Full time

    Job Description Roles & Responsibilities: Conduct deep research in: Vision-Language and Multimodal AI for perception and semantic grounding Cross-modal representation learning for real-world sensor fusion (camera, lidar, radar, text) Multimodal generative models for scene prediction, intent inference, or simulation Efficient model architectures for edge...


  • Bangalore, India Bosch Global Software Technologies Full time

    Job Description Roles & Responsibilities: Conduct deep research in: Vision-Language and Multimodal AI for perception and semantic grounding Cross-modal representation learning for real-world sensor fusion (camera, lidar, radar, text) Multimodal generative models for scene prediction, intent inference, or simulation Efficient model architectures for edge...


  • bangalore, India Bosch Global Software Technologies Full time

    Job DescriptionRoles & Responsibilities:Conduct deep research in:Vision-Language and Multimodal AI for perception and semantic groundingCross-modal representation learning for real-world sensor fusion (camera, lidar, radar, text)Multimodal generative models for scene prediction, intent inference, or simulationEfficient model architectures for edge deployment...


  • Bangalore Urban, India Bosch Global Software Technologies Full time

    Job DescriptionRoles & Responsibilities:Conduct deep research in:Vision-Language and Multimodal AI for perception and semantic groundingCross-modal representation learning for real-world sensor fusion (camera, lidar, radar, text)Multimodal generative models for scene prediction, intent inference, or simulationEfficient model architectures for edge deployment...


  • Bangalore, India Bosch Global Software Technologies Full time

    Job Description Roles & Responsibilities: Conduct deep research in: - Vision-Language and Multimodal AI for perception and semantic grounding - Cross-modal representation learning for real-world sensor fusion (camera, lidar, radar, text) - Multimodal generative models for scene prediction, intent inference, or simulation - Efficient model architectures for...

  • AI Researcher

    1 day ago


    bangalore, India Stealth AI Startup Full time

    Job Title: AI Researcher – Large Language Models (LLMs) Workplace Type: Work from Office @ Hyderabad Location: Hyderabad, Telangana, India Job Type: Full-Time About Us: We are a well-funded stealth-mode AI startup on a mission to redefine the boundaries of artificial intelligence. Our team is working on next-generation large language models (LLMs) and AI...

  • Generative AI

    2 weeks ago


    bangalore district, India Bosch Global Software Technologies Full time

    Senior Expert – Generative AI (GenAI) No.123 Industrial layout Hosur road Koramangala,, Bengaluru , India Full-time Legal Entity: Bosch Global Software Technologies Private Limited Job Description We are seeking a Senior Expert – Generative AI (GenAI) to join Bosch’s central AI research group at CR/RTC-IN. This role focuses on foundational AI research...


  • Bangalore, Karnataka, India Sarvam AI Full time

    Machine Learning Engineer - Computer Vision Vision-Language Models VLMs About Sarvam AI Sarvam ai is a pioneering generative-AI startup headquartered in Bengaluru India We are dedicated to transformative R D in language technologies building scalable and efficient Large Language Models LLMs that serve a wide spectrum of languages-especially Indic languages...


  • bangalore district, India Ascendion Full time

    About Ascendion Ascendion is a leading provider of AI-powered software engineering solutions that help businesses innovate faster, smarter, and with greater impact. We partner with over 400 Global 2000 clients across North America, APAC, and Europe to tackle complex challenges in applied AI, cloud, data, experience design, and workforce transformation....

  • Director of AI

    2 weeks ago


    bangalore district, India GalaxEye Full time

    Position Overview GalaxEye is seeking an exceptional Director of AI & Computer Vision to spearhead our artificial intelligence and computer vision initiatives. This leadership role will drive the development of cutting-edge algorithms that unlock intelligence from satellite imagery, positioning our company at the forefront of space-based AI applications. Key...