Multi Modal AI Senior Developer

2 weeks ago

Gurgaon, Haryana, India dentsu Full time ₹ 20,00,000 - ₹ 25,00,000 per year

We are seeking a seasoned and experienced Multi Modal AI Senior Developer with a deep expertise in leveraging Generative AI for creative and content generation at scale.

The ideal candidate would have a deep understanding of Multi Modal AI, the ability to leverage Gen-AI models for all creative output and content types, and the ability to work across all content and interaction modalities – including text, visuals, audio/speech and video. A strong foundation in Large Language models and Vision Language Models (VLM) is also highly desirable.

As a Multi Modal AI Senior Developer, you will play a key role in building cutting-edge products and innovative solutions, combining the full power of Creative AI workflows, Generative AI, LLMs, and Agentic AI.

Your primary focus will be on building bespoke products and creative workflows leveraging Gen-AI models to help build out our creative product portfolio for some of our largest, most strategic enterprise product solutions.

The candidate should have a good technical background in Product Development, Cloud-Native App Dev, Front-End and Back-End Web Application Development, and the ability to build these solutions in cloud environments such as Azure and AWS, integrating with the appropriate multi modal AI services.

The candidate will also need to have strong expertise with Cloud AI services such as Azure OpenAI, AWS Bedrock, and Google Gemini, and all the foundation models hosted within those services. Additionally, hands-on experience with a variety of foundation models, Vision Language Models (VLMs), Creative AI Services and APIs, and the ability to seamlessly integrate all of these together, in an automated workflow using APIs and AI Assistants, will be an essential skillset.

Job Description:
Key Skills required :
Generative AI, Multi Modal AI

Creative AI solutions and workflows across all creative content types including Copy/Text, Imagery, Key Visuals, Characters, Avatars, Audio, Speech and Video AI

Creative AI Automation workflows with content creation and content editing at scale, using AI services and AI APIs

Experience with multiple Multi Modal AI Foundation Models

LLM, LLM App Dev

AI Agents, Agentic AI Workflows

Responsibilities
:

Design and build web apps and solutions that leverage Creative AI Services, Multi Modal AI models, and Generative AI workflows
Leverage Multi modal AI capabilities supporting all content types and modalities, including text, imagery, audio, speech and video
Build creative automation workflows that help produce creative concepts, creative production deliverables, and integrated creative outputs, leveraging AI and Gen-AI models
Integrate AI Image Gen Models and AI Image Editing models from key technology partners
Integrate Text / Copy Gen Models for key LLM providers
Integrate Speech / Audio Gen and Editing models for use cases such as transcription, translation, and AI generated audio narration
Integrate AI enabled Video Gen and Video Editing models
Fine-Tune Multi Modal AI models for brand specific usage and branded content generation
Constantly Research and explore emerging trends and techniques in the field of generative AI and LLMs to stay at the forefront of innovation.
Drive product development and delivery within tight timelines
Collaborate with full-stack developers, engineers, and quality engineers, to develop and integrate solutions into existing enterprise products.
Collaborate with technology leaders and cross-functional teams to develop and validate client requirements and rapidly translate them into working solutions.
Develop, implement and optimize scalable AI-enabled products
Integrate Gen-AI and Multi Modal AI solutions into Cloud Platforms, Cloud Native Apps, and custom Web Apps
Execute implementation across all layers of the application stack – including front-end, back-end, APIs, data and AI services
Build enterprise products and full-stack applications on the MERN + Python stack, with a clear separation of concerns across layers

Skills and Competencies:

Deep Hands-on Experience in Multi modal AI models and tools.
Hands-on Experience in API integration with AI services
Multi Modal AI – competencies :
Hands-on Experience with intelligent document processing and document indexing + document content extraction and querying, using multi modal AI Models
Hands-on Experience with using Multi modal AI models and solutions for Imagery and Visual Creative – including text-to-image, image-to-image, image composition, image variations, etc.
Hands-on Experience with popular AI Image Composition and Editing models from providers such as Adobe Firefly, Getty Images, ShutterStock, Flux and Flux Pro, and Stable Diffusion, and the ability to integrate them programmatically over API calls and workflows
Hands-on Experience with Computer Vision and Image Processing using Multi-modal AI – for use cases such as object detection, automated captioning, automated masking, and image segmentation – again all done programmatically over API calls and Workflows
Hands-on Experience with using Multi modal AI for Speech – including Text to Speech, Speech to Text, and use of Pre-built vs. Custom Voices
Hands-on Experience with building Voice-enabled and Voice-activated experiences, using Speech AI and Voice AI solutions
Hands-on Experience with AI Character and AI Avatar development, using a variety of different tools and platforms
Fine-Tuning Creative AI Content models for Custom Styles, Custom Characters, and Custom Brand specific imagery
Fine-Tuning Speech Models for Custom Voices
Good understanding of advanced fine-tuning techniques such as LoRA
Ability to execute and run fine-tuning workflows, end-to-end, in particular for Image Gen and Image Editing models
Hands-on Experience with leveraging APIs to orchestrate across Multi Modal AI models
Hands-on Experience with building workflows that orchestrate across Multi Modal AI models
Good Experience with using AI Assistants to drive natural language interactions and orchestration with Multi Modal AI models
Good Experience with use of AI Agents and Agentic AI workflows to drive dynamic orchestration across Multi Modal AI services and models
Programming Skills :
Good Expertise in MERN stack (JavaScript) including client-side and server-side JavaScript
Good Expertise in Python based development, including Python App Dev for Multi Modal AI Integration
Well-rounded in both programming languages
Strong experience in client-side JavaScript Apps and building Static Web Apps + Dynamic Web Apps both in JavaScript
Hands-on Experience in front-end and back-end development
Minimum 2+ years hands-on experience in working with Full-Stack MERN apps, using both client-side and server-side JavaScript
Minimum 2 years hands-on experience in Python development
Minimum 2 years hands-on experience in working with LLMs and LLM models, using Python
LLM Dev Skills :
Solid Hands-on Experience with building end-to-end RAG pipelines and custom AI indexing solutions to ground LLMs and enhance LLM output
Good Experience with building AI and LLM enabled Workflows
Hands-on Experience integrating LLMs with external tools such as Web Search
Ability to leverage advanced concepts such as tool calling and function calling, with LLM models
Hands-on Experience with Conversational AI solutions and chat-driven experiences
Experience with multiple LLMs and models – primarily GPT-4o, GPT o1, and o3 mini, and preferably also Gemini, Claude Sonnet, etc.
Experience and Expertise in Cloud Gen-AI platforms, services, and APIs, primarily Azure OpenAI, and perferably also AWS Bedrock, and/or GCP Vertex AI.
Hands-on Experience with Assistants and the use of Assistants in orchestrating with LLMs
Hands-on Experience working with AI Agents and Agent Services.

Nice-to-Have capabilities (Not essential) :

Hands-on Experience with building Agentic AI workflows that enable iterative improvement of output
Hands-on experience with both Single-Agent and Multi-Agent Orchestration solutions and frameworks
Hands-on experience with different Agent communication and chaining patterns
Ability to leverage LLMs for Reasoning and Planning workflows, that enable higher order "goals" and automated orchestration across multiple apps and tools
Ability to leverage Graph Databases and "Knowledge Graphs" as an alternate method / replacement of Vector Databases, for enabling more relevant semantic querying and outputs via LLM models.
Good Background with Machine Learning solutions
Good foundational understanding of Transformer Models
Good foundational understanding of Diffusion Models
Some Experience with custom ML model development and deployment is desirable.
Proficiency in deep learning frameworks such as PyTorch, or Keras.
Experience with Cloud ML Platforms such as Azure ML Service, AWS Sage maker, and NVidia AI Foundry.

Location:
DGS India - Pune - Kharadi EON Free Zone

Brand:
Dentsu Creative

Time Type:
Full time

Contract Type:
Permanent

Generative AI Manager Google Cloud

2 weeks ago

Gurgaon, Haryana, India Ventures Hrd Centre Full time ₹ 8,33,333 - ₹ 25,00,000 per year

Lead Generative AI solutions with LLMs, LangChain, Hugging Face, Google Cloud; architect multi-modal systems, RAG, vector databases; , security, stakeholder alignment; mentor teams; drive deployments, product strategy, and enterprise-scale AI.
Senior Consultant

2 weeks ago

Gurgaon, Haryana, India WNS Holdings Full time ₹ 12,00,000 - ₹ 36,00,000 per year

Role - Data ScientistSkill - Data Scientist + LLM + Gen AI + Deep LearningDesignation - Sr. Consultant (AVP)The Senior AI Scientist is a technical role responsible for driving the design, development, and deployment of advanced AI components, models and systems. With 7+ years of experience in artificial intelligence and machine learning, this role is focused...
Gen AI Manager

2 weeks ago

Gurgaon, Haryana, India VE Commercial Vehicles Full time ₹ 15,00,000 - ₹ 25,00,000 per year

Role & responsibilitiesLead the development of Generative AI models using frameworks like Hugging Face Transformers, LangChain, and LlamaIndex.Architect and manage multi-modal AI systems integrating text, image, and structured data inputs.Implement and optimize LLM agents, retrieval-augmented generation (RAG), and vector database integrations.Utilize Google...
Generative AI Engineer

1 week ago

Gurgaon, Haryana, India Inizio Advisory Full time ₹ 20,00,000 - ₹ 25,00,000 per year

Role OverviewWe are looking for highly skilled with 4 to 5 years experiencedGenerative AI Engineerto design and deploy enterprise-grade GenAI systems. This role blends platform architecture, LLM integration, and operationalization—ideal for engineers with strong hands-on experience in large language models, RAG pipelines, and AI...
Senior Python Developer

2 weeks ago

Gurgaon, Haryana, India Darwix AI Full time ₹ 15,00,000 - ₹ 20,00,000 per year

We're Hiring:Senior Python Developer – Backend Engineering Gurgaon (On-site) | Full-Time | 2–6 Years ExperienceAbout Darwix AIDarwix AI is one of India's fastest-growing enterprise AI startups. Our GenAI-powered conversational intelligence and real-time agent assist platform helps leading enterprises across India, MENA, and Southeast Asia supercharge...
Senior Machine Learning Engineer

2 weeks ago

Gurgaon, Haryana, India AiSensy Full time ₹ 12,00,000 - ₹ 36,00,000 per year

About AiSensyAiSensy is a WhatsApp based Marketing & Engagement platform helping businesses like Adani, Delhi Transport Corporation, Yakult, Godrej, Aditya Birla Hindalco, Wipro, Asian Paints, India Today Group, Skullcandy, Vivo, Physicswallah, and Cosco grow their revenues via WhatsApp.Enabling 100,000+ Businesses with WhatsApp Engagement & Marketing400...
Data Scientist – .AI

2 weeks ago

Gurgaon, Haryana, India TP Full time ₹ 12,00,000 - ₹ 36,00,000 per year

Maximize Your Impact with TPWelcome to TP, a global hub of innovation and empowerment, where we redefine the future. With a remarkable €10 billion annual revenue and a global team of 500,000 employees serving 170 countries in over 300 languages, we lead in intelligent, digital-first solutions.As a globally certified Great Place to Work in 72 countries, our...
AI Technical Lead

5 days ago

Gurgaon, Haryana, India BitsAtom Technologies Full time ₹ 12,00,000 - ₹ 36,00,000 per year

Location: GurugramEmployment Type: Full-Time OnsiteExperience Required: 8–10 YearsAbout the Role:We are seeking a visionary and hands-on Senior AI Technical Lead to spearhead our Generative AI initiatives focusing on conversational bot development, prompt engineering, and scalable AI solutions. This role demands deep technical expertise, strategic...
Full Stack Developer

2 weeks ago

Gurgaon, Haryana, India CoPoint AI Full time ₹ 12,00,000 - ₹ 36,00,000 per year

About CoPoint AICoPoint AI is a specialized consulting firm focused on transforming businesses through process improvement, data insights, and technology-driven innovation. We leverage AI technologies, Microsoft cloud platforms, and modern web development frameworks to deliver intelligent, scalable solutions that drive measurable impact for our clients. Our...
Senior AI/ML Engineer

1 day ago

Gurgaon, Haryana, India BigStep Technologies Full time ₹ 12,00,000 - ₹ 24,00,000 per year

Description : We are seeking an experienced Senior AI/ML Engineer to lead the design and implementation of AI-powered solutions leveraging large language models (LLMs), multimodal AI, and advanced generative technologies. You will play a pivotal role in defining the technical vision, architecture, and strategy for enterprise-grade GenAI systems -...

Americas

Europe

Asia / Oceania

Africa

Multi Modal AI Senior Developer