Consultant - Gen AI Developer

1 day ago


Delhi, Delhi, India Delphi Consulting Middle East Full time ₹ 12,00,000 - ₹ 36,00,000 per year

Join Delphi - Where Innovation meets transformation
At Delphi, we believe in creating an environment where our people thrive. Our
hybrid work model
empowers you to choose where you work—whether it's from the office, your home, or a mix of both—so you can prioritize what matters most. We are committed to supporting your personal goals, family, and overall well-being while driving transformative results for our clients.

We welcome exceptional talent from anywhere across the globe. Interviews and onboarding are conducted virtually, reflecting our digital-first mindset.

Rooted in the region, we specialize in delivering tailored, impactful solutions in
Data, Advanced Analytics and AI, Infrastructure, Cloud Security, and Application Modernization.
Whether it's enabling
predictive analytics
, transforming operations with automation, or driving customer engagement with intelligent platforms, we are the trusted partner for organizations ready to embrace a smarter, more efficient future.

About The Role
We are looking for a Senior AI/ML Solution Architect with deep expertise in Generative AI and agentic systems to lead the design and implementation of enterprise-scale AI solutions. This role requires a unique blend of hands-on technical expertise in both Large Language Models (LLMs) and Small Language Models (SLMs), combined with the architectural vision to deploy these solutions across diverse computing environments. The ideal candidate will architect scalable agentic solutions, implement advanced fine-tuning strategies, and design comprehensive integration systems that connect AI capabilities with enterprise applications. You will be at the forefront of our AI transformation initiatives, working with cutting-edge technologies while maintaining a practical approach to deployment and optimization.
Job Responsibilities
Architecture & Design

  • Design and architect scalable agentic solutions using advanced LLM capabilities.
  • Implement Model Context Protocol (MCP) integrations to connect applications with diverse external services and APIs.
  • Develop multi-agent orchestration systems for complex workflow automation.
  • Design context and memory management systems for persistent agent interactions.

Technical Implementation

  • Build and optimize Retrieval-Augmented Generation (RAG) systems for efficient knowledge retrieval
  • Implement agent frameworks (LangChain, LangGraph, Semantic Kernel, Agno) for various deployment environments
  • Design and deploy model inference pipelines optimized for different computing environments (cloud, edge, on-premises)
  • Develop comprehensive fine-tuning strategies for both Large Language Models (LLMs) and Small Language Models (SLMs)
  • Architect SLM deployment strategies for resource-constrained environments
  • Implement model compression and quantization techniques for efficient inference

Integration & Connectivity

  • Architect REST/gRPC/GraphQL APIs and SDK integrations for seamless service connectivity
  • Implement event-driven architectures using webhooks and message buses
  • Design secure authentication and authorization systems (SSO/OIDC)
  • Build connectors for popular platforms (Slack, Jira, Salesforce, CRM/ERP systems)

Data & Model Management

  • Design comprehensive data preprocessing pipelines including cleaning, deduplication, and PII reduction
  • Implement embedding creation and re-embedding strategies for optimal retrieval
  • Develop chunking and windowing strategies for mobile-optimized content processing
  • Establish model selection criteria and evaluation frameworks

Job Requirements
Core AI/ML Expertise

  • Foundation Models: Deep experience with GPT-4, Claude, LLaMA, and other state-of-the-art LLMs
  • Small Language Models (SLMs): Expertise in deploying and optimizing SLMs (Phi-3, Gemma, TinyLlama) for mobile environments
  • Agent Frameworks: Proficiency in LangChain, LangGraph, Microsoft Semantic Kernel, Agno, and custom agent development
  • RAG Systems: Advanced knowledge of retrieval-augmented generation, vector databases, and semantic search.

Fine-tuning & Adaptation

  • Advanced fine-tuning techniques: LoRA/QLoRA, DoRA, AdaLoRA for parameter-efficient training
  • Model compression: Pruning, quantization (INT8/INT4), knowledge distillation
  • Prompt-tuning, adapters, prefix tuning, and P-tuning v2 methodologies
  • RLHF/RLAIF techniques for alignment and preference learning
  • Domain-specific fine-tuning for mobile use cases and vertical applications

Deployment & Optimization

  • SLM Deployment: Expertise in deploying Small Language Models across various computing environments
  • Multi-Platform Optimization: Experience optimizing both LLMs and SLMs for cloud, edge, and onpremises deployment
  • Efficient Inference: Knowledge of quantization (GPTQ, AWQ, GGML), pruning, and distillation techniques
  • Model Compression: Advanced techniques for reducing model size while maintaining performance
  • Real-time Processing: Expertise in streaming inference and adaptive reasoning depth control
  • Performance Optimization: Proficiency in autoscaling, rate limiting, and resource management

Adaptive Fine-tuning

  • Environment-specific model adaptation and optimization
  • Federated learning approaches for distributed fine-tuning
  • Few-shot and zero-shot learning techniques for resource-efficient adaptation

Integration Technologies

  • MCP Implementation: Deep understanding of Model Context Protocol for service integration
  • API Development: Expertise in designing and implementing REST, gRPC, and GraphQL APIs
  • Event Systems: Experience with event buses, webhooks, and real-time communication
  • Security: Knowledge of secure storage, caching, and access control systems

Development Frameworks

  • Libraries: TensorFlow, PyTorch, Hugging Face Transformers, LlamaIndex
  • Application Development: Web frameworks, desktop applications, API development
  • Cloud Platforms: AWS, GCP, Azure with focus on AI/ML services
  • DevOps: CI/CD pipelines, containerization (Docker/Kubernetes), monitoring

Preferred Qualifications

  • Master's or PhD in Computer Science, AI, Machine Learning, or related field
  • Published research or contributions to open-source AI/ML projects
  • Experience with multi-modal models and cross-modal applications
  • Knowledge of MLOps best practices and model lifecycle management
  • Experience with regulatory compliance in AI systems (GDPR, AI Act, etc.)
  • Track record of leading AI transformation initiatives in enterprise environments
  • Certifications in cloud platforms (AWS, GCP, Azure) with focus on AI/ML services

Technical Competencies to Be Assessed

  • System design and architecture for distributed AI systems
  • Code review and optimization for production AI deployments
  • Performance benchmarking and model evaluation methodologies
  • Cost optimization strategies for large-scale AI deployments
  • Security and privacy considerations in AI systems
  • Scalability patterns for AI applications

What we offer

At Delphi, we are dedicated to creating an environment where you can thrive, both professionally and personally. Our
competitive compensation package, performance-based incentives,
and health benefits are designed to ensure you're well-supported. We believe in your continuous growth and offer
company-sponsored certifications, training programs
, and skill-building opportunities to help you succeed.

We foster a culture of inclusivity and support, with
remote work
options and a fully supported work-from-home setup to ensure your comfort and productivity. Our positive and inclusive culture includes team activities, wellness and mental health programs to ensure you feel supported.



  • Delhi, Delhi, India Lovers Ai Full time ₹ 15,00,000 - ₹ 25,00,000 per year

    As a CTO - Gen AI Engineer/Full Stack at Lovers Ai in Delhi, India, you will play a crucial role in leading software development projects, developing IT strategies, and overseeing the entire product development lifecycle. Your responsibilities will include project management, creating detailed architecture, and ensuring that products meet high standards of...


  • Delhi, Delhi, India Lenskart Full time

    About LenskartLenskart is India's leading eyewear brand, revolutionizing the way people buy eyewear through technology, design, and world-class customer experience. With a fast-growing footprint across India and global markets, we are building intelligent, tech-first solutions to optimize operations and scale efficiently.We are now looking for AI/Gen AI...


  • Delhi, Delhi, India Lenskart Full time ₹ 20,00,000 - ₹ 25,00,000 per year

    About LenskartLenskart is India's leading eyewear brand, revolutionizing the way people buy eyewear through technology, design, and world-class customer experience. With a fast-growing footprint across India and global markets, we are building intelligent, tech-first solutions to optimize operations and scale efficiently.We are now looking forAI/Gen AI...


  • Delhi, Delhi, India VUPICO Full time

    About the Role :We are seeking an experienced Gen AI Consultant/Developer with strong expertise in Java, Spring Boot, and AI-driven solutions. This is a hands-on role that requires both technical depth and the ability to collaborate effectively with cross-functional teams. The ideal candidate is a problem-solver who can bring AI concepts into practical...


  • Delhi, Delhi, India PMGlide Full time ₹ 15,00,000 - ₹ 45,00,000 per year

    Company DescriptionEffortlessly steer your projects to success with PMGlide's AI-powered project management automation. Our solutions automate the entire project lifecycle for seamless delivery. Consult our experts to discover how AI-powered tools streamline your success and eliminate all manual hurdles during the journey.Your Key ResponsibilitiesClient...


  • Delhi, Delhi, India DigiDarts Marketing Pvt. Ltd. Full time

    Business Development Manager - AI Consulting About the RoleWe are seeking a dynamic and results-oriented Growth Manager to lead the commercialization and expansion of our brand-new enterprise AI-consulting service. In this role, you will be responsible for defining our go-to-market strategy, driving the entire sales funnel from lead generation to deal...


  • Delhi, Delhi, India TrueFan AI Full time ₹ 5,00,000 - ₹ 10,00,000 per year

    About TrueFan AITrueFan AI is India's leading AI-led platform at the intersection of celebrities, brands, and consumers. For over 4 years, we have been pioneering generative AI - creating 6.4+ million minutes of content in 175+ languages. We are the only AI platform to partner with 52+ blue-chip brands (Hero MotoCorp, Bajaj Finserv, Zomato, Cipla, ICICI...

  • Gen AI Architect

    3 weeks ago


    Delhi, Delhi, India Talent Toppers Full time

    About the Company Global consulting, technology, and managed services company, supporting insurance organizations worldwide. With expertise across underwriting, claims, policy services, data, and digital transformation, the company helps insurers enhance efficiency and accelerate growth. Leveraging innovation, domain knowledge, and advanced technologies, it...

  • AI/ML Developer

    5 days ago


    Delhi, Delhi, India Tech four Engineering Solutions Pvt Ltd Full time ₹ 6,00,000 - ₹ 8,00,000 per year

    Position OverviewWe are looking for a Data Scientist with 1 year of hands-on experience and strongfoundational knowledge in Machine Learning (ML), Deep Learning (DL), Natural LanguageProcessing (NLP), Generative AI, Transformers, Retrieval-Augmented Generation (RAG),and Large Language Models (LLMs). The ideal candidate will contribute to building...


  • Delhi, Delhi, India MethdAI - The AI Learning Platform Full time ₹ 15,00,000 - ₹ 25,00,000 per year

    We are seeking aPhysical AI Engineerto design, develop, and implement AI-driven control and decision-making systems for humanoid robots and embodied agents. This role involves integratingvision-language-action models, reinforcement learning, imitation learning, and real-time robotics systemsto create robots capable of performing complex tasks in dynamic...