Senior Solutions Architect, Generative AI

3 weeks ago


bangalore, India NVIDIA Full time

NVIDIA is seeking a dynamic and experienced Generative AI Solution Architect with specialized expertise in training Large Language Models (LLMs) and implementing workflows based on Retrieval-Augmented Generation (RAG). As a key member of our AI Solutions team, you will play a pivotal role in architecting and delivering cutting-edge solutions that leverage the power of NVIDIA's generative AI technologies. This position requires a deep understanding of language models, particularly LLMs, and a strong proficiency in designing and implementing RAG-based workflows.
What you will be doing

  • Architect end-to-end generative AI solutions with a focus on LLMs and RAG workflows.

  • Collaborate closely with customers to understand their language-related business challenges and design tailored solutions.

  • Collaborate with sales and business development teams to support pre-sales activities, including technical presentations and demonstrations of LLM and RAG capabilities.

  • Work closely with NVIDIA engineering teams to provide feedback and contribute to the evolution of generative AI technologies.

  • Engage directly with customers to understand their language-related requirements and challenges.

  • Lead workshops and design sessions to define and refine generative AI solutions focused on LLMs and RAG workflows and lead the training and optimization of Large Language Models using NVIDIA’s hardware and software platforms.

  • Implement strategies for efficient and effective training of LLMs to achieve optimal performance.

  • Design and implement RAG-based workflows to enhance content generation and information retrieval.

  • Work closely with customers to integrate RAG workflows into their applications and systems and stay abreast of the latest developments in language models and generative AI technologies.

  • Provide technical leadership and guidance on best practices for training LLMs and implementing RAG-based solutions.


What we need to see

  • Master's or Ph.D. in Computer Science, Artificial Intelligence, or equivalent experience

  • 5+ years of hands-on experience in a technical role, specifically focusing on generative AI, with a strong emphasis on training Large Language Models (LLMs).

  • Proven track record of successfully deploying and optimizing LLM models for inference in production environments.

  • In-depth understanding of state-of-the-art language models, including but not limited to GPT-3, BERT, or similar architectures.

  • Expertise in training and fine-tuning LLMs using popular frameworks such as TensorFlow, PyTorch, or Hugging Face Transformers.

  • Proficiency in model deployment and optimization techniques for efficient inference on various hardware platforms, with a focus on GPUs.

  • Strong knowledge of GPU cluster architecture and the ability to leverage parallel processing for accelerated model training and inference.

  • Excellent communication and collaboration skills with the ability to articulate complex technical concepts to both technical and non-technical stakeholders.

  • Experience leading workshops, training sessions, and presenting technical solutions to diverse audiences.


Ways To Stand Out From The Crowd

  • Experience in deploying LLM models in cloud environments (e.g., AWS, Azure, GCP) and on-premises infrastructure.

  • Proven ability to optimize LLM models for inference speed, memory efficiency, and resource utilization.

  • Familiarity with containerization technologies (e.g., Docker) and orchestration tools (e.g., Kubernetes) for scalable and efficient model deployment.

  • Deep understanding of GPU cluster architecture, parallel computing, and distributed computing concepts.

  • Hands-on experience with NVIDIA GPU technologies, and GPU cluster management and ability to design and implement scalable and efficient workflows for LLM training and inference on GPU clusters

With competitive salaries and a generous benefits package, we are widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us and, due to unprecedented growth, our exclusive engineering teams are rapidly growing. If you're a creative and autonomous engineer with a real passion for technology, we want to hear from you NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.



  • bangalore, India NVIDIA Full time

    NVIDIA is seeking a dynamic and experienced Generative AI Solution Architect with specialized expertise in training Large Language Models (LLMs) and implementing workflows based on Retrieval-Augmented Generation (RAG). As a key member of our AI Solutions team, you will play a pivotal role in architecting and delivering cutting-edge solutions that leverage...


  • bangalore, India NVIDIA Full time

    NVIDIA is seeking a dynamic and experienced Generative AI Solution Architect with specialized expertise in training Large Language Models (LLMs) and implementing workflows based on Pretraining, Finetuning LLMs & Retrieval-Augmented Generation (RAG). As a key member of our AI Solutions team, you will play a pivotal role in architecting and delivering...


  • Hyderabad/Bangalore, India Codersbrain technology pvt ltd Full time

    Position Name : AI ArchitectExperience Required : 9 - 12 YearsLocation : HyderabadNotice Period : immediate - 15 days Salary : 23-35 LPA JD :- Collaborate with cross-functional teams to understand business requirements and goals.- Generative AI Solution Design: Design and architect generative AI solutions that aligns with requirements.- Work closely with...


  • Hyderabad/Bangalore, Karnataka, India Codersbrain technology pvt ltd Full time

    Position Name : AI ArchitectExperience Required : 9 - 12 YearsLocation : HyderabadNotice Period : immediate - 15 daysSalary : 23-35 LPA JD :- Collaborate with cross-functional teams to understand business requirements and goals.- Generative AI Solution Design: Design and architect generative AI solutions that aligns with requirements.- Work closely with...


  • Hyderabad,Bangalore, India Codersbrain technology pvt ltd Full time

    Position Name : AI ArchitectExperience Required : 9 - 12 YearsLocation : HyderabadNotice Period : immediate - 15 days Salary : 23-35 LPA JD :- Collaborate with cross-functional teams to understand business requirements and goals.- Generative AI Solution Design: Design and architect generative AI solutions that aligns with requirements.- Work closely with...


  • bangalore, India C3 IoT Full time

    C3.ai, Inc. (NYSE:AI) is a leading Enterprise AI software provider for accelerating digital transformation. The proven C3 AI Platform provides comprehensive services to build enterprise-scale AI applications more efficiently and cost-effectively than alternative approaches. The C3 AI Platform supports the value chain in any industry with prebuilt,...


  • bangalore, India C3 IoT Full time

    C3.ai, Inc. (NYSE:AI) is a leading Enterprise AI software provider for accelerating digital transformation. The proven C3 AI Platform provides comprehensive services to build enterprise-scale AI applications more efficiently and cost-effectively than alternative approaches. The C3 AI Platform supports the value chain in any industry with prebuilt,...


  • bangalore, India NVIDIA Full time

    At NVIDIA, our passion is working with the world's most challenging problems in LLM, MLLM ,Generative AI, RAGs using our innovative platforms. A Senior Solution Architect brings focus and technical expertise about NVIDIA technological advances to our partners and customers. What You’ll Be Doing: You will be part of our India Solution Architecture Team...


  • bangalore, India NVIDIA Full time

    At NVIDIA, our passion is working with the world's most challenging problems in LLM, MLLM ,Generative AI, RAGs using our innovative platforms. A Senior Solution Architect brings focus and technical expertise about NVIDIA technological advances to our partners and customers. What You’ll Be Doing: You will be part of our India Solution Architecture Team...

  • Generative AI

    1 week ago


    bangalore, India Alp Consulting Limited Full time

    Project Role : Solution Architect Project Role Description : Own the overall solution blueprint and roadmap, work closely with clients to articulate business problems and translate them into an appropriate solution design. Must have skills : Solution Architecture Good to have skills : NA Minimum 7.5 year(s) of experience is required Educational...


  • bangalore, India Valiance Solutions Full time

    Company Overview: We are a dynamic and innovative technology company leveraging cutting-edge cloud technologies and generative AI to create groundbreaking solutions that address complex challenges across various industries. We are seeking a skilled AI Engineer to to drive the architecture and development of our AI/ML applications, contributing to our growth...

  • AI Architect

    7 days ago


    bangalore, India Recruise India Consulting Pvt Ltd Full time

    Cloud Architect (AI) AI Architect 10+ years of exp 10+ years of exp multi cloud skills, AWS/Azure Exp in generative AI solutions Designing cloud solutions from platforms perspective and data perspective (data pipeline) Basics/good cloud understanding cloud design principles (serverless, dedicated server based approach, etc) hands on knowledge on cloud...


  • bangalore, India Genpact Full time

    Genpact (NYSE: G) is a global professional services and solutions firm delivering outcomes that shape the future. Our 125,000+ people across 30+ countries are driven by our innate curiosity, entrepreneurial agility, and desire to create lasting value for clients. Powered by our purpose – the relentless pursuit of a world that works better for people –...


  • bangalore, India Genpact Full time

    Genpact (NYSE: G) is a global professional services and solutions firm delivering outcomes that shape the future. Our 125,000+ people across 30+ countries are driven by our innate curiosity, entrepreneurial agility, and desire to create lasting value for clients. Powered by our purpose – the relentless pursuit of a world that works better for people –...

  • Ai Architect

    24 hours ago


    Bangalore City, India Recruise India Consulting Pvt Ltd Full time

    Cloud Architect (AI) AI Architect 10+ years of exp 10+ years of exp multi cloud skills, AWS/Azure Exp in generative AI solutions Designing cloud solutions from platforms perspective and data perspective (data pipeline) Basics/good cloud understanding cloud design principles (serverless, dedicated server based approach, etc) hands on knowledge on cloud...


  • Bangalore, Karnataka, India Tekfortune IT India Pvt Ltd Full time

    Title: GenAI Engineer, GenAI Solution Architect, GenAI App Developer, GenAI Platform Engineers (Multiple Positions) Experience : 5-12 YearsLocation : Bangalore location (Hybrid)Notice period : Immediate to 30 Days MaxKey Skills : Generative AI, Solutions Architect Exp, Implementation of GenAI/ ML solutions, Compliance/Security requirements for GenAI/ ML, LLM...

  • AI Architect

    3 weeks ago


    bangalore, India Dell International Services India Pvt Ltd (7451) Full time

    Senior Software Principal Engineer and Technical Staff, Software Engineering The Software Engineering team delivers next-generation software application enhancements and new products for a changing world. Working at the cutting edge, we design and develop software for platforms, peripherals, applications and diagnostics — all with the most advanced...

  • AI Architect

    4 weeks ago


    bangalore, India Dell International Services India Pvt Ltd (7451) Full time

    Senior Software Principal Engineer and Technical Staff, Software Engineering The Software Engineering team delivers next-generation software application enhancements and new products for a changing world. Working at the cutting edge, we design and develop software for platforms, peripherals, applications and diagnostics — all with the most advanced...


  • bangalore, India AXA Full time

    About AXAAs one of the largest global insurers, our purpose is to act for human progress by protecting what matters .Protection has always been at the core of our business, helping individuals, businesses and societies to thrive. And AXA has always been a leader, an innovator, an entrepreneurial company, fostering progress in all its dimensions. Our purpose...


  • bangalore, India AXA Full time

    About AXAAs one of the largest global insurers, our purpose is to act for human progress by protecting what matters .Protection has always been at the core of our business, helping individuals, businesses and societies to thrive. And AXA has always been a leader, an innovator, an entrepreneurial company, fostering progress in all its dimensions. Our purpose...