Senior Solutions Architect, Generative AI

1 month ago


Bangalore, India NVIDIA Full time

NVIDIA is seeking a dynamic and experienced Generative AI Solutions Architect with specialized expertise in training Large Language Models (LLMs) and implementing workflows based on Retrieval-Augmented Generation (RAG). As a key member of our AI Solutions team, you will play a pivotal role in architecting and delivering cutting-edge solutions that leverage the power of NVIDIA's generative AI technologies. This position requires a deep understanding of language models, particularly LLMs, and strong proficiency in designing and implementing RAG-based workflows.
What you will be doing

  • Architect end-to-end generative AI solutions with a focus on LLMs and RAG workflows.

  • Collaborate closely with customers to understand their language-related business challenges and design tailored solutions.

  • Collaborate with sales and business development teams to support pre-sales activities, including technical presentations and demonstrations of LLM and RAG capabilities.

  • Work closely with NVIDIA engineering teams to provide feedback and contribute to the evolution of generative AI technologies.

  • Engage directly with customers to understand their language-related requirements and challenges.

  • Lead workshops and design sessions to define and refine generative AI solutions focused on LLMs and RAG workflows.

  • Lead the training and optimization of Large Language Models using NVIDIA’s hardware and software platforms.

  • Implement strategies for efficient and effective training of LLMs to achieve optimal performance.

  • Design and implement RAG-based workflows to enhance content generation and information retrieval.

  • Work closely with customers to integrate RAG workflows into their applications and systems.

  • Stay abreast of the latest developments in language models and generative AI technologies.

  • Provide technical leadership and guidance on best practices for training LLMs and implementing RAG-based solutions.


What we need to see

  • Master's or Ph.D. in Computer Science, Artificial Intelligence, or equivalent experience.

  • 5+ years of hands-on experience in a technical role, specifically focusing on generative AI, with a strong emphasis on training Large Language Models (LLMs).

  • Proven track record of successfully deploying and optimizing LLMs for inference in production environments.

  • In-depth understanding of state-of-the-art language models, including but not limited to GPT-3, BERT, or similar architectures.

  • Expertise in training and fine-tuning LLMs using popular frameworks such as TensorFlow, PyTorch, or Hugging Face Transformers.

  • Proficiency in model deployment and optimization techniques for efficient inference on various hardware platforms, with a focus on GPUs.

  • Strong knowledge of GPU cluster architecture and the ability to leverage parallel processing for accelerated model training and inference.

  • Excellent communication and collaboration skills with the ability to articulate complex technical concepts to both technical and non-technical stakeholders.

  • Experience leading workshops, training sessions, and presenting technical solutions to diverse audiences.


Ways to stand out from the crowd

  • Experience deploying LLMs in cloud environments (e.g., AWS, Azure, GCP) and on on-premises infrastructure.

  • Proven ability to optimize LLMs for inference speed, memory efficiency, and resource utilization.

  • Familiarity with containerization technologies (e.g., Docker) and orchestration tools (e.g., Kubernetes) for scalable and efficient model deployment.

  • Deep understanding of GPU cluster architecture, parallel computing, and distributed computing concepts.

  • Hands-on experience with NVIDIA GPU technologies and GPU cluster management, and the ability to design and implement scalable and efficient workflows for LLM training and inference on GPU clusters.

With competitive salaries and a generous benefits package, we are widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us and, due to unprecedented growth, our exclusive engineering teams are rapidly growing. If you're a creative and autonomous engineer with a real passion for technology, we want to hear from you.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.


