Senior Solutions Architect, Generative AI

1 month ago


Mumbai, India NVIDIA Full time
NVIDIA is seeking a dynamic and experienced Generative AI Solution Architect with specialized expertise in training Large Language Models (LLMs) and implementing workflows based on Retrieval-Augmented Generation (RAG). As a key member of our AI Solutions team, you will play a pivotal role in architecting and delivering cutting-edge solutions that leverage the power of NVIDIA's generative AI technologies. This position requires a deep understanding of language models, particularly LLMs, and a strong proficiency in designing and implementing RAG-based workflows.What you will be doingArchitect end-to-end generative AI solutions with a focus on LLMs and RAG workflows.

Work closely with our customers stakeholders like product, business, engineering and research in defining their product roadmap based on our AI Software & Hardware platforms.

Work closely with NVIDIA engineering teams to provide feedback and contribute to the evolution of generative AI technologies.

Lead workshops and design sessions to define and refine generative AI solutions focused on LLMs and RAG workflows and lead the training and optimization of Large Language Models using NVIDIA’s hardware and software platforms.

Implement strategies for efficient and effective training of LLMs to achieve optimal performance.

Design and implement RAG-based workflows to enhance content generation and information retrieval.

Work closely with customers to integrate RAG workflows into their applications and systems and stay abreast of the latest developments in language models and generative AI technologies.

Provide technical leadership and guidance on best practices for training LLMs and implementing RAG-based solutions.

What we need to seeMaster's or Ph.D. in Computer Science, Artificial Intelligence, or equivalent experience

5+ years of hands-on experience in a technical role, specifically focusing on generative AI, with a strong emphasis on training Large Language Models (LLMs).

Proven track record of successfully deploying and optimizing LLM models for inference in production environments.

In-depth understanding of state-of-the-art language models, including but not limited to GPT-3, BERT, or similar architectures.

Expertise in training and fine-tuning LLMs using popular frameworks such as TensorFlow, PyTorch, or Hugging Face Transformers.

Proficiency in model deployment and optimization techniques for efficient inference on various hardware platforms, with a focus on GPUs.

Strong knowledge of GPU cluster architecture and the ability to leverage parallel processing for accelerated model training and inference.

Excellent communication and collaboration skills with the ability to articulate complex technical concepts to both technical and non-technical stakeholders.

Experience leading workshops, training sessions, and presenting technical solutions to diverse audiences.

Ways To Stand Out From The CrowdExperience in deploying LLM models in cloud environments (e.g., AWS, Azure, GCP) and on-premises infrastructure.

Proven ability to optimize LLM models for inference, memory efficiency, and utilization.

Familiarity with containerization technologies (e.g., Docker) and orchestration tools (e.g., Kubernetes) for scalable and efficient model deployment.

Deep understanding of GPU cluster architecture, parallel computing, and distributed computing concepts.

Hands-on experience with NVIDIA GPU technologies, and GPU cluster management and ability to design and implement scalable and efficient workflows for LLM training and inference on GPU clusters

With competitive salaries and a generous benefits package, we are widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us and, due to unprecedented growth, our exclusive engineering teams are rapidly growing. If you're a creative and autonomous engineer with a real passion for technology, we want to hear from you NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.



  • mumbai, India NVIDIA Full time

    NVIDIA is seeking a dynamic and experienced Generative AI Solution Architect with specialized expertise in training Large Language Models (LLMs) and implementing workflows based on Retrieval-Augmented Generation (RAG). As a key member of our AI Solutions team, you will play a pivotal role in architecting and delivering cutting-edge solutions that leverage...


  • Mumbai, India NVIDIA Full time

    NVIDIA is seeking a dynamic and experienced Generative AI Solution Architect with specialized expertise in training Large Language Models (LLMs) and implementing workflows based on Retrieval-Augmented Generation (RAG). As a key member of our AI Solutions team, you will play a pivotal role in architecting and delivering cutting-edge solutions that leverage...


  • Mumbai, Maharashtra, India Google Full time

    **Minimum qualifications**: - Bachelor's degree in Computer Science, Data Science, or equivalent practical experience. - Experience with Python and ML frameworks (e.g., TensorFlow, PyTorch). - Experience delivering technical presentations and leading business value sessions. - Experience in Generative AI as a user or a developer. - Experience working in...

  • Generative AI Engineer

    2 months ago


    Mumbai, India Valiance Solutions Full time

    Company Overview: We are a dynamic and innovative technology company leveraging cutting-edge cloud technologies and generative AI to create groundbreaking solutions that address complex challenges across various industries. We are seeking a skilled AI Engineer to to drive the architecture and development of our AI/ML applications, contributing to our growth...

  • Generative AI Engineer

    2 months ago


    Mumbai, India Valiance Solutions Full time

    Company Overview:We are a dynamic and innovative technology company leveraging cutting-edge cloud technologies and generative AI to create groundbreaking solutions that address complex challenges across various industries. We are seeking a skilled AI Engineer to to drive the architecture and development of our AI/ML applications, contributing to our growth...

  • Generative AI Engineer

    2 months ago


    Mumbai, India Valiance Solutions Full time

    Company Overview:We are a dynamic and innovative technology company leveraging cutting-edge cloud technologies and generative AI to create groundbreaking solutions that address complex challenges across various industries. We are seeking a skilled AI Engineer to to drive the architecture and development of our AI/ML applications, contributing to our growth...


  • Mumbai, India NEC Software Solutions Full time

    Company DescriptionRave Technologies – A Northgate Public Services Company, is a software services company that works with small and medium sized organisations. We are a part of Northgate Public Services Company which is based in the UK.Based in Mumbai, Rave Technologies’ end to end product/application engineering services help address challenges in the...


  • mumbai, India NEC Software Solutions Full time

    Company Description Rave Technologies – A Northgate Public Services Company, is a software services company that works with small and medium sized organisations. We are a part of Northgate Public Services Company which is based in the UK. Based in Mumbai, Rave Technologies’ end to end product/application engineering services help address challenges in...


  • Mumbai, India Capgemini Full time

    **Job Description**: - We are seeking an experienced Senior Solutions Architect/ Presales Solution Architect/ Innovation Solution Architect with a strong background in data and AI, Digital Twin, Immersive experience technologies to join our dynamic team.- As a Senior Solutions Architect, you will be responsible for driving customers' data and AI solutions,...


  • Mumbai, India Capgemini Full time

    Job DescriptionWe are seeking an experienced Senior Solutions Architect/ Presales Solution Architect/ Innovation Solution Architect with a strong background in data and AI, Digital Twin, Immersive experience technologies to join our dynamic team.As a Senior Solutions Architect, you will be responsible for driving customers' data and AI solutions, Horizon 1...

  • Gen AI Architect

    2 months ago


    Mumbai, Maharashtra, India Cognizant Technology Solutions Full time

    Job Title : Gen AI ArchitectExp: 14 to 18 yrs Location : Pan India As a Gen AI Architect (General Artificial Intelligence Architect), your primary role is to design and develop artificial intelligence systems that have the capability to learn, reason, and make decisions autonomously. You will oversee the architecture and infrastructure required to build and...

  • Gen Ai Architect

    1 month ago


    Mumbai, Maharashtra, India Cognizant Technology Solutions Full time

    **Job Description: Gen AI Architect** **Exp: 14 to 18 yrs** **Location : Mumbai & Bangalore.** As a Gen AI Architect (General Artificial Intelligence Architect), your primary role is to design and develop artificial intelligence systems that have the capability to learn, reason, and make decisions autonomously. You will oversee the architecture and...


  • mumbai, India Capgemini Full time

    Job Description • We are seeking an experienced Senior Solutions Architect/ Presales Solution Architect/ Innovation Solution Architect with a strong background in data and AI, Digital Twin, Immersive experience technologies to join our dynamic team. • As a Senior Solutions Architect, you will be responsible for driving customers' data and AI solutions,...


  • Mumbai, India Capgemini Full time

    Job Description• We are seeking an experienced Senior Solutions Architect/ Presales Solution Architect/ Innovation Solution Architect with a strong background in data and AI, Digital Twin, Immersive experience technologies to join our dynamic team. • As a Senior Solutions Architect, you will be responsible for driving customers' data and AI solutions,...


  • Mumbai, India Capgemini Full time

    Job Description We are seeking an experienced Senior Solutions Architect/ Presales Solution Architect/ Innovation Solution Architect with a strong background in data and AI, Digital Twin, Immersive experience technologies to join our dynamic team.  As a Senior Solutions Architect, you will be responsible for driving customers' data and AI...


  • Mumbai, India NEC Full time

    Job DescriptionJob Summary: We are seeking a hands-on AI Architect to join our AI Tech Team. The candidate will be responsible for spearheading the design and development of AI solutions within the Cloud frameworks, with a focus on Google Cloud Platform (GCP). This role is crucial to our goal of providing cutting-edge AI solutions that deliver substantial...


  • mumbai, India NEC Full time

    Job Description Job Summary: We are seeking a hands-on AI Architect to join our AI Tech Team. The candidate will be responsible for spearheading the design and development of AI solutions within the Cloud frameworks, with a focus on Google Cloud Platform (GCP). This role is crucial to our goal of providing cutting-edge AI solutions that deliver...


  • Mumbai, Maharashtra, India NEC Software Solutions Full time

    Company Description Rave Technologies - A Northgate Public Services Company, is a software services company that works with small and medium sized organisations. We are a part of Northgate Public Services Company which is based in the UK. **Job Description**: Key Responsibilities: - Lead the design, development, and implementation of AI solutions in the...


  • Mumbai, Maharashtra, India NEC Software Solutions (India) Full time

    **Company Description** Rave Technologies - A Northgate Public Services Company, is a software services company that works with small and medium sized organisations. We are a part of Northgate Public Services Company which is based in the UK. Key Responsibilities: - Lead the design, development, and implementation of AI solutions in the GCP Cloud...

  • Python AI Engineer

    6 days ago


    Navi Mumbai, India Bizdeed HR Solutions Pvt.Ltd Full time

    Urgent Hiring for - Python AI Engineer - Navi Mumbai (On-site)- 6-8 Years- Full Time - Hybrid (3 Days a Week)- Shift Timings - 11 am - 8 pm (IST)CTC- 20 lpa - 25 lpaJob Overview: We are seeking a skilled AI Engineer with expertise in Generative AI and Prompt Engineering to join our innovative team. This role is essential for developing, deploying, and...