Lead Solution Architect AI/ML

1 day ago


Bengaluru, Karnataka, India | BayOne Solutions | Full time | ₹ 12,00,000 - ₹ 36,00,000 per year

Job Description:

We are seeking an experienced Lead Solutions Architect with deep expertise in AI/ML infrastructure, High Performance Computing (HPC), and container platforms to join our dynamic team focused on delivering Private Cloud AI and Enterprise AI Factory Solutions. This role is instrumental in architecting, deploying, and optimizing private cloud environments that leverage HPE's solutions co-developed with NVIDIA, as well as validated product reference architectures, to support enterprise-grade AI workloads at scale.

The ideal candidate will bring strong technical expertise in AI infrastructure, container orchestration platforms, and hybrid cloud environments, and will play a key role in delivering scalable, secure, and high-performance AI platform solutions powered by HPE GreenLake and NVIDIA AI Enterprise technologies.

Key Responsibilities:

Leadership and Strategy:

  • Provide delivery assurance and serve as the lead design authority to ensure seamless execution of enterprise-grade container platforms (including Red Hat OpenShift and SUSE Rancher), Private Cloud AI, and HPC/AI solutions, fully aligned with customer AI/ML strategies and business objectives.
  • Align solution architecture with NVIDIA Enterprise AI Factory design principles, including modular scalability, GPU optimization, and hybrid cloud orchestration.
  • Oversee planning, risk management, and stakeholder alignment throughout the project lifecycle to ensure successful outcomes.

Solution Planning and Design:

  • Architect and optimize end-to-end solutions across container orchestration and HPC workload management domains, leveraging platforms such as Red Hat OpenShift, SUSE Rancher, and/or workload schedulers like Slurm and Altair PBS Pro.
  • Ensure seamless integration of container and AI platforms with the broader software ecosystem, including NVIDIA AI Enterprise, as well as open-source DevOps, AI/ML tools, and frameworks.

Opportunity Assessment:

  • Lead technical responses to RFPs, RFIs, and customer inquiries, ensuring alignment with business and technical requirements.
  • Conduct proof-of-concept (PoC) engagements to validate solution feasibility, performance, and integration within customer environments.
  • Assess customer infrastructure and workloads to recommend optimal configurations using validated reference architectures from HPE and strategic partners such as Red Hat, NVIDIA, and SUSE, along with components from the open-source ecosystem.

Innovation and Research:

  • Stay current with emerging technologies, industry trends, and best practices across HPC, Kubernetes, container platforms, hybrid cloud, and security to inform solution design and innovation.

Customer-Centric Mindset:

  • Act as a trusted advisor to enterprise customers, ensuring alignment of AI solutions with business goals.
  • Translate complex technical concepts into value propositions for stakeholders.

Team Collaboration:

  • Collaborate with cross-functional teams, including subject matter experts in infrastructure components (such as HPE servers, storage, and networking) and data science teams, to ensure cohesive and integrated solution delivery.
  • Mentor technical consultants and contribute to internal knowledge sharing through tech talks and innovation forums.

Required Skills:

HPC & AI Infrastructure

  • Extensive knowledge of HPC technologies and workload schedulers such as Slurm and/or Altair PBS Pro.
  • Proficiency with HPC cluster management tools, including HPE Performance Cluster Manager (HPCM) and/or NVIDIA Base Command Manager.
  • Good understanding of high-speed networking technologies and stacks (InfiniBand, Mellanox, Ethernet) and of performance tuning for HPC components.

Containerization & Orchestration

  • Extensive hands-on experience with containerization technologies such as Docker, Podman, and Singularity.
  • Proficiency with at least two container orchestration platforms: CNCF Kubernetes, Red Hat OpenShift, SUSE Rancher (RKE/K3s), Canonical Charmed Kubernetes.
  • Strong understanding of GPU technologies, including the NVIDIA GPU Operator for Kubernetes-based environments and DCGM (Data Center GPU Manager) for GPU health and performance monitoring.

Operating Systems & Virtualization

  • Extensive experience in Linux system administration, including package management, boot process troubleshooting, performance tuning, and network configuration.
  • Proficient with multiple Linux distributions, with hands-on expertise in at least two of the following: RHEL, SLES, and Ubuntu.
  • Experience with virtualization technologies, including KVM and OpenShift Virtualization, for deploying and managing virtualized workloads in hybrid cloud environments.

Cloud, DevOps & MLOps

  • Solid understanding of hybrid cloud architectures and experience working with major cloud platforms in conjunction with on-premises infrastructure.
  • Familiarity with DevOps practices, including CI/CD pipelines, infrastructure as code (IaC), and microservices-based application delivery.
  • Experience integrating and operationalizing open-source AI/ML tools and frameworks, supporting the full model lifecycle from development to deployment.
  • Good understanding of cloud-native security, observability, and compliance frameworks, ensuring secure and reliable AI/ML operations at scale.

Networking & Protocols

  • Strong understanding of core networking principles, including DNS, TCP/IP, routing, and load balancing, essential for designing resilient and scalable infrastructure.
  • Working knowledge of key network protocols, such as S3, NFS, and SMB/CIFS, for data access, transfer, and integration across hybrid environments.

Programming & Automation

  • Proficiency in scripting or programming languages such as Python and Bash.
  • Experience automating infrastructure and AI workflows.

Soft Skills & Leadership

  • Excellent problem-solving, analytical thinking, and communication skills for engaging both technical and non-technical stakeholders.
  • Proven ability to lead complex technical projects from requirements gathering through architecture, design, and delivery.
  • Strong business acumen with the ability to align technical solutions with client challenges and objectives.

Qualifications:

  • Bachelor's or master's degree in Computer Science, Information Technology, or a related field.
  • Professional certifications in AI infrastructure, containers, and Kubernetes are highly desirable, such as RHCSA, RHCE, CNCF certifications (CKA, CKAD, CKS), and NVIDIA-Certified Associate - AI Infrastructure and Operations.
  • Typically, 8–10 years of hands-on experience in architecting and implementing HPC, AI/ML, and container platform solutions within hybrid or private cloud environments, with a strong focus on scalability, performance, and enterprise integration.
