HPC Network Engineer

1 day ago


India Stealth AI Startup Full time

Job Title: HPC Network Engineer

Location: Hyderabad or Mumbai

Experience: Minimum 5 years of relevant network experience


Job Overview:

We are seeking a highly skilled and experienced HPC Network Engineer to join our team. The ideal candidate will have a strong background in setting up and managing high-performance computing (HPC) networks with cutting-edge technologies such as 400G and 800G network connectivity. This role involves designing, implementing, and troubleshooting complex network architectures tailored to HPC and GPU-based systems. The engineer will also play a critical role in enabling efficient GPU interconnects and scaling AI and HPC workloads.


Key Responsibilities:


HPC Network Deployment:

  • Design, deploy, and maintain HPC networks with 400G/800G connectivity.
  • Optimize network performance for large-scale computing environments.

Advanced Networking Expertise:

  • Deep understanding of and hands-on experience with RoCE (RDMA over Converged Ethernet) and InfiniBand technologies.
  • Collaborate with cross-functional teams to architect and implement robust HPC networking solutions.

Architectural Design and Communication:

  • Develop and present complex network architectures to technical and non-technical stakeholders.
  • Translate customer requirements into scalable and efficient network designs.

GPU Communication Frameworks:

  • Expertise in NVLink and NVSwitch for high-speed GPU-to-GPU communication.
  • Optimize interconnects for distributed training and inference workloads.

Technology Expertise:

  • Hands-on experience with switches and networking equipment from Broadcom, Arista, Mellanox, Juniper, Cisco, SONiC, or Dell.
  • Familiarity with NVIDIA, AMD, and Intel HPC architectures and their network integration requirements.

Storage Networking for HPC and AI:

  • Integrate GPUDirect Storage and NVMe-oF for efficient data movement between storage and GPUs.
  • Optimize data pathways for high-speed storage access in HPC workloads.

Problem Solving and Troubleshooting:

  • Monitor, analyze, and troubleshoot network performance issues.
  • Implement monitoring tools to ensure high availability and reliability of the HPC network.

Customer-Centric Solutions:

  • Engage with customers to understand their requirements and deliver tailored solutions quickly.
  • Provide ongoing support and documentation for implemented solutions.

Comprehensive Network Knowledge:

  • Expertise in end-to-end network monitoring, analysis, troubleshooting, and implementation.
  • Stay updated on industry trends, standards, and best practices for HPC networking.

AI and HPC Workload Integration:

  • Support hybrid workloads combining AI and traditional HPC tasks.
  • Scale large language models and scientific simulations across GPU clusters with minimal latency.


Required Skills and Qualifications:


  1. Minimum 5 years of hands-on experience in core network engineering.
  2. Proven expertise in configuring and managing 400G or 800G network environments.
  3. Strong knowledge of RoCE and InfiniBand protocols.
  4. Hands-on experience with NVLink and NVSwitch in GPU-based environments.
  5. Familiarity with networking equipment and technologies from vendors such as Broadcom, Arista, Mellanox, Juniper, Cisco, SONiC, or Dell.
  6. Experience working with NVIDIA, AMD, or Intel HPC and GPU architectures.
  7. Ability to conceptualize and explain complex network designs to diverse audiences.
  8. Strong analytical and troubleshooting skills in high-performance environments.
  9. Excellent communication and customer engagement skills for gathering requirements and delivering effective solutions.


Preferred Qualifications:

  1. Industry certifications such as CCNP, CCIE, or equivalent.
  2. Experience in scripting and automation for network operations.
  3. Exposure to large-scale HPC deployments in data center environments.
  4. Knowledge of software-defined networking (SDN) and virtualized networking environments.
  5. Familiarity with AI-specific frameworks like TensorFlow, PyTorch, or Horovod in distributed setups.


Why Join Us?

  • Work on cutting-edge HPC and GPU-based technologies.
  • Collaborate with industry leaders in AI and cloud infrastructure.
  • Competitive compensation and growth opportunities.
  • Opportunity to work in a dynamic and fast-paced environment.



