HPC Network Engineer
1 day ago
Job Title: HPC Network Engineer
Location: Hyderabad or Mumbai
Experience: Minimum 5 years of relevant network experience
Job Overview:
We are seeking a highly skilled and experienced HPC Network Engineer to join our team. The ideal candidate will have a strong background in setting up and managing high-performance computing (HPC) networks with cutting-edge technologies such as 400G and 800G network connectivity. This role involves designing, implementing, and troubleshooting complex network architectures tailored to HPC and GPU-based systems. The engineer will also play a critical role in enabling efficient GPU interconnects and scaling AI and HPC workloads.
Key Responsibilities:
HPC Network Deployment :
- Design, deploy, and maintain HPC networks with 400G/800G connectivity.
- Optimize network performance for large-scale computing environments.
Advanced Networking Expertise :
- Deep understanding and hands-on experience with RoCE (RDMA over Converged Ethernet) and Infiniband technologies.
- Collaborate with cross-functional teams to architect and implement robust HPC networking solutions.
Architectural Design and Communication :
- Develop and present complex network architectures to technical and non-technical stakeholders.
- Translate customer requirements into scalable and efficient network designs.
GPU Communication Frameworks :
- Expertise in NVLink and NVSwitch for high-speed GPU-to-GPU communication.
- Optimize interconnects for distributed training and inference workloads.
Technology Expertise :
- Hands-on experience with switches and networking equipment from Broadcom, Arista, Mellanox, Juniper, Cisco, SONiC, or Dell .
- Familiarity with NVIDIA, AMD, and Intel HPC architectures and their network integration requirements.
Storage Networking for HPC and AI :
- Integrate GPUDirect Storage and NVMe-oF for efficient data movement between storage and GPUs.
- Optimize data pathways for high-speed storage access in HPC workloads.
Problem Solving and Troubleshooting :
- Monitor, analyze, and troubleshoot network performance issues.
- Implement monitoring tools to ensure high availability and reliability of the HPC network.
Customer-Centric Solutions :
- Engage with customers to understand their requirements and deliver tailored solutions on the fly.
- Provide ongoing support and documentation for implemented solutions.
Comprehensive Network Knowledge :
- Expertise in end-to-end network monitoring, analysis, troubleshooting, and implementation.
- Stay updated on industry trends, standards, and best practices for HPC networking.
AI and HPC Workload Integration :
- Support hybrid workloads combining AI and traditional HPC tasks.
- Scale large language models and scientific simulations across GPU clusters with minimal latency.
Required Skills and Qualifications:
- Minimum 5 years of hands-on experience in core network engineering.
- Proven expertise in configuring and managing 400G or 800G network environments.
- Strong knowledge of RoCE and Infiniband protocols.
- Hands-on experience with NVLink and NVSwitch in GPU-based environments.
- Familiarity with networking equipment and technologies from vendors such as Broadcom, Arista, Mellanox, Juniper, Cisco, SONiC, or Dell .
- Experience working with NVIDIA, AMD, or Intel HPC and GPU architectures .
- Ability to conceptualize and explain complex network designs to diverse audiences.
- Strong analytical and troubleshooting skills in high-performance environments.
- Excellent communication and customer engagement skills to address requirements and provide solutions effectively.
Preferred Qualifications:
- Industry certifications such as CCNP, CCIE, or equivalent.
- Experience in scripting and automation for network operations.
- Exposure to large-scale HPC deployments in data center environments.
- Knowledge of software-defined networking (SDN) and virtualized networking environments.
- Familiarity with AI-specific frameworks like TensorFlow , PyTorch , or Horovod in distributed setups.
Why Join Us?
- Work on cutting-edge HPC and GPU-based technologies.
- Collaborate with industry leaders in AI and cloud infrastructure.
- Competitive compensation and growth opportunities.
- Opportunity to work in a dynamic and fast-paced environment.
-
Hpc network engineer
1 day ago
India Stealth AI Startup Full timeJob Title: HPC Network Engineer Location: Hyderabad or Mumbai Experience: Minimum 5 years of relevant network experience Job Overview: We are seeking a highly skilled and experienced HPC Network Engineer to join our team. The ideal candidate will have a strong background in setting up and managing high-performance computing (HPC) networks...
-
HPC Network Engineer
3 days ago
India Stealth AI Startup Full timeJob Title: HPC Network EngineerLocation: Hyderabad or MumbaiExperience: Minimum 5 years of relevant network experienceJob Overview:We are seeking a highly skilled and experienced HPC Network Engineer to join our team. The ideal candidate will have a strong background in setting up and managing high-performance computing (HPC) networks with cutting-edge...
-
HPC Network Engineer
1 day ago
India Stealth AI Startup Full timeJob Title: HPC Network Engineer Location: Hyderabad or Mumbai Experience: Minimum 5 years of relevant network experience Job Overview: We are seeking a highly skilled and experienced HPC Network Engineer to join our team. The ideal candidate will have a strong background in setting up and managing high-performance computing (HPC) networks...
-
Stealth AI Startup | HPC Network Engineer
3 days ago
india Stealth AI Startup Full timeJob Title: HPC Network Engineer Location: Hyderabad or Mumbai Experience: Minimum 5 years of relevant network experience Job Overview: We are seeking a highly skilled and experienced HPC Network Engineer to join our team. The ideal candidate will have a strong background in setting up and managing high-performance computing (HPC) networks with...
-
Advanced Network Architect for AI and HPC
10 hours ago
India Stealth AI Startup Full timeHPC Network Engineer Job DescriptionWe are seeking a highly skilled and experienced HPC Network Engineer to design, implement, and troubleshoot complex network architectures tailored to HPC and GPU-based systems.HPC Network DeploymentDeploy and maintain HPC networks with 400G/800G connectivity.Optimize network performance for large-scale computing...
-
HPC Network Infrastructure Expert
1 day ago
India Stealth AI Startup Full timeAbout UsStealth AI Startup is an innovative company that focuses on developing cutting-edge technologies in High-Performance Computing (HPC) and Artificial Intelligence (AI). We strive to push the boundaries of what is possible with advanced network infrastructure and accelerate innovation in various fields.Why Work with UsWork on the latest HPC and...
-
KLA | Cloud Engineer
6 days ago
india KLA Full timeResponsibilities:Optimize performance and scalability of HPC applications running in containerized environments.Stay up to date with the latest advancements in HPC, cloud technologies.Collaborate with other DevOps engineers and developers to ensure seamless integration of HPC solutions.Configure Linux OS for HPC needs.Implement and maintain Kubernetes...
-
HPC Infrastructure Specialist
1 day ago
India iVedha Inc. Full timeJob OverviewiVedha Inc. is seeking an experienced HPC Solution Architect to design and implement scalable High-Performance Computing clusters within a data center environment. This role involves working with NVIDIA GPUs, InfiniBand networking, and state-of-the-art technologies to deliver efficient solutions.Key Responsibilities1. Design end-to-end HPC...
-
SHI | Cymune
5 days ago
india SHI | Cymune - An SHI Company Full timeJob Opening: Technology Associate – HPC & Linux Location : MUMBAI Experience : Minimum 1-2 Years in High-Performance Computing (HPC) Should be open for Field Support We are currently seeking a Technology Associate with hands-on experience in HPC and Linux systems. If you have a passion for working with Linux environments and HPC systems, we would love to...
-
8bit.ai | HPC Architect
5 days ago
india 8bit.ai Full timeABOUT 8bit.ai We are 8bit.ai, a pioneering new initiative from CtrlS and Cloud4C group, focused on developing a high-performance multi-technology, vendor-independent, and xPU-based Accelerated Cloud Computing platform. Stacking massive clusters purpose-built for high-performance parallel computing, the group also aims to launch a global accelerated cloud...
-
Hpc architect
6 days ago
India Stealth AI Startup Full timeJob Description: HPC Architect – Stealth AI Startup Location: Remote Experience: 5+ Years in High-Performance Computing (HPC) Type: Full-Time About Us: Join our innovative stealth AI startup, driving breakthroughs in Artificial Intelligence and High-Performance Computing (HPC). We are a well-funded company with an exceptional leadership...
-
Hpc solution architect
1 day ago
India IVedha Inc. Full time***Must have experience in HPC Data Center, NVIDIA and Infini Band*** Overview i Vedha is seeking a highly skilled and experienced Solution Architect to work with the team to design and implement a High-Performance Computing (HPC) cluster within a data center environment. This role will involve working with NVIDIA GPUs, Infini Band networking, and...
-
HPC Solution Architect
4 days ago
India iVedha Inc. Full time***Must have experience in HPC Data Center, NVIDIA and InfiniBand*** Overview iVedha is seeking a highly skilled and experienced Solution Architect to work with the team to design and implement a High-Performance Computing (HPC) cluster within a data center environment. This role will involve working with NVIDIA GPUs, InfiniBand networking, and...
-
SHI | Locuz
5 days ago
india SHI | Locuz - An SHI Company Full timeHPC Field Engineer Please find the JD below: Work Location - Pune Experience - 2+years Should know the below mentioned: HPC Skill Set Cluster Tool Kit: Rocks, xCAT, OpenHPC Scheduler: PBS Pro, SLURM MPI: Intel, OpenMPI PFS: Lustre, GPFS Linux Skill : • OS Deep Dive ( RedHat, SLES, Ubuntu ) • Unattended Installation Deep Dive ( PXE, Cobbler, xCAT, etc)...
-
HPC Solution Architect
4 days ago
India iVedha Inc. Full time***Must have experience in HPC Data Center, NVIDIA and InfiniBand***OverviewiVedha is seeking a highly skilled and experienced Solution Architect to work with the team to design and implement a High-Performance Computing (HPC) cluster within a data center environment. This role will involve working with NVIDIA GPUs, InfiniBand networking, and...
-
India Stealth AI Startup Full timeJob OverviewWe are seeking a highly skilled and experienced High-Performance Computing (HPC) Network Engineer to join our team at Stealth AI Startup. The ideal candidate will have a strong background in setting up and managing cutting-edge technologies such as 400G and 800G network connectivity.This role involves designing, implementing, and troubleshooting...
-
HPC Solution Architect
3 days ago
India iVedha Inc. Full time***Must have experience in HPC Data Center, NVIDIA and InfiniBand*** Overview iVedha is seeking a highly skilled and experienced Solution Architect to work with the team to design and implement a High-Performance Computing (HPC) cluster within a data center environment. This role will involve working with NVIDIA GPUs, InfiniBand networking, and...
-
iVedha Inc. | HPC Solution Architect
4 days ago
india iVedha Inc. Full time***Must have experience in HPC Data Center, NVIDIA and InfiniBand***OverviewiVedha is seeking a highly skilled and experienced Solution Architect to work with the team to design and implement a High-Performance Computing (HPC) cluster within a data center environment. This role will involve working with NVIDIA GPUs, InfiniBand networking, and...
-
iVedha Inc. | HPC Solution Architect
4 days ago
india iVedha Inc. Full time***Must have experience in HPC Data Center, NVIDIA and InfiniBand*** Overview iVedha is seeking a highly skilled and experienced Solution Architect to work with the team to design and implement a High-Performance Computing (HPC) cluster within a data center environment. This role will involve working with NVIDIA GPUs, InfiniBand networking, and...
-
High-Performance Computing
2 days ago
Bengaluru, India Talent Ocean (Osheanire HR Consulting) Full timeJob Description Job Title: HPCVDI Company Information The RLE INTERNATIONAL Group is one of the world s leading development, technology and consultation service providers to the international engineering industries. Our 2.100 employees constantly keep abreast of technological progress. Thanks to their wide-ranging skills and innovative ideas, they play...