Yotta - L2 HPC Administrator
3 months ago
Job Description :
As an HPC Admin L2, you will be responsible for the management and maintenance of GPU Supercomputing clusters on NVIDIA reference architecture. You will ensure optimal performance and uptime of these critical systems, supporting high-performance computing (HPC) requirements.
Job Responsibilities :
- Configure and maintain GPU Supercomputing clusters and the associated network configuration.
- Implement and optimize software stacks including MaaS (metal-as-a-service), Job Scheduler (SLURM/PBS), Cloud Orchestration (Kubernetes), and Network Management (NetQ for Ethernet fabric and UFM for InfiniBand).
- Conduct performance activities such as debugging, profiling, benchmarking, and tuning of GPU applications on large-scale supercomputing clusters.
- Run benchmarking applications from widely used platforms such as MLPerf Training & Inference, AI Training (PyTorch, TensorFlow, NeMo, Megatron-LM), and AI Inference (TensorRT-LLM, Triton Inference Server, vLLM).
Must-Have Skill :
- Hands-on experience with NVIDIA GPUs, particularly NVIDIA Data Centre GPUs (A100/H100).
- Experience in provisioning and managing software stacks like MaaS, Job Scheduler (SLURM/PBS), Cloud Orchestration (Kubernetes), and Network Management (NetQ for Ethernet fabric and UFM for InfiniBand).
- Prior experience collaborating with NVIDIA Solution Architect & Engineering teams on large-scale GPU-as-a-service projects.
- Familiarity with benchmarking applications from widely used platforms and frameworks, including MLPerf, PyTorch, TensorFlow, NeMo, Megatron-LM, TensorRT-LLM, Triton Inference Server, and vLLM.
- Experience in performance engineering, including debugging, profiling, benchmarking, and tuning various GPU applications on large-scale supercomputing clusters.
Good to Have Skill :
- Knowledge of other HPC technologies and architectures beyond NVIDIA, broadening expertise in the field.
- Experience with other cloud platforms and orchestration tools, expanding versatility in deployment environments.
- Strong problem-solving and troubleshooting abilities, enabling quick resolution of complex technical issues.
- Excellent communication and collaboration skills to work effectively within cross-functional teams and with external partners.
Behavioral Attributes :
- Strong problem-solving skills with a proactive and solution-oriented approach.
- Excellent communication and collaboration skills for effective customer support.
- Adaptability to handle a dynamic and fast-paced cloud administration environment.
- Commitment to security best practices and continuous improvement.
Qualification and Experience :
- Bachelor's degree in Engineering, or equivalent.
- Minimum 6 years of experience in IT, including 3+ years of relevant experience in HPC engineering roles, with a focus on NVIDIA GPU and networking technologies.
- Demonstrated success in deploying and managing large-scale GPU Supercomputing clusters, preferably in collaboration with NVIDIA teams.
- Proven track record of performance engineering activities and optimizing GPU applications for high-performance computing workloads.
(ref:hirist.tech)
-
Yotta - L2 HPC Administrator
3 weeks ago
Navi Mumbai, Maharashtra, India YOTTA INFRASTRUCTURE SOLUTIONS LLP Full time
Job Title: Yotta - L2 HPC Administrator. At Yotta Infrastructure Solutions LLP, we are seeking a highly skilled L2 HPC Administrator to join our team. As an HPC Admin L2, you will be responsible for the management and maintenance of GPU Supercomputing clusters on NVIDIA reference architecture. Key Responsibilities: Configure and maintain GPU Supercomputing...
-
Yotta - L3 HPC Administrator
3 weeks ago
Navi Mumbai, Maharashtra, India YOTTA INFRASTRUCTURE SOLUTIONS LLP Full time
Job Title: Yotta - L3 HPC Administrator. As a seasoned HPC Administrator, you will be responsible for the end-to-end management of GPU Supercomputing clusters on NVIDIA reference architecture. Your primary focus will be on ensuring optimal performance and uptime of these critical systems, supporting high-performance computing requirements. Key...
-
Yotta - L3 HPC Administrator
2 weeks ago
Mumbai/Airoli, India YOTTA INFRASTRUCTURE SOLUTIONS LLP Full time
Job Summary: As a seasoned HPC professional, you will be responsible for the provisioning, management, and maintenance of GPU Supercomputing clusters on NVIDIA reference architecture. Your goal will be to ensure optimal performance and uptime of these critical systems, supporting high-performance computing requirements. Key Responsibilities: Provision, configure,...
-
Yotta - L2 HPC Administrator
2 weeks ago
Mumbai/Airoli, India YOTTA INFRASTRUCTURE SOLUTIONS LLP Full time
Job Description : As an HPC Admin L2, you will be responsible for the management and maintenance of GPU Supercomputing clusters on NVIDIA reference architecture. You will ensure optimal performance and uptime of these critical systems, supporting high-performance computing (HPC) requirements. Job Responsibilities : - Configure and maintain GPU...
-
Yotta - L3 HPC Administrator
3 months ago
Navi Mumbai, India YOTTA INFRASTRUCTURE SOLUTIONS LLP Full time
As an HPC Admin L3, you will be responsible for the provisioning, management, and maintenance of GPU Supercomputing clusters on NVIDIA reference architecture. You will ensure optimal performance and uptime of these critical systems, supporting high-performance computing (HPC) requirements. Job Responsibilities : - Provision, configure, and maintain GPU...
-
HPC Systems Administrator
3 weeks ago
Mumbai, Maharashtra, India Yotta Data Services Private Limited Full time
Job Title: HPC Admin. At Yotta Data Services Private Limited, we are seeking a highly skilled HPC Admin to join our team. As an HPC Admin, you will be responsible for the provisioning, management, and maintenance of GPU Supercomputing clusters on NVIDIA reference architecture. Key Responsibilities: Provision, configure, and maintain GPU Supercomputing clusters...
-
Yotta - L3 HPC Administrator
2 weeks ago
Mumbai/Airoli, India YOTTA INFRASTRUCTURE SOLUTIONS LLP Full time
As an HPC Admin L3, you will be responsible for the provisioning, management, and maintenance of GPU Supercomputing clusters on NVIDIA reference architecture. You will ensure optimal performance and uptime of these critical systems, supporting high-performance computing (HPC) requirements. Job Responsibilities : - Provision, configure, and maintain GPU...
-
NVIDIA HPC Cluster Architect
6 days ago
Mumbai/Airoli, India YOTTA INFRASTRUCTURE SOLUTIONS LLP Full time
HPC Infrastructure Engineer Role. As a seasoned HPC Infrastructure Engineer at Yotta Infrastructure Solutions LLP, you will be responsible for delivering high-performance computing (HPC) solutions to our clients. Your primary focus will be on designing, deploying, and managing large-scale GPU Supercomputing clusters on NVIDIA reference architecture. Key...
-
HPC Admin
3 weeks ago
Mumbai, India Yotta Data Services Private Limited Full time
Job Scope: As an HPC Admin L3, you will be responsible for the provisioning, management, and maintenance of GPU Supercomputing clusters on NVIDIA reference architecture. You will ensure optimal performance and uptime of these critical systems, supporting high-performance computing (HPC) requirements. Job Responsibilities: Provision, configure, and maintain...
-
HPC Admin
3 weeks ago
Mumbai, India Yotta Data Services Private Limited Full time
Job Scope: As an HPC Admin L3, you will be responsible for the provisioning, management, and maintenance of GPU Supercomputing clusters on NVIDIA reference architecture. You will ensure optimal performance and uptime of these critical systems, supporting high-performance computing (HPC) requirements. Job Responsibilities: Provision, configure, and...
-
HPC Admin
1 month ago
Mumbai, India Yotta Data Services Private Limited Full time
Job Scope: As an HPC Admin L3, you will be responsible for the provisioning, management, and maintenance of GPU Supercomputing clusters on NVIDIA reference architecture. You will ensure optimal performance and uptime of these critical systems, supporting high-performance computing (HPC) requirements. Job Responsibilities: Provision, configure, and...
-
HPC Admin
1 month ago
Mumbai Metropolitan Region, India Yotta Data Services Private Limited Full time
Job Scope: As an HPC Admin L3, you will be responsible for the provisioning, management, and maintenance of GPU Supercomputing clusters on NVIDIA reference architecture. You will ensure optimal performance and uptime of these critical systems, supporting high-performance computing (HPC) requirements. Job Responsibilities: Provision, configure, and maintain...
-
High Performance Computing Specialist
1 week ago
Mumbai, Maharashtra, India Yotta Data Services Private Limited Full time
About the Role: We seek an exceptional High Performance Computing Specialist to join our team at Yotta Data Services Private Limited. Main Responsibilities: Design, provision, and maintain large-scale GPU Supercomputing clusters using NVIDIA reference architecture. Collaborate closely with our Engineering teams and external partners to deploy and manage...
-
High-Performance Computing Administrator
2 weeks ago
Mumbai/Airoli, India YOTTA INFRASTRUCTURE SOLUTIONS LLP Full time
HPC Administrator Job Summary: We are seeking a highly skilled HPC Administrator to manage and maintain our GPU Supercomputing clusters on NVIDIA reference architecture. The successful candidate will ensure optimal performance and uptime of these critical systems, supporting high-performance computing requirements. Key Responsibilities: Configure and maintain...
-
Team Member Network L2
5 months ago
Mumbai, India Yotta Infrastructure Full time
Job Title: Network Engineer (L2) - Operation. Experience: 4-8 years. Location: Panvel / Noida. Business Unit: DC Technology. Purpose of Job: In-depth knowledge of Data Center Networking systems/services, LAN switch technology (including FabricPath, Spanning Tree, VXLAN, EVPN), routing protocols (EIGRP, OSPF, BGP). Ability to manage a multi-vendor WAN,...
-
Mumbai, India Baker Hughes Full time
Would you like to help shape and implement our Digital Technology teams' strategic direction? Are you passionate about Technology, Software and Development? Join our Digital Technology team! We operate at the heart of the digital transformation of our business. Our team is responsible for designing & building secure solutions involving global...
-
Server Administrator L2
5 months ago
Mumbai, Maharashtra, India Aidewiser Soltek Full time
Role: Server Administrator L2. Location: Powai. Experience: 3.6+ Years. Required Experience: Bachelor’s degree in Computer Science or Associate Degree. 3 or more years of related Tech Support / Information Technology experience. Can provide L2 Level Customer Support. Can provide exceptional support while communicating and...
-
L2 Desktop Support Engineer
3 months ago
Navi Mumbai, India Excis Compliance ltd Full time
Supervise, mentor, and guide the IT Help Desk team. Conduct regular team meetings to discuss performance updates and strategies. Provide technical assistance and support for escalated issues. Stay updated with the latest technology trends and ensure the team is aware of these changes. Maintain a high level of customer service and ensure customer inquiries are...
-
L2 Desktop Support Engineer
4 weeks ago
Navi Mumbai, India Excis Compliance ltd Full time
Supervise, mentor, and guide the IT Help Desk team. Conduct regular team meetings to discuss performance updates and strategies. Provide technical assistance and support for escalated issues. Stay updated with the latest technology trends and ensure the team is aware of these changes. Maintain a high level of customer service and ensure...