
High Performance Computing Specialist
3 days ago
Job Overview
We are seeking a skilled High Performance Computing Specialist to join our team. As a Senior Consultant, you will be responsible for designing and implementing high-performance computing clusters on Azure.
Your primary focus will be on automating cluster buildout workflows, tasks, and reports to produce innovative solutions for cluster buildout and maintenance activities.
Key Responsibilities:
- Configure, deploy, and troubleshoot InfiniBand networking layers, including cabling validation according to network topology.
- Perform IBUFM switch software and firmware upgrades during buildout and production.
- Monitor the health of IBUFM nodes, troubleshoot them using Microsoft-provided tools, and configure them as necessary according to provided procedures.
- Define and follow recommended practices and business processes within the project.
- Support automating the buildout process and reporting.
- Perform cluster maintenance activities as required, including node diagnostics, resolution, routing, and recovery.
- Monitor target clusters in buildout progress and work with component and vendor teams to unblock and drive deployments to resolution.
- Move faulty nodes and devices to the RMA queue, work with the respective teams and vendors until resolution, then move them back into production to fill cluster capacity.
- Mitigate incident queues and automate mitigation where possible.
- Create TSG and SOP documents.
- Collaborate with the team.
This role requires flexibility to work 24/7 shifts and excellent oral and written communication skills. Preferred skills include intermediate Power BI and Kusto skills. Experience in IT operations, PowerShell automation, Ubuntu Linux, Windows Server operating systems, and datacenter architecture is highly desired.
Requirements:
- Experience in high-performance computing, Azure, and IT operations.
- Strong understanding of InfiniBand networking and its configuration.
- Ability to troubleshoot complex issues and collaborate with cross-functional teams.
- Excellent communication and problem-solving skills.
A dynamic and supportive work environment that fosters growth and development. Opportunities for professional growth and advancement. A competitive compensation package.
-
High-Performance Computing Specialist
5 days ago
Hyderabad, Telangana, India beBeeVerification Full time ₹ 15,00,000 - ₹ 25,00,000
As a Senior Performance Verification Engineer, you will play a crucial role in the Computing and Graphics Performance Verification team. Key Responsibilities: Develop high-performance computing systems to meet stringent quality and reliability standards. Collaborate with cross-functional teams to design and implement verification strategies for complex graphics...
-
High-Performance Computing Specialist
1 week ago
Hyderabad, Telangana, India beBeeHpc Full time ₹ 20,00,000 - ₹ 23,17,500
A highly skilled HPC AI Applications Professional is required to drive the implementation of high-performance computing solutions. Key Responsibilities: Design and implement high-performance computing (HPC) solutions using open-source and commercial HPC AI applications. Install, benchmark, and fine-tune open-source applications, libraries, and...
-
High-Performance Compute Specialist
1 day ago
Hyderabad, Telangana, India beBeeEngineer Full time US$ 1,50,000 - US$ 2,00,000
Job Overview: We're seeking an experienced engineer to develop and optimize software systems for our silicon platform. This role focuses on building efficient runtime systems that maximize chip performance while ensuring reliability and ease of use. Key Responsibilities: Design and implement runtime systems for AI accelerator execution and memory...
-
High-Performance Computing Specialist
3 days ago
Hyderabad, Telangana, India beBeeVerification Full time ₹ 1,50,00,000 - ₹ 2,50,00,000
Performance Verification Expert: We are seeking a skilled Performance Verification Engineer to join our team. The ideal candidate will have a strong background in performance verification and experience working with high-performance computing systems, including: Developing simulation infrastructure and methodology advances to model customer...
-
Hyderabad, Telangana, India beBeeCloudComputing Full time ₹ 1,50,00,000 - ₹ 2,00,00,000
Job Opportunity: As a High-Performance Computing (HPC) Cluster Specialist, you will be responsible for the administration and maintenance of HPC clusters. Your duties will include user account management for onboarding and offboarding, creation and maintenance of AMI images, installation and configuration of Linux operating systems, and support for necessary...
-
Senior High Performance Computing Specialist
1 week ago
Hyderabad, Telangana, India beBeeHighPerformanceComputing Full time ₹ 1,50,00,000 - ₹ 2,10,00,000
We are seeking a senior high performance computing professional to join our team. Job Description: We are looking for an experienced engineer to work with our data science group. The ideal candidate will have a strong background in Linux/Unix system administration and be proficient in job scheduling and resource management tools such as SLURM, PBS, and LSF....
-
High-Performance Infrastructure Specialist
2 days ago
Hyderabad, Telangana, India beBeeHigh Full time ₹ 8,00,000 - ₹ 12,00,000
Job Title: High-Performance Infrastructure Specialist
As a critical member of our infrastructure team, the successful candidate will be responsible for designing and implementing high-performance computing solutions that meet the needs of our AI clients.
Key Responsibilities:
- Design, deploy, and maintain highly scalable server infrastructure to...
-
High-Performance Computing Expert
1 week ago
Hyderabad, Telangana, India beBeeArtificial Full time ₹ 2,50,00,000 - ₹ 3,50,00,000
Job Title: AI and HPC Engineer
The position involves designing, optimizing, and benchmarking CPU- and GPU-intensive environments to ensure maximum efficiency in scientific and machine learning workloads. Expertise in open-source and commercial High-Performance Computing (HPC) AI applications. Proficient in deploying and optimizing scientific codes such as...
-
High Performance Computing Specialist
2 days ago
Hyderabad, Telangana, India beBeeHighperformancecomputing Full time ₹ 20,00,000 - ₹ 25,00,000
HPC Administrator Role Overview: This role involves managing and maintaining our High-Performance Computing (HPC) environment, requiring strong Linux system administration skills, AWS cloud services expertise, and HPC platform experience. HPC Cluster Administration: We are seeking an individual with hands-on experience in administering HPC clusters, including...
-
Hyderabad, Telangana, India beBeeCloudComputing Full time ₹ 1,80,00,000 - ₹ 2,00,00,000
Job Overview: We are seeking a seasoned High-Performance Computing Engineer to join our team. As a key member of our organization, you will play a pivotal role in designing, integrating, and managing high-performance computing systems that encompass both hardware and software components into our network infrastructure. This individual will be responsible for...