Senior High-Performance Computing Professional

5 days ago


Hyderabad Secunderabad Telangana, India beBeeHpc Full time ₹ 1,04,000 - ₹ 1,30,878
Job Description:">

We are seeking an experienced HPC Systems Engineer with 7+ years of expertise in high-performance computing environments. This role requires hands-on experience with Python, Kubernetes (K8s), Slurm, OpenStack, and Ansible, along with the ability to support external clients in live troubleshooting sessions.

">

The Ideal Candidate:

The ideal candidate will have deep technical knowledge of drivers, troubleshooting methods, and system-level debugging and will play a key role in managing, optimizing, and troubleshooting HPC clusters and cloud-based HPC environments.

  • HPC System Administration & Troubleshooting

Key Responsibilities Include:

  • Manage and optimize HPC clusters, ensuring high availability and performance.
  • Troubleshoot GPU, CPU, network drivers, firmware, and OS-level issues.
  • Debug storage, networking, and job scheduling bottlenecks in Slurm-based environments.
  • Kubernetes & Cloud HPC Environments

Required Skills and Qualifications:

  • Experience with AMD MI300, MI2X0 GPUs, ROCm, MPI, UCX, or XPMEM.
  • Exposure to containerized workloads using Singularity or Docker in HPC.
  • Familiarity with OpenStack deployment automation (e.g., TripleO, Kolla, or OpenStack-Ansible).
  • Experience in customer-facing technical roles, with a strong ability to troubleshoot live issues.


  • Hyderabad / Secunderabad, Telangana, India beBeeCloud Full time ₹ 1,04,000 - ₹ 1,30,878

    Job Overview:We are seeking an expert in high-performance computing to join our team. As a key member of our infrastructure team, you will be responsible for designing and implementing Azure clusters that meet the highest standards of performance and reliability.Your expertise in InfiniBand networking, including configuration, deployment, and...


  • Hyderabad, Telangana, India beBeeVerification Full time ₹ 15,00,000 - ₹ 25,00,000

    As a Senior Performance Verification Engineer, you will play a crucial role in the Computing and Graphics Performance Verification team.Key Responsibilities:Develop high-performance computing systems to meet stringent quality and reliability standards.Collaborate with cross-functional teams to design and implement verification strategies for complex graphics...


  • Hyderabad, Telangana, India beBeeCloudComputing Full time ₹ 1,80,00,000 - ₹ 2,00,00,000

    Job OverviewWe are seeking a seasoned High-Performance Computing Engineer to join our team. As a key member of our organization, you will play a pivotal role in designing, integrating, and managing high-performance computing systems that encompass both hardware and software components into our network infrastructure.This individual will be responsible for...


  • Hyderabad / Secunderabad, Telangana, India beBeeHpcexpert Full time ₹ 1,04,000 - ₹ 1,30,878

    We are looking for a senior HPC expert to oversee the deployment, maintenance and support of our multi-cloud infrastructure. This hands-on role requires deep technical expertise in HPC technology and is crucial for supporting data science, AI/ML workflows and image analysis. The ideal candidate will have advanced knowledge in large Linux environments,...


  • Hyderabad / Secunderabad, Telangana, India beBeeSystemAdministrator Full time ₹ 1,04,000 - ₹ 1,30,878

    Job Title: System AdministratorKey Responsibilities:Manage and maintain infrastructure supporting high-performance computing (HPC) environments for research purposes.Install, configure and update operating systems and application software as required.Evaluate HPC systems to ensure they are secure, reliable, scalable, and cost-effective.Provide technical...


  • Hyderabad, Telangana, India beBeeHighPerformanceComputing Full time ₹ 1,50,00,000 - ₹ 2,10,00,000

    We are seeking a senior high performance computing professional to join our team.Job DescriptionWe are looking for an experienced engineer to work with our data science group. The ideal candidate will have a strong background in Linux/Unix system administration and be proficient in job scheduling and resource management tools such as SLURM, PBS, and LSF....


  • Hyderabad / Secunderabad, Telangana, India beBeePerformanceEngineer Full time ₹ 15,00,000 - ₹ 20,00,000

    About the RoleThe Server SoC Performance Validation team is a critical component of AMD's global Server SoC Performance teams. This team plays a pivotal role in next generation AMD Server SoC design, requiring an individual with a deep understanding of existing AMD X86 SoC architecture/microarchitecture.We are looking for a talented professional who can dive...


  • Hyderabad / Secunderabad, Telangana, India beBeeHPC Full time ₹ 9,00,000 - ₹ 12,00,000

    High-Performance Computing ExpertHPC System Administration & Troubleshooting:Manage and optimize HPC clusters for high availability and performance.Troubleshoot GPU, CPU, network drivers, firmware, and OS-level issues to ensure smooth operation.Debug storage, networking, and job scheduling bottlenecks in Slurm-based environments for optimal...


  • Hyderabad / Secunderabad, Telangana, India beBeeData Full time

    Senior Data Analyst RoleAs a Senior Data Analyst, you will play a vital role in driving business growth by providing high-quality analytics support to internal customers. Your primary responsibility will be to design and implement advanced analytical and statistical solutions for various projects related to promotion evaluation, multi-channel marketing...


  • Hyderabad, Telangana, India beBeeHpc Full time ₹ 20,00,000 - ₹ 23,17,500

    Job Title:A highly skilled HPC AI Applications Professional is required to drive the implementation of high-performance computing solutions.Key Responsibilities:Design and implement high-performance computing (HPC) solutions using Open-source and Commercial HPC AI ApplicationsInstall, benchmark, and fine-tune open-source applications, libraries, and...