Senior HPC Systems Engineer

1 week ago


Hyderabad / Secunderabad, Telangana, India beBeeHighperformance Full time ₹ 1,04,000 - ₹ 1,30,878
Lead High-Performance Computing (HPC) Engineer

HPC System Administration & Troubleshooting:

  • Benchmark, optimize and troubleshoot complex HPC systems to ensure high availability, performance and reliability.
  • Diagnose and resolve GPU, CPU, network-driver, firmware and OS-level issues.
  • Debug storage, networking and job scheduling bottlenecks in Slurm-based environments to maintain cluster health.

Kubernetes & Cloud HPC Environments:

  • Design and deploy scalable HPC workloads in Kubernetes for AI/ML and parallel computing applications.
  • Optimize OpenStack-based HPC clusters with Ceph, Cinder and Neutron for cloud scalability and efficiency.
  • Implement containerized HPC workflows using Kubernetes and OpenShift for seamless integration.

Automation & Infrastructure as Code (IaC):

  • Develop Ansible and Terraform scripts for provisioning and managing HPC resources effectively.
  • Automate job scheduling, cluster monitoring and log analysis using Python for data-driven insights.
  • Enhance CI/CD pipelines for HPC and AI/ML applications by integrating DevOps practices.

Performance Tuning & Benchmarking:

  • Conduct thorough benchmarking and optimization of multi-node HPC workloads (MPI, NCCL, ROCm, CUDA).
  • Tune OS parameters, networking (InfiniBand, RoCE) and Slurm configurations for peak performance.
  • Improve HPC storage performance (Ceph, Lustre, NFS) and distributed computing efficiency through expert analysis.

Client Support & Collaboration:

  • Provide timely and effective technical support and troubleshooting for HPC users across the organization.
  • Collaborate with developers, DevOps engineers and system administrators to optimize cluster performance and efficiency.
  • Document solutions and best practices, and contribute to internal knowledge bases for future reference.

PREFERRED QUALIFICATIONS:

  • Experience with AMD MI300 and MI2X0 GPUs, ROCm, MPI, UCX or XPMEM technologies.
  • Familiarity with containerized workloads using Singularity or Docker in HPC environments.
  • Knowledge of OpenStack deployment automation tools such as TripleO, Kolla or OpenStack-Ansible.
  • Background in customer-facing technical roles with strong problem-solving skills and ability to work under pressure.


  • Hyderabad / Secunderabad, Telangana, India beBeeHighperformancecomputing Full time ₹ 15,00,000 - ₹ 28,00,000

    Job Title: Senior High Performance Computing Engineer. We are seeking a Senior High-Performance Computing (HPC) professional to deploy, maintain, and support HPC infrastructure in a multi-cloud environment. This hands-on role requires deep technical expertise in HPC technology and is vital for supporting data science, AI/ML workflows, and image analysis. Key...


  • Hyderabad / Secunderabad, Telangana, India beBeeEngineering Full time ₹ 15,00,000 - ₹ 28,00,000

    High-Performance Computing Systems Engineer. We are seeking an experienced professional with 7+ years of expertise in high-performance computing (HPC) environments. This role requires hands-on experience with Python, Kubernetes (K8s), Slurm, OpenStack, and Ansible, along with the ability to support external clients in live troubleshooting sessions. The ideal...

  • HPC Engineer

    1 day ago


    Hyderabad, India Symphoni Hr Full time

    Senior HPC Engineer. Hyderabad | 48 Years | NP: Max Days. Job Description: Looking for an experienced HPC Engineer to manage and optimize high-performance compute environments. Must Have: Strong expertise in IBM Spectrum LSF administration; Linux/RedHat skills (RHCE preferred); Scripting: Bash, Shell, Python, Perl; Cloud exposure (AWS/GCP/Azure). Good to Have:...

  • HPC Engineer

    2 weeks ago


    Hyderabad, Telangana, India Symphoni Hr Full time ₹ 15,00,000 - ₹ 28,00,000 per year

    Senior HPC Engineer. Hyderabad | 48 Years | NP: Max Days. Job Description: Looking for an experienced HPC Engineer to manage and optimize high-performance compute environments. Must Have: Strong expertise in IBM Spectrum LSF administration; Linux/RedHat skills (RHCE preferred); Scripting: Bash, Shell, Python, Perl; Cloud exposure (AWS/GCP/Azure). Good to Have: Terraform,...

  • HPC System Architect

    2 weeks ago


    Hyderabad, Telangana, India beBeeExpert Full time ₹ 1,00,00,000 - ₹ 1,50,00,000

    High-Performance Computing Expert. Job Summary: As a High-Performance Computing (HPC) expert, you will be responsible for the administration and maintenance of our HPC clusters. This includes managing user accounts, creating and maintaining AMI images, and installing and configuring Linux operating systems. Key Responsibilities: Administration of HPC and VDI...

  • HPC System Admin

    2 weeks ago


    Hyderabad, Telangana, India Metasys Technologies Full time

    HPC Admin Full Time Hyderabad Responsibilities: • Administration of HPC and VDI clusters • User Account management for HPC onboarding and offboarding • Creation and Maintenance of AMI Images in AMI accounts • Install, configure, and maintain Linux operating systems on HPC clusters. • Support HPC necessary components and native services of the platform by...

  • HPC System Admin

    1 day ago


    Hyderabad, India Metasys Technologies Full time

    HPC Admin Full Time Hyderabad Responsibilities: • Administration of HPC and VDI clusters • User Account management for HPC onboarding and offboarding • Creation and Maintenance of AMI Images in AMI accounts • Install, configure, and maintain Linux operating systems on HPC clusters. • Support HPC necessary components and native services of the...


  • Hyderabad, Telangana, India beBeeInfrastructureManagement Full time ₹ 15,00,000 - ₹ 28,00,000

    Job Title: HPC System Specialist. Description: The role of an HPC System Specialist involves the administration of high-performance computing (HPC) and virtual desktop infrastructure (VDI) clusters. This position requires expertise in managing user accounts for onboarding and offboarding processes, creating and maintaining Amazon Machine Image (AMI) images...

  • HPC Applications

    1 week ago


    Madhapur, Hyderabad, Telangana, India Locuz Enterprise Solutions Full time ₹ 15,000 - ₹ 28,00,000 per year

    L2 Skill HPC Engineer with Application Expertise. Role Overview: An L2 HPC (High-Performance Computing) Engineer with an application skillset is responsible for supporting, troubleshooting, and maintaining HPC infrastructure and assisting users with scientific and engineering applications. They operate between infrastructure and application layers, ensuring...


  • Hyderabad, Telangana, India SHI | Locuz - An SHI Company Full time ₹ 15,00,000 - ₹ 28,00,000 per year

    Hi, We have an immediate requirement for an HPC Applications Engineer with our organization, SHI Locuz Enterprise Solutions Pvt Ltd. PFB JD: L2 Skill HPC Engineer with Application Expertise. Role Overview: An L2 HPC (High-Performance Computing) Engineer with an application skillset is responsible for supporting, troubleshooting, and maintaining HPC infrastructure and...