High Performance Computing Specialist

5 days ago


Hyderabad Secunderabad Telangana, India beBeeHpcexpert Full time ₹ 1,04,000 - ₹ 1,30,878

We are looking for a senior HPC expert to oversee the deployment, maintenance and support of our multi-cloud infrastructure. This hands-on role requires deep technical expertise in HPC technology and is crucial for supporting data science, AI/ML workflows and image analysis. The ideal candidate will have advanced knowledge in large Linux environments, networking, storage and cloud technologies with a proven ability to perform root-cause analysis.

Roles & Responsibilities
  • Infrastructure Management: Implement and manage cloud-based infrastructure that supports HPC environments ensuring security, scalability and reliability.
  • Collaboration & Optimization: Work closely with data scientists and ML engineers to deploy scalable machine learning models. Optimize cloud resources for cost-effective and efficient use.
  • Automation & Monitoring: Develop and maintain CI/CD pipelines for deploying resources to multi-cloud environments. Monitor and troubleshoot cluster operations and cloud environments.
  • Technical Leadership: Provide technical guidance in cloud and HPC systems management. Document system design and operational procedures.
Qualifications
  • A Bachelor's degree in Computer Science or a related field with hands-on experience in HPC administration.
  • Expertise in Linux/Unix system administration (RHEL, CentOS, Ubuntu). Experience with job scheduling and resource management tools (SLURM, PBS, LSF).
  • Proficiency with parallel computing, MPI, OpenMP and GPU acceleration (CUDA, ROCm). Knowledge of storage architectures and distributed file systems (Lustre, GPFS, Ceph).
  • Experience with scripting languages (Python, Bash) and containerization technologies (Docker, Kubernetes). Familiarity with Infrastructure as Code (IaC) tools like Terraform or CloudFormation and Git.
  • Experience in cloud computing (AWS, Azure, GCP) with a strong understanding of cloud architecture.
  • Familiarity with Red Hat Certified Engineer (RHCE) or AWS Certified Solutions Architect certifications is an asset.
Key Skills
  • Problem-Solving: Strong analytical skills with expertise in root-cause analysis and troubleshooting.
  • Communication: Top-level communication and documentation skills are required.
  • Collaboration: Ability to work effectively with global, virtual and cross-functional teams in a fast-paced environment.
  • Technical Expertise: Experience with multi-cloud environments, machine learning frameworks (TensorFlow, PyTorch) and distributed computing technologies is an asset.
  • Availability: This position requires onsite presence and involves a 24/5 and weekend on-call rotation with the possibility of working later shifts.


  • Hyderabad / Secunderabad, Telangana, India beBeeCloud Full time ₹ 1,04,000 - ₹ 1,30,878

    Job Overview:We are seeking an expert in high-performance computing to join our team. As a key member of our infrastructure team, you will be responsible for designing and implementing Azure clusters that meet the highest standards of performance and reliability.Your expertise in InfiniBand networking, including configuration, deployment, and...


  • Hyderabad / Secunderabad, Telangana, India beBeePerformance Full time ₹ 1,04,000 - ₹ 1,30,878

    Performance Testing SpecialistWe are looking for a skilled Performance Testing Specialist to join our team. As a key member of our testing team, you will play a crucial role in ensuring the high-quality and performance of our software applications.Key Responsibilities:Develop and execute performance tests for web applications, APIs, and backend systems to...


  • Hyderabad, Telangana, India beBeeVerification Full time ₹ 15,00,000 - ₹ 25,00,000

    As a Senior Performance Verification Engineer, you will play a crucial role in the Computing and Graphics Performance Verification team.Key Responsibilities:Develop high-performance computing systems to meet stringent quality and reliability standards.Collaborate with cross-functional teams to design and implement verification strategies for complex graphics...


  • Hyderabad / Secunderabad, Telangana, India beBeeSystemAdministrator Full time ₹ 1,04,000 - ₹ 1,30,878

    Job Title: System AdministratorKey Responsibilities:Manage and maintain infrastructure supporting high-performance computing (HPC) environments for research purposes.Install, configure and update operating systems and application software as required.Evaluate HPC systems to ensure they are secure, reliable, scalable, and cost-effective.Provide technical...


  • Hyderabad / Secunderabad, Telangana, India beBeeHpc Full time ₹ 1,04,000 - ₹ 1,30,878

    Job Description:">We are seeking an experienced HPC Systems Engineer with 7+ years of expertise in high-performance computing environments. This role requires hands-on experience with Python, Kubernetes (K8s), Slurm, OpenStack, and Ansible, along with the ability to support external clients in live troubleshooting sessions.">The Ideal Candidate:The ideal...


  • Hyderabad / Secunderabad, Telangana, India beBeeHPC Full time ₹ 9,00,000 - ₹ 12,00,000

    High-Performance Computing ExpertHPC System Administration & Troubleshooting:Manage and optimize HPC clusters for high availability and performance.Troubleshoot GPU, CPU, network drivers, firmware, and OS-level issues to ensure smooth operation.Debug storage, networking, and job scheduling bottlenecks in Slurm-based environments for optimal...


  • Hyderabad / Secunderabad, Telangana, India beBeePerfomance Full time ₹ 9,00,000 - ₹ 12,00,000

    As a high-performance testing specialist, you will be responsible for designing and executing performance tests to validate system performance against defined benchmarks.About the Role:The ideal candidate will have experience with performance testing tools such as JMeter, LoadRunner, or similar, and will be able to utilize these tools to simulate various...


  • Hyderabad, Telangana, India beBeeHpc Full time ₹ 20,00,000 - ₹ 23,17,500

    Job Title:A highly skilled HPC AI Applications Professional is required to drive the implementation of high-performance computing solutions.Key Responsibilities:Design and implement high-performance computing (HPC) solutions using Open-source and Commercial HPC AI ApplicationsInstall, benchmark, and fine-tune open-source applications, libraries, and...


  • Hyderabad, Telangana, India beBeeVerification Full time ₹ 1,50,00,000 - ₹ 2,50,00,000

    Performance Verification ExpertWe are seeking a skilled Performance Verification Engineer to join our team.The ideal candidate will have a strong background in performance verification and experience working with high-performance computing systems, including:Developing simulation infrastructure and methodology advances to model customer...


  • Hyderabad / Secunderabad, Telangana, India beBeeperformance Full time ₹ 1,04,000 - ₹ 1,30,878

    About the RoleWe are seeking a highly skilled and experienced Performance Test Engineer to join our team.As a key member of our organization, you will be responsible for driving performance test methodologies, principles, patterns, and practices across multiple parallel initiatives. Your expertise in software design principles and patterns will enable you to...