HPC System Expert

1 day ago


Mumbai, Maharashtra, India beBeeHighPerformance Full time ₹ 1,50,00,000 - ₹ 2,00,00,000
Job Title: High-Performance Computing Systems Specialist

 

We are seeking a seasoned professional to join our IT infrastructure team as a High-Performance Computing Systems Specialist. This role involves designing, deploying, and maintaining robust HPC systems that support advanced computing and data-intensive applications.

Key Responsibilities:

  • Design, implement, and manage high-performance network architectures for HPC clusters.
  • Configure and optimize InfiniBand and Ethernet switches, routers, and interconnects.

System Maintenance and Troubleshooting:

  • Ensure high availability, redundancy, and fault tolerance in HPC systems.
  • Deploy and maintain HPC clusters, monitor job scheduling, and ensure optimal system health.
  • Troubleshoot compute node hardware/software issues and implement performance improvements.

Storage and Monitoring:

  • Maintain storage systems (Ceph, Vast Data, Lustre, GPFS, NFS, GlusterFS) with fast, reliable access from clusters.
  • Configure and manage InfiniBand fabrics; upgrade firmware and monitor performance.
  • Use tools like Grafana, Prometheus, Ganglia, and UFM for cluster and network monitoring.

Collaboration and Communication:

  • Work closely with researchers and data scientists to support HPC/AI workloads.
  • Assist in debugging, tuning, and optimizing distributed applications.
  • Create and maintain HLD and LLD documentation.

Required Experience and Qualifications:

  • 5+ years managing infrastructure in HPC environments.
  • Strong background in data center operations servers, switches, routers, storage.
  • Proficient in NVIDIA/Mellanox (Cumulus OS) switch configuration and troubleshooting.

Preferred Certifications:

  • Red Hat Certified Engineer (RHCE)
  • Cisco Certified Network Associate (CCNA)
  • AWS Certified Solutions Architect


  • Mumbai, Maharashtra, India SHI | Locuz - An SHI Company Full time

    We're Hiring: Technology Associate-HPC Location: Mumbai Experience: 2+ years Expertise: HPC, Cluster, Lustre, Pbs, LSF, xCATWe are looking for a skilled High-Performance Computing (HPC) Administrator to manage and optimize our HPC infrastructure. You'll support mission-critical research and computational workloads by maintaining cluster systems, storage,...

  • HPC Network Engineer

    12 hours ago


    Mumbai, Maharashtra, India Stealth AI Startup Full time US$ 1,25,000 - US$ 1,75,000 per year

    Job Title: HPC Network EngineerLocation: MumbaiExperience: Minimum 5 years of relevant network experienceJob Overview:We are seeking a highly skilled and experienced HPC Network Engineer to join our team. The ideal candidate will have a strong background in setting up and managing high-performance computing (HPC) networks with cutting-edge technologies such...


  • Mumbai, Maharashtra, India beBeePerformance Full time ₹ 1,50,00,000 - ₹ 2,50,00,000

    Senior Linux Systems EngineerWe're seeking a seasoned expert in Linux systems administration to join our team. As a Senior Linux Systems Engineer, you will be responsible for designing, implementing, and maintaining high-performance Linux systems that meet the needs of our organization.About the RoleExperience: 7 to 12+ yearsLocation: Kurla, MumbaiType:...


  • Mumbai, Maharashtra, India beBeeExpertise Full time ₹ 20,00,000 - ₹ 25,00,000

    Job Title:High Performance Computing SpecialistAbout the Role:We are seeking an experienced High Performance Computing (HPC) specialist to manage and optimize our HPC infrastructure. The ideal candidate will have a strong background in Linux system administration, job scheduling, and performance tuning.Key Responsibilities:Administer and monitor Linux-based...


  • Mumbai, Maharashtra, India beBeeTechnical Full time ₹ 1,50,00,000 - ₹ 2,50,00,000

    Job Role:We are seeking a seasoned Systems Professional to collaborate with clients and provide technical expertise for customers across the country and globally. As a technical expert, you will be responsible for delivering high-quality results by participating in customer meetings, presentations, whiteboard sessions, and custom technical design work.About...


  • Mumbai, Maharashtra, India beBeeFinancial Full time ₹ 60,00,000 - ₹ 1,20,00,000

    Financial Systems ExpertA key role within our organization involves collaborating with cross-functional teams to comprehend system processes, identify and troubleshoot issues, ensuring the accuracy and integrity of financial data.

  • Systems Engineer

    1 day ago


    Mumbai, Maharashtra, India Talent Socio Bizcon LLP Full time

    About the role :As a Consulting Systems Engineer (CSE), you will be partnering with Client Executives / Client and Account Managers to provide the technical Pre-Sales work for customers across the country and Global Finance customers operating out of India. No two days will be the same as our broad offerings include Infrastructure Modernization, Multicloud...


  • Mumbai, Maharashtra, India beBeeAdministration Full time ₹ 15,00,000 - ₹ 20,00,000

    Job Title: Middleware Systems Expert",


  • Mumbai, Maharashtra, India beBeeTechnical Full time ₹ 7,50,000 - ₹ 10,00,000

    Job Summary:The role of NAVISYS Engineer is pivotal in ensuring seamless navigation system operations. As the primary point of contact for all matters related to navigation and integrated systems, you will work closely with internal departments, vendors, classification societies, and Fleet Superintendents. Main Responsibilities:Serve as a technical expert...


  • Mumbai, Maharashtra, India Talent Socio Full time

    As a Consulting Systems Engineer (CSE), you will be partnering with Client Executives / Client and Account Managers to provide the technical Pre-Sales work for customers across the country and Global Finance customers operating out of India. No two days will be the same as our broad offerings include Infrastructure Modernization, Multicloud Architecture,...