Senior HPC Infrastructure Architect

1 week ago


Mumbai, Maharashtra, India beBeeHpc Full time ₹ 1,50,00,000 - ₹ 2,00,00,000
High-Performance Computing Systems Specialist

We are seeking a skilled High-Performance Computing (HPC) systems specialist to design, deploy and maintain robust HPC systems that support advanced computing and data-intensive applications.

Key Responsibilities:
  • Design and Implementation: Design high-performance network architectures for HPC clusters, configure and optimize InfiniBand and Ethernet switches, routers, and interconnects.
  • System Management: Ensure high availability, redundancy, and fault tolerance in HPC systems, deploy and maintain HPC clusters, monitor job scheduling, and ensure optimal system health.
  • Troubleshooting and Performance Improvement: Troubleshoot compute node hardware/software issues and implement performance improvements, maintain storage systems with fast and reliable access from clusters.
  • Monitoring and Configuration: Configure and manage InfiniBand fabrics, upgrade firmware, and monitor performance using tools like Grafana, Prometheus, Ganglia, and UFM.
Required Experience and Skills:
  • Infrastructure Management: 5+ years managing infrastructure in HPC environments, strong background in data center operations including servers, switches, routers, and storage.
  • Technical Expertise: Proficient in NVIDIA/Mellanox switch configuration and troubleshooting, hands-on with monitoring tools: Prometheus, Grafana, Elastic Observability.
  • HPC Scheduling and Kubernetes: Experience with HPC schedulers: SLURM, PBS, or Torque, setup and maintenance experience in Kubernetes environments.
  • Linux Administration and Data Science: Strong Linux administration experience, familiar with ML and data science workflows in HPC/AI environments.
  • Collaboration and Documentation: Collaborative team player with strong communication skills, capable of documenting and designing complex systems.
Skills and Knowledge:
  • Networking: Deep understanding of Ethernet and InfiniBand networks.
  • Distributed Storage: Proficiency in distributed storage and file systems.
  • Problem-Solving: Expertise in diagnosing and resolving complex infrastructure issues.
  • Documentation and Design: Capable of documenting and designing complex systems.

  • HPC System Expert

    2 weeks ago


    Mumbai, Maharashtra, India beBeeHighPerformance Full time ₹ 1,50,00,000 - ₹ 2,00,00,000

    Job Title: High-Performance Computing Systems Specialist We are seeking a seasoned professional to join our IT infrastructure team as a High-Performance Computing Systems Specialist. This role involves designing, deploying, and maintaining robust HPC systems that support advanced computing and data-intensive applications.Key Responsibilities:Design,...

  • HPC Network Engineer

    2 weeks ago


    Mumbai, Maharashtra, India Stealth AI Startup Full time US$ 1,25,000 - US$ 1,75,000 per year

    Job Title: HPC Network EngineerLocation: MumbaiExperience: Minimum 5 years of relevant network experienceJob Overview:We are seeking a highly skilled and experienced HPC Network Engineer to join our team. The ideal candidate will have a strong background in setting up and managing high-performance computing (HPC) networks with cutting-edge technologies such...


  • Mumbai, Maharashtra, India SHI | Locuz - An SHI Company Full time

    We're Hiring: Technology Associate-HPC Location: Mumbai Experience: 2+ years Expertise: HPC, Cluster, Lustre, Pbs, LSF, xCATWe are looking for a skilled High-Performance Computing (HPC) Administrator to manage and optimize our HPC infrastructure. You'll support mission-critical research and computational workloads by maintaining cluster systems, storage,...


  • Mumbai, Maharashtra, India Selby Jennings Full time ₹ 9,00,000 - ₹ 12,00,000 per year

    HPC System AdministratorJoin a Leading Team in High-Performance ComputingWe are seeking an experienced HPC System Administrator to join a dynamic and globally dispersed team dedicated to cutting-edge research and innovation. This role focuses on designing, developing, and supporting high-performance trading infrastructure and computing environments that...

  • Senior Architect

    7 days ago


    Mumbai, Maharashtra, India Architect Hafeez Contractor Full time

    Job DescriptionAbout the Company - Architect Hafeez Contractor the name needs no introduction. It boasts of 700+ team members and 35 plus Associates. It has a wide portfolio of clients ranging from institutional to commercial to residential.About the Role - Senior ArchitectResponsibilities -- Looking for Senior Architect with experience of working in...

  • Senior Architect

    2 weeks ago


    Mumbai, Maharashtra, India Architect Hafeez Contractor Full time ₹ 5,00,000 - ₹ 10,00,000 per year

    About the Company - Architect Hafeez Contractor the name needs no introduction. It boasts of 700+ team members and 35 plus Associates. It has a wide portfolio of clients ranging from institutional to commercial to residential.About the Role - Senior ArchitectResponsibilities -Looking for Senior Architect with experience of working in Architectural firms.Able...


  • Mumbai, Maharashtra, India beBeeLinux Full time ₹ 1,20,00,000 - ₹ 1,80,00,000

    Job TitleSenior Linux Engineer – Quantitative ResearchAbout the RoleWe are seeking a Senior Linux Engineer to join our technology infrastructure team.This is not a traditional admin or support role, you will be a core enabler of quantitative research and high-performance computing (HPC) within a large-scale financial research environment.Our infrastructure...


  • Navi Mumbai, Maharashtra, India beBeeCloud Full time US$ 1,50,000 - US$ 2,00,000

    Job Title: Senior Cloud Infrastructure Architect We are seeking a highly skilled and experienced Senior Cloud Infrastructure Architect to join our team. Job Description: Design, develop, and deploy scalable and secure cloud infrastructure solutions using AWS technologies.Collaborate with cross-functional teams to identify opportunities for cloud adoption...


  • Mumbai, Maharashtra, India SHI | Locuz - An SHI Company Full time ₹ 15,00,000 - ₹ 20,00,000 per year

    Join the SHI | Locuz Journey – We're HiringPosition:Solution Architect – Core Data Center (DC)Locations:Delhi, Mumbai, PuneExperience:4–10 YearsIndustry:IT Infrastructure | Data Center | HPC | HCI | VirtualizationWe're thrilled to invite you to be a part of an exciting journey withSHI | Locuz If you're ready to thrive in a fast-paced, innovative...


  • Mumbai, Maharashtra, India beBeeInfrastructure Full time ₹ 1,20,00,000 - ₹ 2,50,00,000

    Job Title: Cloud Infrastructure ArchitectAs a Cloud Infrastructure Architect, you will design and implement scalable cloud-based infrastructure solutions to support business growth.