HPC Infrastructure Engineer

2 days ago


Gurgaon, Haryana, India AHEAD Full time
Job Description

Roles & Responsibilities

- Provide enterprise-level operational support to Managed Services customers for incident, problem, and change management activities
- Plan and perform maintenance activities
- Assess customer environments for performance and design issues and propose resolutions
- Work across technical teams to troubleshoot complex infrastructure issues
- Create and maintain detailed documentation
- Serve as a subject matter expert and escalation point for storage technologies
- Work with vendors to resolve storage issues
- Communicate with customers and internal team with transparency
- Participate in on-call rotation
- Completion of training and certification as assigned to further skills and knowledge

Skills Required

- Bachelor's degree or equivalent in Information Systems or related field
- 5+ years of expert-level experience managing infrastructure in high-performance computing environments
- 1+ years of experience with Nvidia DGX preferred
- Experience with HPC schedulers (e.g., SLURM, PBS, Torque)
- Experience configuring, maintaining, and troubleshooting Kubernetes
- Experience with storage technology (e.g., Ceph, Vast Data Platform) and distributed file systems (e.g., Lustre, GPFS, NFS, GlusterFS)
- Experience with machine learning or data science workflows in HPC/AI environments
- Advanced experience with Linux operating systems
- Experience with Nvidia/Mellanox (Cumulus OS) switches a plus
- Experience with ethernet and InfiniBand networking a plus
- 1+ years working with monitoring platforms (e.g., Prometheus, Grafana); Elastic Observability experience is a bonus
- 1+ years working with enterprise ITSM systems (ServiceNow is a bonus)
- Experience with automation tools such as Ansible, Puppet, or Chef is a plus
- Managed Services or consulting experience is required
- Strong background in customer service
- High-level problem-solving and communication skills
- Strong oral and written communication skills
- Related network certifications are a bonus

Why AHEAD

- Diversity-focused workplace with initiatives like Moving Women AHEAD and RISE AHEAD
- Multi-million-dollar lab and cross-department training
- Sponsorship for certifications and ongoing learning

USA Employment Benefits Include

- Medical, Dental, and Vision Insurance
- 401(k)
- Paid company holidays
- Paid time off
- Paid parental and caregiver leave
- Additional benefits listed at https://www.aheadbenefits.com/

Note: The OTE range includes base salary and target bonus and may vary by experience and location.

  • Gurgaon, Haryana, India beBeeHpc Full time ₹ 1,50,00,000 - ₹ 2,50,00,000

    Job Summary:\High-Performance Computing experts are sought after to provide operational support, plan and perform maintenance activities, assess customer environments for performance and design issues, and troubleshoot complex infrastructure issues.\Key Responsibilities:\\Operational support for incident, problem, and change management...


  • Gurgaon, Haryana, India AHEAD Full time

    Job DescriptionThe High-Performance Computing Storage Engineer is primarily responsible for the overall health and maintenance of storage technologies in our managed services customers environments. Our Storage Engineers are a valued member of the Managed Services Infrastructure Practice responsible for Tier 3 incident management, service request management...


  • Gurgaon, Haryana, India Tower Research Capital Full time US$ 1,50,000 - US$ 2,00,000 per year

    Tower Research Capital is a leading quantitative trading firm founded in 1998. Tower has built its business on a high-performance platform and independent trading teams. We have a 25+ year track record of innovation and a reputation for discovering unique market opportunities.Tower is home to some of the world's best systematic trading and engineering...

  • HPC Engineer

    2 days ago


    Gurgaon, Haryana, India Graviton Research Capital LLP Full time US$ 90,000 - US$ 1,20,000 per year

    Graviton is a privately funded quantitative trading firm striving for excellence in financial markets research. We are seeking an HPC Engineer for our team in Gurgaon. Graviton trades across a multitude of asset classes and trading venues using a gamut of concepts and techniques ranging from time series analysis, filtering, classification, stochastic models,...

  • HPC Engineer

    2 days ago


    Gurgaon, Haryana, India Graviton Research Capital LLP Full time US$ 90,000 - US$ 1,20,000 per year

    Graviton is a privately funded quantitative trading firm striving for excellence in financial markets research. We are seeking an HPC Engineer for our team in Gurgaon. Graviton trades across a multitude of asset classes and trading venues using a gamut of concepts and techniques ranging from time series analysis, filtering, classification, stochastic models,...


  • Gurgaon, Haryana, India NVISH SOLUTIONS PRIVATE LIMITED Full time

    Responsibilities : - Administration of HPC and VDI clusters - User Account management for HPC onboarding and offboarding - Creation and Maintenance of AMI Images in AMI accounts- Install, configure, and maintain Linux operating systems on HPC clusters.- Support HPC necessary components and native services of the platform by coordinating with respective...


  • Gurgaon, Haryana, India AHEAD Full time

    Job DescriptionThe High-Performance Computing Network Engineer is primarily responsible for the overall health and maintenance of storage technologies in our managed services customer's environments. Our Network Engineers are a valued member of the Managed Services Infrastructure Practice responsible for Tier 3 incident management, service request management...


  • Gurgaon, Haryana, India beBeeInfrastructure Full time ₹ 15,00,000 - ₹ 28,00,000

    Job DescriptionWe are seeking an experienced professional to fill the role of High-Performance Computing Engineer. The successful candidate will provide operational support for enterprise-level customers, planning and performing maintenance activities, assessing customer environments for performance and design issues, and collaborating with technical teams...


  • Gurgaon, Haryana, India beBeeNetwork Full time ₹ 18,00,000 - ₹ 24,00,000

    Job Opportunity:We are seeking a highly skilled High-Performance Computing Network Engineer to join our team.Job Description:This individual will play a vital role in maintaining the overall health and infrastructure of storage technologies for managed services customers. They will be responsible for Tier 3 incident management, service request management,...


  • Gurgaon, Haryana, India beBeeHighPerformanceComputing Full time ₹ 1,50,00,000 - ₹ 2,00,00,000

    HPC System AdministrationWe are seeking an experienced HPC system administrator to join our team. The ideal candidate will have a strong background in managing high-performance computing (HPC) and virtual desktop infrastructure (VDI) clusters.Responsibilities include:Administration of HPC and VDI clustersUser Account management for HPC onboarding and...