Chief High-Performance Infrastructure Specialist

1 week ago


Gurgaon, Haryana, India beBeeInfrastructure Full time ₹ 15,00,000 - ₹ 28,00,000
Job Description

We are seeking an experienced professional to fill the role of High-Performance Computing Engineer. The successful candidate will provide operational support for enterprise-level customers, plan and perform maintenance activities, assess customer environments for performance and design issues, and collaborate with technical teams to troubleshoot complex infrastructure issues.

This role requires a strong background in customer service, high-level problem-solving skills, and effective communication skills. The ideal candidate will have experience managing infrastructure in high-performance computing environments; expertise with HPC schedulers, storage technology, and distributed file systems; and familiarity with machine learning or data science workflows in HPC/AI environments.

Required Skills and Qualifications
  • Bachelor's degree in Information Systems or related field
  • 5+ years of expert-level experience managing infrastructure in high-performance computing environments
  • 1+ years of experience with Nvidia DGX preferred
  • Experience with HPC schedulers (e.g., SLURM, PBS, Torque)
  • Experience configuring, maintaining, and troubleshooting Kubernetes
  • Experience with storage technology (e.g., Ceph, Vast Data Platform) and distributed file systems (e.g., Lustre, GPFS, NFS, GlusterFS)
  • Experience with machine learning or data science workflows in HPC/AI environments
  • Advanced experience with Linux operating systems
  • Experience with Nvidia/Mellanox (Cumulus OS) switches a plus
  • Experience with Ethernet and InfiniBand networking a plus
  • 1+ years working with monitoring platforms (e.g., Prometheus, Grafana); Elastic Observability experience is a bonus
  • 1+ years working with enterprise ITSM systems (ServiceNow is a bonus)
  • Experience with automation tools such as Ansible, Puppet, or Chef is a plus
  • Managed Services or consulting experience is required
Benefits

The following benefits package is available:

  • Medical, Dental, and Vision Insurance
  • 401(k)
  • Paid company holidays
  • Paid time off
  • Paid parental and caregiver leave
Why This Opportunity?

This role offers a unique opportunity to work with a diverse team, participate in cross-department training, and engage in ongoing learning and certification opportunities.



  • Gurgaon, Haryana, India beBeeReliability Full time ₹ 15,00,000 - ₹ 20,00,000

    Job Title: Senior Site Reliability Engineer. Overview: The successful candidate will be responsible for designing and implementing large-scale distributed systems with a focus on performance at scale, real-time monitoring, logging, and alerting. The ideal candidate will have a deep understanding of GPU computing and AI infrastructure. Responsibilities: Design and...


  • Gurgaon, Haryana, India beBeeNetwork Full time ₹ 15,00,000 - ₹ 28,00,000

    Expert Network Professionals are sought for the role of High-Performance Computing Network Engineer. This position requires a highly skilled individual with extensive experience in managing network infrastructure in high-performance computing environments. The ideal candidate will have expertise in configuring, maintaining, and troubleshooting Nvidia/Mellanox...


  • Gurgaon, Haryana, India beBeeKafka Full time ₹ 15,00,000 - ₹ 25,00,000

    Job Title: High-Performance Messaging Systems Specialist. We are seeking an experienced Kafka Administrator to manage, maintain, and optimize our distributed, multi-cluster Kafka infrastructure deployed in an on-premise environment. This role requires deep knowledge of Kafka internals, Zookeeper administration, performance tuning, and operational excellence...


  • Gurgaon, Haryana, India beBeeInfrastructure Full time ₹ 15,00,000 - ₹ 28,00,000

    Job Overview: We are seeking a talented HPC Infrastructure Specialist to join our team. In this role, you will provide expert-level operational support to customers for incident, problem, and change management activities. Key Responsibilities: Provide enterprise-level operational support to customers for incident, problem, and change management activities. Plan...


  • Gurgaon, Haryana, India beBeeAutomation Full time ₹ 15,00,000 - ₹ 25,00,000

    Achieve Infrastructure Excellence. We seek a highly skilled Chief Automation Specialist with 3+ years of experience to lead our infrastructure modernization initiatives. Automate provisioning workflows using CloudStack API. Develop Terraform/Ansible playbooks for seamless deployments. Integrate CMP with backend automation scripts. Establish CI/CD pipelines for...


  • Gurgaon, Haryana, India beBeeMarketing Full time ₹ 12,00,000 - ₹ 15,00,000

    Job Title: A high-performing Performance Marketing Specialist is required to plan and execute Paid Advertising campaigns on social media platforms. Key Responsibilities: Developing and implementing Paid Advertising strategies across multiple social media channels; optimizing campaign performance through A/B testing and ROI analysis; managing programmatic...


  • Gurgaon, Haryana, India beBeeinfrastructure Full time ₹ 1,50,000 - ₹ 28,00,000

    System Infrastructure Specialist. We are seeking an experienced System Infrastructure Specialist to join our team. As a key member of our infrastructure team, you will be responsible for the management and maintenance of high availability infrastructure.


  • Gurgaon, Haryana, India beBeeDevOps Full time ₹ 20,00,000 - ₹ 25,00,000

    AWS DevOps Engineer - Cloud Infrastructure Specialist. We are seeking a seasoned cloud infrastructure specialist with robust experience in designing, deploying, and maintaining secure, scalable, and high-availability AWS environments. Design and manage AWS infrastructure, focusing on middleware services such as API Gateway, Lambda, SQS, SNS, ECS, and...


  • Gurgaon, Haryana, India beBeeReliability Full time ₹ 2,00,00,000 - ₹ 2,50,00,000

    Job Overview: We are seeking an experienced Senior Reliability Engineer to ensure the reliability, availability, scalability, and performance of our Azure-based platforms and applications. Service Reliability & SLOs: Define and maintain Service Level Objectives (SLOs) for the systems you own. Automation & Scalability: Develop automation to scale systems...


  • Gurgaon, Haryana, India beBeeCloudInfrastructure Full time ₹ 20,09,917 - ₹ 25,12,756

    Job Title: Cloud Infrastructure Specialist. About the Role: This is an exciting opportunity to join our team as a Cloud Infrastructure Specialist. In this role, you will be responsible for designing, building, testing, and deploying cloud application solutions that integrate cloud and non-cloud infrastructure. Your primary focus will be on collaborating with...