High Performance Computing Systems Specialist

7 days ago


Pushkar, Rajasthan, India beBeeInfrastructure Full time ₹ 2,00,00,000 - ₹ 2,42,25,000

Job Title: AI Infrastructure Expert

Job Description:

  • Assess current AI infrastructure requirements and forecast future growth, ensuring seamless integration with business objectives.
  • Design, deploy, and support AI infrastructure to meet evolving needs, encompassing CPU, GPU, middleware, and orchestration tools.
  • Configure and manage parallel file systems (PFS) and network file systems (NFS), optimizing performance and scalability.
  • Install and configure libraries and compilers, guaranteeing compatibility and efficiency.
  • Deploy and manage monitoring and observability tools for AI clusters, providing real-time insights and proactive issue resolution.
  • Manage and troubleshoot network issues with InfiniBand, ROCE switches, UFM, NETQ, and other specialized tools, minimizing downtime and maximizing system availability.
  • Expertly manage NVIDIA GPUs, Cuda toolkit, and related software, unlocking advanced AI capabilities.
  • Have knowledge of cloud infrastructure management in AWS and GCP, enabling seamless migration and deployment.
  • Be familiar with cloud bursting techniques, dynamically scaling resources to meet changing demands.
  • Install and configure Kubernetes, automating containerization and orchestration.

Required Skills and Qualifications:

  • Operating systems: Linux (RHEL, CentOS, SuSE)
  • Languages: C, C++, bash, Python
  • Schedulers and resource management: PBS, SLURM, Grid Engine, LSF
  • Cluster management: xCAT, Bright Cluster Manager, War wolf
  • Monitoring tools: Grafana, Nagios, Zabbix, Gangalia
  • Compilers and libraries: GNU, Intel, OpenMPI, Cuda, MPI
  • Networking: InfiniBand/ROCE/UFM/Ethernet/DPU
  • Parallel file systems: GPFS, Lustre, Weka, BeeGFS, VAST
  • Benchmarking tools: IOR, Lim pack
  • NVIDIA stack: NVIDIA AI Enterprise, RunAI
  • PFS configuration and troubleshooting: Lustre, GPFS, Weka
  • Job scheduler installation and configuration: PBS, SLURM, Grid Engine
  • Root cause analysis of cluster issues
  • Performance benchmarking of clusters
  • Basic parallel programming skills – OpenMPI, MPI
  • GPU orchestration and MIG instance creation
  • Scripting and automation: bash, Perl, Python
  • Advanced Linux OS debugging skills


  • Pushkar, Rajasthan, India beBeeDesktopSupport Full time ₹ 6,00,000 - ₹ 12,00,000

    Job Role:We are seeking an experienced Desktop Support Specialist to join our organization.Main Responsibilities:Desktop Support: Provide high-quality technical assistance to employees for desktop computers, laptops, and mobile devices.Operating System Management: Install, configure, and troubleshoot operating systems, hardware, and software...


  • Pushkar, Rajasthan, India beBeeSoftware Full time ₹ 1,20,00,000 - ₹ 2,00,00,000

    Job OverviewWe are seeking a skilled Software Architect to design and build high-performance systems. The ideal candidate will collaborate with TypeScript/Node.js, PostgreSQL, and modern distributed technologies to develop scalable infrastructure that supports rapid user growth.Key Responsibilities:Design server-side applications using TypeScript and Nest.js...


  • Pushkar, Rajasthan, India beBeeHpcAdmin Full time ₹ 20,00,000 - ₹ 25,00,000

    Job Title: HPC System AdministratorMain Responsibilities:We are seeking an experienced HPC system administrator to administer High-Performance Computing (HPC) and Virtual Desktop Infrastructure (VDI) clusters.This role is responsible for managing user accounts for HPC onboarding and offboarding, ensuring seamless access and compliance with organizational...


  • Pushkar, Rajasthan, India beBeeAutomation Full time ₹ 1,50,00,000 - ₹ 2,50,00,000

    Network Automation Engineer">We are seeking a skilled Network Automation Engineer to join our team and design, implement, and optimize automation solutions for large-scale AI/ML workloads.">Develop and maintain network automation frameworks for datacenter environments supporting AI/ML applications.Design and implement Python-based automation scripts and...


  • Pushkar, Rajasthan, India beBeeElectronic Full time ₹ 18,00,000 - ₹ 20,10,000

    About this RoleWe're seeking an experienced Electronics Specialist to join our team.This role involves designing and developing high-performance electronic hardware for water purification and distribution systems deployed globally.The ideal candidate will have expertise in creating complex, multi-layer printed circuit boards (PCBs) for power electronics...


  • Pushkar, Rajasthan, India beBeeMechanical Full time ₹ 14,00,000 - ₹ 22,40,000

    Job Role: Senior Mechanical SpecialistThis is a highly specialized position that involves the design, analysis and implementation of mechanical systems for infrastructure projects.The role requires strong technical skills, excellent analytical and problem-solving abilities and a deep understanding of industry trends and technologies.Key responsibilities...


  • Pushkar, Rajasthan, India beBeePerformance Full time ₹ 9,00,000 - ₹ 12,10,000

    Job Overview:We are seeking a skilled Performance Specialist to join our team. As a key member of our organization, you will be responsible for planning, executing, and analyzing performance testing initiatives.Key Requirements:6+ years of experience in performance testing with a proven track record of identifying and resolving complex performance...


  • Pushkar, Rajasthan, India beBeeAnalysis Full time ₹ 12,00,000 - ₹ 12,40,000

    Job Description:System Performance Expert RoleThe role of System Performance Expert involves monitoring and analyzing system performance to ensure optimal uptime.Key Responsibilities:Monitor system performance using ITRS/Grafana/Prometheus tools to identify issues that affect overall system efficiency.Analyze system performance data to troubleshoot problems...


  • Pushkar, Rajasthan, India beBeeAvailability Full time ₹ 18,00,000 - ₹ 22,50,000

    **Job Summary**We are seeking a highly skilled High Availability Specialist to join our team. In this role, you will be responsible for designing and implementing resiliency-focused testing strategies for enterprise-grade applications and infrastructure.Your key responsibilities will include developing, executing, and maintaining test plans, scripts, and...


  • Pushkar, Rajasthan, India beBeeBackend Full time ₹ 30,00,000 - ₹ 50,00,000

    **About Us**Babblebots AI is a pioneering company in the field of artificial intelligence. Our journey began with VoiceAI, followed by documents, intelligent conversations, and AI-based assessments.We have developed an AI-Agent called an AI-Recruiter, a concept that was later coined. This multi-agent, multi-modal AI platform has continuously evolved.**Job...