
High Performance Computing Systems Specialist
7 days ago
Job Title: AI Infrastructure Expert
Job Description:
- Assess current AI infrastructure requirements and forecast future growth, ensuring seamless integration with business objectives.
- Design, deploy, and support AI infrastructure to meet evolving needs, encompassing CPU, GPU, middleware, and orchestration tools.
- Configure and manage parallel file systems (PFS) and network file systems (NFS), optimizing performance and scalability.
- Install and configure libraries and compilers, guaranteeing compatibility and efficiency.
- Deploy and manage monitoring and observability tools for AI clusters, providing real-time insights and proactive issue resolution.
- Manage and troubleshoot network issues with InfiniBand, ROCE switches, UFM, NETQ, and other specialized tools, minimizing downtime and maximizing system availability.
- Expertly manage NVIDIA GPUs, Cuda toolkit, and related software, unlocking advanced AI capabilities.
- Have knowledge of cloud infrastructure management in AWS and GCP, enabling seamless migration and deployment.
- Be familiar with cloud bursting techniques, dynamically scaling resources to meet changing demands.
- Install and configure Kubernetes, automating containerization and orchestration.
Required Skills and Qualifications:
- Operating systems: Linux (RHEL, CentOS, SuSE)
- Languages: C, C++, bash, Python
- Schedulers and resource management: PBS, SLURM, Grid Engine, LSF
- Cluster management: xCAT, Bright Cluster Manager, War wolf
- Monitoring tools: Grafana, Nagios, Zabbix, Gangalia
- Compilers and libraries: GNU, Intel, OpenMPI, Cuda, MPI
- Networking: InfiniBand/ROCE/UFM/Ethernet/DPU
- Parallel file systems: GPFS, Lustre, Weka, BeeGFS, VAST
- Benchmarking tools: IOR, Lim pack
- NVIDIA stack: NVIDIA AI Enterprise, RunAI
- PFS configuration and troubleshooting: Lustre, GPFS, Weka
- Job scheduler installation and configuration: PBS, SLURM, Grid Engine
- Root cause analysis of cluster issues
- Performance benchmarking of clusters
- Basic parallel programming skills – OpenMPI, MPI
- GPU orchestration and MIG instance creation
- Scripting and automation: bash, Perl, Python
- Advanced Linux OS debugging skills
-
Computer Systems Specialist
1 week ago
Pushkar, Rajasthan, India beBeeDesktopSupport Full time ₹ 6,00,000 - ₹ 12,00,000Job Role:We are seeking an experienced Desktop Support Specialist to join our organization.Main Responsibilities:Desktop Support: Provide high-quality technical assistance to employees for desktop computers, laptops, and mobile devices.Operating System Management: Install, configure, and troubleshoot operating systems, hardware, and software...
-
High Performance Systems Developer
1 week ago
Pushkar, Rajasthan, India beBeeSoftware Full time ₹ 1,20,00,000 - ₹ 2,00,00,000Job OverviewWe are seeking a skilled Software Architect to design and build high-performance systems. The ideal candidate will collaborate with TypeScript/Node.js, PostgreSQL, and modern distributed technologies to develop scalable infrastructure that supports rapid user growth.Key Responsibilities:Design server-side applications using TypeScript and Nest.js...
-
High Performance Computing Cluster Engineer
1 week ago
Pushkar, Rajasthan, India beBeeHpcAdmin Full time ₹ 20,00,000 - ₹ 25,00,000Job Title: HPC System AdministratorMain Responsibilities:We are seeking an experienced HPC system administrator to administer High-Performance Computing (HPC) and Virtual Desktop Infrastructure (VDI) clusters.This role is responsible for managing user accounts for HPC onboarding and offboarding, ensuring seamless access and compliance with organizational...
-
High-Performance Computing Network Architect
1 week ago
Pushkar, Rajasthan, India beBeeAutomation Full time ₹ 1,50,00,000 - ₹ 2,50,00,000Network Automation Engineer">We are seeking a skilled Network Automation Engineer to join our team and design, implement, and optimize automation solutions for large-scale AI/ML workloads.">Develop and maintain network automation frameworks for datacenter environments supporting AI/ML applications.Design and implement Python-based automation scripts and...
-
High Performance Electronics Specialist
1 week ago
Pushkar, Rajasthan, India beBeeElectronic Full time ₹ 18,00,000 - ₹ 20,10,000About this RoleWe're seeking an experienced Electronics Specialist to join our team.This role involves designing and developing high-performance electronic hardware for water purification and distribution systems deployed globally.The ideal candidate will have expertise in creating complex, multi-layer printed circuit boards (PCBs) for power electronics...
-
Pushkar, Rajasthan, India beBeeMechanical Full time ₹ 14,00,000 - ₹ 22,40,000Job Role: Senior Mechanical SpecialistThis is a highly specialized position that involves the design, analysis and implementation of mechanical systems for infrastructure projects.The role requires strong technical skills, excellent analytical and problem-solving abilities and a deep understanding of industry trends and technologies.Key responsibilities...
-
High-Performance Tester
1 week ago
Pushkar, Rajasthan, India beBeePerformance Full time ₹ 9,00,000 - ₹ 12,10,000Job Overview:We are seeking a skilled Performance Specialist to join our team. As a key member of our organization, you will be responsible for planning, executing, and analyzing performance testing initiatives.Key Requirements:6+ years of experience in performance testing with a proven track record of identifying and resolving complex performance...
-
Optimize System Performance Specialist
1 week ago
Pushkar, Rajasthan, India beBeeAnalysis Full time ₹ 12,00,000 - ₹ 12,40,000Job Description:System Performance Expert RoleThe role of System Performance Expert involves monitoring and analyzing system performance to ensure optimal uptime.Key Responsibilities:Monitor system performance using ITRS/Grafana/Prometheus tools to identify issues that affect overall system efficiency.Analyze system performance data to troubleshoot problems...
-
High Availability Specialist
6 days ago
Pushkar, Rajasthan, India beBeeAvailability Full time ₹ 18,00,000 - ₹ 22,50,000**Job Summary**We are seeking a highly skilled High Availability Specialist to join our team. In this role, you will be responsible for designing and implementing resiliency-focused testing strategies for enterprise-grade applications and infrastructure.Your key responsibilities will include developing, executing, and maintaining test plans, scripts, and...
-
Building High-Traffic Systems
1 week ago
Pushkar, Rajasthan, India beBeeBackend Full time ₹ 30,00,000 - ₹ 50,00,000**About Us**Babblebots AI is a pioneering company in the field of artificial intelligence. Our journey began with VoiceAI, followed by documents, intelligent conversations, and AI-based assessments.We have developed an AI-Agent called an AI-Recruiter, a concept that was later coined. This multi-agent, multi-modal AI platform has continuously evolved.**Job...