HPC System Administrator(6+ years)
4 days ago
HPC System Administrator
Job Summary
We are seeking an experienced High-Performance Computing (HPC) System Administrator to manage, maintain, and optimize large-scale HPC clusters and infrastructure. This role focuses on ensuring reliable system operations, implementing robust monitoring solutions, managing user environments, and maintaining high availability of compute resources for research and production workloads.
Key Responsibilities
· Install, configure, and maintain HPC cluster hardware and software components
· Manage job scheduling systems (SLURM, PBS, LSF) and optimize queue configurations
· Monitor system performance, resource utilization, and cluster health using monitoring tools
· Administer user accounts, permissions, and resource allocations across compute nodes
· Deploy and maintain software stacks, compilers, libraries, and scientific applications
· Implement and maintain backup strategies and disaster recovery procedures
· Troubleshoot hardware failures, network issues, and software conflicts
· Perform regular system updates, security patches, and maintenance windows
· Manage storage systems including parallel file systems (Lustre, GPFS, BeeGFS)
· Coordinate with vendors for hardware support and warranty services
· Create and maintain system documentation and operational procedures
Required Qualifications
· Bachelor's degree in Computer Science, Information Technology, or related field
· years of experience administering Linux-based HPC systems
· Strong knowledge of Linux system administration (RHEL, CentOS, Ubuntu)
· Experience with job scheduling systems (SLURM preferred)
· Proficiency in shell scripting (Bash) and system automation
· Knowledge of networking concepts including InfiniBand and Ethernet fabrics
· Experience with configuration management tools (Ansible, Puppet, Chef)
· Understanding of parallel file systems and storage technologies
· Familiarity with HPC interconnects and high-speed networking
· Experience with system monitoring tools (Nagios, Zabbix, Ganglia)
Preferred Skills
· Experience with container technologies (Singularity, Docker) in HPC environments
· Knowledge of virtualization technologies (KVM, VMware)
· Familiarity with cloud computing platforms and hybrid cloud deployments
· Experience with GPU computing and CUDA environments
· Understanding of MPI, OpenMP, and other parallel programming models
· Knowledge of security best practices for multi-user HPC environments
· Experience with database administration (MySQL, PostgreSQL)
· Familiarity with ticketing systems and user support workflows
· Certification in relevant technologies (Red Hat, CompTIA, vendor-specific)
-
HPC System Administrator
2 weeks ago
Bengaluru, Karnataka, India NVISH SOLUTIONS PRIVATE LIMITED Full time ₹ 8,00,000 - ₹ 24,00,000 per yearResponsibilities : - Administration of HPC and VDI clusters - User Account management for HPC onboarding and offboarding - Creation and Maintenance of AMI Images in AMI accounts - Install, configure, and maintain Linux operating systems on HPC clusters. - Support HPC necessary components and native services of the platform by coordinating...
-
HPC System Administrator
2 days ago
Bengaluru, Karnataka, India Evalutech Prospect Services Full time ₹ 5,00,000 - ₹ 15,00,000 per yearManage, configure, and troubleshoot HPC clusters with SLURM, ROCKS/XCAT, and parallel file systems; support Linux/Windows servers and optimize scientific computing performance.
-
Hpc System Administrator
1 week ago
Bengaluru, Karnataka, India Phigrid Consulting Full time ₹ 15,00,000 - ₹ 25,00,000 per yearResponsibilities:* Manage HPC clusters on Linux* Collaborate with development teams for resource allocation* Ensure system availability and performance* Maintain security and compliance standards* Optimize resource utilization
-
HPC Systems Engineer
5 days ago
Bengaluru, Karnataka, India ExxonMobil Full time ₹ 6,00,000 - ₹ 12,00,000 per yearHPC Engineer About us At ExxonMobil, our vision is to lead in energy innovations that advance modern living and a net-zero future. As one of the world's largest publicly traded energy and chemical companies, we are powered by a unique and diverse workforce fueled by the pride in what we do and what we stand for. The success of our Upstream,...
-
HPC Engineer
1 week ago
Bengaluru, Karnataka, India Vikgol Full time ₹ 12,00,000 - ₹ 36,00,000 per yearJob Title : HPC Engineer Performance Engineering & BenchmarkingExperience : 3 YearsLocation : BangaloreEmployment Type : Full-timeJoining : ImmediateAbout the Role : We are looking for a highly skilled HPC Engineer to join our Performance Engineering and Benchmarking team. The role involves performance analysis, optimization, and benchmarking of...
-
HPC Administrator with Linux Expertise
6 days ago
Bengaluru, Karnataka, India Capgemini Full time ₹ 9,00,000 - ₹ 12,00,000 per yearHPC Admin+Linux We're hiring for below job role: Skill: HPC Admin+Linux Experience: 4 to 9 years Location: Banglaore If you're interested, please share your updated profile.
-
Senior HPC Systems Engineer
5 days ago
Bengaluru, Karnataka, India ExxonMobil Full time ₹ 1,20,000 - ₹ 5,44,000 per yearAbout usAt ExxonMobil, our vision is to lead in energy innovations that advance modern living and a net-zero future. As one of the world's largest publicly traded energy and chemical companies, we are powered by a unique and diverse workforce fueled by the pride in what we do and what we stand for.The success of our Upstream, Product Solutions and Low Carbon...
-
Cloud Engineer – HPC Compute
2 days ago
Bengaluru, Karnataka, India Chevron Full time ₹ 12,00,000 - ₹ 36,00,000 per yearTotal Number of Openings1Job Description Summary:Delivers High Performance Computing (HPC) application and infrastructure solutions that support complex, compute intensive workloads, parallel filesystems, low latency networks, artificial intelligence (AI), and machine learning (ML).About Chevron ITThe Information Technology function empowers all of Chevron...
-
Senior HPC Engineer
3 weeks ago
Bengaluru, Karnataka, India, Karnataka Hays Full timeJob Overview:The IT Infrastructure Engineering & Application (IE&A) group provides the high-performance compute environment that fuels product and solutions development for Arm's engineering community. Whether its high-performance compute (HPC) on Arm’s on-prem infrastructure and/or in the cloud, Electronic Design Automation (EDA) tools, or customised...
-
Linux/HPC System Engineer
1 week ago
Bengaluru, Karnataka, India Kyyba Full time ₹ 15,00,000 - ₹ 20,00,000 per yearJob IntroductionAs an SES Engineer, you will play a key role in supporting and enhancing our existing infrastructure, which includes multiple Linux-based clusters located across India, North America, and Europe. You'll be part of a globally distributed team responsible for maintaining and scaling our cloud computing environment. Your responsibilities will...