
SMTS Systems Design Eng
24 hours ago
**SMTS Systems Design Eng.**:
- Hyderabad, India
- Engineering
- 62878
**Job Description**:
**WHAT YOU DO AT AMD CHANGES EVERYTHING**
- We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences - the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world’s most important challenges. We strive for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives.
- AMD together we advance_
**THE TEAM**
AMD's Data Center GPU organization is transforming the industry with our AI-based graphics processors. Our primary objective is to design exceptional products that drive the evolution of computing experiences, serving as the cornerstone for enterprise data centers, Artificial Intelligence (AI), HPC, and embedded systems. If this resonates with you, come and join our Data Center GPU organization, where we are building amazing AI-powered products with amazing people.
**THE ROLE**:
We are seeking an experienced **HPC Systems Engineer** with **7+ years of expertise in high-performance computing (HPC)** environments. This role requires hands-on experience with **Python, Kubernetes (K8s), Slurm, OpenStack, and Ansible**, along with the ability to **support external clients in live troubleshooting sessions.**
**The PERSON**:
**KEY RESPONSIBILITIES**:
**HPC System Administration & Troubleshooting**:
- Manage and optimize HPC clusters, ensuring high availability and performance.
- Troubleshoot GPU, CPU, network drivers, firmware, and OS-level issues.
- Debug storage, networking, and job scheduling bottlenecks in Slurm-based environments.
**Kubernetes & Cloud HPC Environments**:
- Deploy and manage HPC workloads in Kubernetes for AI/ML and parallel computing.
- Optimize OpenStack-based HPC clusters with Ceph, Cinder, and Neutron for cloud scalability.
- Implement containerized HPC workflows using Kubernetes and OpenShift.
**Automation & Infrastructure as Code (IaC)**:
- Develop Ansible and Terraform scripts for provisioning and managing HPC resources.
- Automate job scheduling, cluster monitoring, and log analysis using Python.
**Performance Tuning & Benchmarking**:
- Benchmark and optimize multi-node HPC workloads (MPI, NCCL, ROCm, CUDA).
- Tune OS parameters, networking (InfiniBand, RoCE), and Slurm configurations for peak performance.
- Enhance HPC storage performance (Ceph, Lustre, NFS) and distributed computing efficiency.
**Client Support & Collaboration**:
- Provide real-time technical support and troubleshooting for HPC users.
- Engage with developers, DevOps, and system administrators to optimize cluster performance.
- Document solutions, best practices, and contribute to internal knowledge bases.
**PREFERRED QUALIFICATIONS**:
- Experience with AMD MI300, MI2X0 GPUs, ROCm, MPI, UCX, or XPMEM.
- Exposure to containerized workloads using Singularity or Docker in HPC.
- Familiarity with OpenStack deployment automation (e.g., TripleO, Kolla, or OpenStack-Ansible).
- Experience in customer-facing technical roles, with a strong ability to troubleshoot live issues.
This role is critical in ensuring seamless HPC operations, troubleshooting complex system issues, and supporting high-profile clients with real-time problem resolution in both bare-metal and cloud-based HPC environments.
**ACADEMIC CREDENTIALS**:
- Bachelor's or Master's degree in Computer Engineering or Electrical/Electronics Engineering