Urgent: SMTS Systems Design Eng

3 weeks ago


Hyderabad, Telangana, India Advanced Micro Devices (AMD) Full time
Job Description

HPC System Administration & Troubleshooting:

- Manage and optimize HPC clusters, ensuring high availability and performance.
- Troubleshoot GPU, CPU, network drivers, firmware, and OS-level issues.
- Debug storage, networking, and job scheduling bottlenecks in Slurm-based environments.

Kubernetes & Cloud HPC Environments:

- Deploy and manage HPC workloads in Kubernetes for AI/ML and parallel computing.
- Optimize OpenStack-based HPC clusters with Ceph, Cinder, and Neutron for cloud scalability.
- Implement containerized HPC workflows using Kubernetes and OpenShift.

Automation & Infrastructure as Code (IaC):

- Develop Ansible and Terraform scripts for provisioning and managing HPC resources.
- Automate job scheduling, cluster monitoring, and log analysis using Python.
- Optimize CI/CD pipelines for HPC and AI/ML applications.

Performance Tuning & Benchmarking:

- Benchmark and optimize multi-node HPC workloads (MPI, NCCL, ROCm, CUDA).
- Tune OS parameters, networking (InfiniBand, RoCE), and Slurm configurations for peak performance.
- Enhance HPC storage performance (Ceph, Lustre, NFS) and distributed computing efficiency.

Client Support & Collaboration:

- Provide real-time technical support and troubleshooting for HPC users.
- Engage with developers, DevOps, and system administrators to optimize cluster performance.
- Document solutions, best practices, and contribute to internal knowledge bases.

PREFERRED QUALIFICATIONS:

- Experience with AMD MI300, MI2X0 GPUs, ROCm, MPI, UCX, or XPMEM.
- Exposure to containerized workloads using Singularity or Docker in HPC.
- Familiarity with OpenStack deployment automation (e.g., TripleO, Kolla, or OpenStack-Ansible).
- Experience in customer-facing technical roles, with a strong ability to troubleshoot live issues.

  • Hyderabad, Telangana, India Xilinx Full time

    Job DescriptionWHAT YOU DO AT AMD CHANGES EVERYTHINGWe care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences - the building blocks for the data center, artificial intelligence, PCs, gaming and embedded....


  • Hyderabad, Telangana, India Advanced Micro Devices Full time

    WHAT YOU DO AT AMD CHANGES EVERYTHING We care deeply about transforming lives with AMD technology to enrich our industry our communities and the world Our mission is to build great products that accelerate next-generation computing experiences - the building blocks for the data center artificial intelligence PCs gaming and embedded Underpinning our...


  • Hyderabad, Telangana, India Xilinx Full time

    Job DescriptionWHAT YOU DO AT AMD CHANGES EVERYTHINGWe care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences - the building blocks for the data center, artificial intelligence, PCs, gaming and embedded....


  • Hyderabad, Telangana, India Xilinx Full time

    Job DescriptionWHAT YOU DO AT AMD CHANGES EVERYTHINGWe care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences - the building blocks for the data center, artificial intelligence, PCs, gaming and embedded....


  • Hyderabad, Telangana, India Micron Full time

    Our vision is to transform how the world uses information to enrich life for all Micron Technology is a world leader in innovating memory and storage solutions that accelerate the transformation of information into intelligence inspiring the world to learn communicate and advance faster than ever JR62531 PRINCIPAL ENG-DEG-TECHNOLOGY-VERIFICATION As a...


  • Hyderabad, Telangana, India Talent21 Full time ₹ 8,00,000 - ₹ 24,00,000 per year

    Azure Senior CloudOps EngineerPrimary SkillsComplete hands-on individual contributor required.Must have performed Azure engineer role for 6-8 years.Azure Cloud Operations Eng. with system administrationManually deploying, configuring, and troubleshooting - Virtual Machines, Network Security Groups, Azure Monitor, Log analytics, Insights, backup and Site...

  • Designer II

    5 days ago


    Hyderabad, Telangana, India TechnipFMC Full time ₹ 9,00,000 - ₹ 12,00,000 per year

    Job Purpose Issues design/fabrication plans and/or installation sketches/animation for SSE's projects and operations (from proposal to execution), in accordance with design and manufacturing standards and processes, schedule, and man hours. Job Description • Achieves design plans and detailed CAD (computer aided-design) models.• Contributes to technical...


  • Hyderabad, Telangana, India Cadence Design Systems Full time ₹ 20,00,000 - ₹ 25,00,000 per year

    At Cadence, we hire and develop leaders and innovators who want to make an impact on the world of technology.In-depth understanding of high-speed Serdes/Memory interface circuits like I/O's,PLL's, Clocking, Datapath's.Hands on experience on PCIe Gen3/4/5/6, GDDRx/DDRx/LPDDRx memory interface circuits.Strong Analog Design and I/O Design fundamentals....

  • MLOps Engineer

    2 weeks ago


    Hyderabad, Telangana, India Transgraph Consulting Full time ₹ 15,00,000 - ₹ 28,00,000 per year

    Seeking an MLOps Engineer to design, deploy, and monitor ML systems. You'll ensure models are reliable, scalable, and easy to manage, while building tools that support teams and improve workflows. Required Candidate profileLooking for 3+ yrs exp in DevOps/MLOps/ML/Data Eng, strong Python, Git, CI/CD, Docker, K8s, cloud (AWS/GCP/Azure).Plus MLflow, Kubeflow,...

  • Senior Designer

    2 weeks ago


    Hyderabad, Telangana, India TechnipFMC Full time US$ 90,000 - US$ 1,20,000 per year

    Job Purpose Leads the production of design/fabrication plans and/or installation sketches/animation within SSE's standard projects and/or operations, in accordance with drawing, design, manufacturing standards and processes, schedule and man hours, with a permanent concern for quality standards and targets. Job Description • Ensures that all design...