ML GPU Kernel Development Engineer
1 week ago
Overview:
WHAT YOU DO AT AMD CHANGES EVERYTHING
At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you'll discover the real differentiator is our culture. We push the limits of innovation to solve the world's most important challenges—striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career.
Responsibilities:
THE ROLE:
We are seeking a talented Machine Learning Kernel Developer to design, develop, and optimize low-level machine learning kernels for AMD GPUs using the ROCm software stack. In this role, you will work on high-impact projects to accelerate AI frameworks and libraries, with a focus on emerging technologies like Large Language Models (LLMs) and other generative AI workloads.
THE PERSON:
The ideal candidate will have hands-on experience with GPU programming (ROCm or CUDA) and a passion for pushing the boundaries of AI performance.
KEY RESPONSIBILITIES:
- Design and implement highly optimized ML kernels (e.g., matrix operations, attention mechanisms) for AMD GPUs using ROCm.
- Profile, debug, and tune kernel performance to maximize hardware utilization for AI workloads.
- Collaborate with ML researchers and framework developers to integrate kernels into AI frameworks (e.g., PyTorch, TensorFlow) and inference engines (e.g., vLLM, SGLang).
- Contribute to the ROCm software stack by identifying and resolving bottlenecks in libraries like MIOpen, BLAS, or Composable Kernel.
- Stay updated on the latest AI/ML trends (LLMs, quantization, distributed inference) and apply them to kernel development.
- Document and communicate technical designs, benchmarks, and best practices.
- Troubleshoot and resolve issues related to GPU compatibility, performance, and scalability.
REQUIRED EXPERIENCE:
- 2+ years of experience in GPU kernel development for machine learning (ROCm or CUDA).
- Proficiency in C/C++ and Python, with experience in performance-critical programming.
- Strong understanding of ML frameworks (PyTorch, TensorFlow) and GPU-accelerated libraries.
- Basic knowledge of modern AI technologies (LLMs, transformers, inference optimization).
- Familiarity with parallel computing, memory optimization, and hardware architectures.
- Problem-solving skills and ability to work in a fast-paced environment.
PREFERRED EXPERIENCE:
- Direct experience with AMD ROCm development (HIP, MIOpen, Composable Kernel).
- Knowledge of LLM-specific optimizations (e.g., FlashAttention, PagedAttention in vLLM).
- Experience with distributed training/inference or model compression techniques.
- Contributions to open-source ML projects or GPU compute libraries.
ACADEMIC CREDENTIALS:
- Bachelor's or Master's degree in Computer Science, Electrical Engineering, or a related field.
Qualifications:
Benefits offered are described in "AMD benefits at a glance."
AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.
Hyderabad, Telangana, India; Advanced Micro Devices, Inc; Full time; ₹ 10,00,000 - ₹ 20,00,000 per year