AI Infrastructure Engineer GPU
3 days ago
AI Infrastructure Engineer (GPU & Server Specialist) – Onsite
We're looking for an
AI Infrastructure Engineer
to help us build a world-class
GPU server environment
for training large language models (GPT-style AI). This role is
onsite
and hands-on — setting up the latest GPUs, servers, and high-performance clusters.
What You'll Do
- Deploy and configure
GPU servers
(NVIDIA H100/A100 or AMD MI300). - Set up
server clusters
with high-speed networking (InfiniBand, NVLink). - Manage
storage systems
(NVMe, Lustre, BeeGFS) for AI training data. - Optimize environments for
PyTorch, TensorFlow, and Hugging Face models
. - Monitor and maintain
system health and performance
.
What We're Looking For
- 5+ years of experience in
HPC, GPU servers, or AI infrastructure
. - Strong knowledge of
Linux, CUDA, drivers, and GPU optimization
. - Experience with
cluster management
(Kubernetes/Docker). - Familiarity with
distributed AI training frameworks
(DeepSpeed, Horovod, Megatron-LM).
Nice to Have
- Experience training or supporting
large language models (LLMs)
. - Background in
liquid cooling / advanced data center systems
. - Knowledge of
MLOps practices
for scaling AI workloads.
Tech Stack
- GPUs:
NVIDIA H100/A100, AMD Instinct MI300 - Servers:
NVIDIA DGX, Supermicro, Dell, Lambda Labs - Networking:
InfiniBand, NVSwitch, RoCE - Software:
PyTorch, TensorFlow, Hugging Face, DeepSpeed
-
GPU Architectural Designer
3 days ago
Greater Hyderabad Area, India beBeeEngineer Full time ₹ 2,00,00,000 - ₹ 2,50,00,000Senior Design Engineer for High-Performance GPU ArchitectureWe are seeking an experienced design engineer to lead the development of high-performance matrix multiplication, low-latency interconnects, and power-efficient AI acceleration solutions for GPUs.Key Responsibilities:Design IP blocks for GPU cores, including systolic arrays, vector units, and memory...
-
Principal IP/RTL Design Engineer for GPU
15 hours ago
Greater Hyderabad Area, India Mulya Technologies Full timePrincipal IP/RTL Design Engineer for TPU / GPU Hyderabad / Bangalore Founded by highly respected Silicon Valley veterans - with its design centers established in Santa Clara, California. / Hyderabad/ Bangalore Our pay comprehensively beats "ALL" Semiconductor product players in the Indian market. Position Overview Seeking an IP/RTL Design Engineer with...
-
IP/RTL Design Architect for GPU
5 days ago
Greater Hyderabad Area, India Mulya Technologies Full time ₹ 1,04,000 - ₹ 1,30,878 per yearIP/RTL Design Architect for GPUHyderabadFounded by highly respected Silicon Valley veterans - with its design centers established in Santa Clara, California. / Hyderabad/ BangaloreOur pay comprehensively beats "ALL" Semiconductor product players in the Indian market.Position OverviewSeeking an IP/RTL Design Engineer with 8+ years of experience to design...
-
Principal IP/RTL Design Engineer for GPU
1 day ago
Greater Hyderabad Area, India Mulya Technologies Full timePrincipal IP/RTL Design Engineer for TPU / GPU Hyderabad / BangaloreFounded by highly respected Silicon Valley veterans - with its design centers established in Santa Clara, California. / Hyderabad/ BangaloreOur pay comprehensively beats "ALL" Semiconductor product players in the Indian market. Position OverviewSeeking an IP/RTL Design Engineer with 5+ years...
-
Software Engineer
2 weeks ago
Greater Bengaluru Area, India Eridu AI Full time US$ 1,50,000 - US$ 2,00,000 per yearAbout Eridu AIEridu AI India Private Limited, a wholly owned subsidiary of Eridu Corporation, Saratoga, California, USA, is looking to hire highly motivated and talented professionals for its R&D center in Bengaluru to join our world-class team.Eridu AI is a Silicon Valley-based hardware startup pioneering infrastructure solutions that accelerate training...
-
Senior AI Architecture Engineer
1 week ago
Greater Hyderabad Area, India beBeeArchitecture Full timeWe are seeking a Senior AI Architecture Engineer to join our team.Job DescriptionAs a Senior AI Architecture Engineer, you will design and develop high-performance matrix multiplication units, low-latency interconnects, and power-efficient AI acceleration solutions for TPUs and GPUs.Design IP blocks for TPU cores, including systolic arrays, vector units, and...
-
AI Model Deployment Engineer
2 weeks ago
Greater Hyderabad Area, India beBeeVerification Full time ₹ 13,50,000 - ₹ 2,51,64,000Job Title: Verification Engineering LeadLead the charge in developing cutting-edge AI models for audio and video applications, focusing on inference efficiency and performance optimization across NPUs, GPUs, and CPUs. In this pivotal role, you will spearhead verification efforts for complex SoCs/IPs, collaborating with cross-functional teams to bring...
-
Machine Learning Engineer
1 week ago
Greater Bengaluru Area, India Valiance Solutions Full time US$ 1,25,000 - US$ 1,75,000 per yearAbout the Role:We are seeking an experienced MLOps Engineer to lead the deployment, scaling, and performance optimization of open-source Generative AI models on cloud infrastructure. You'll work at the intersection of machine learning, DevOps, and cloud engineering to help productize and operationalize large-scale LLM and diffusion models.Key...
-
AI Acceleration Technology Lead
5 days ago
Greater Hyderabad Area, India beBeeEngineer Full time ₹ 1,50,00,000 - ₹ 2,00,00,000We are seeking a highly skilled Senior Design Engineer to lead the development of cutting-edge AI acceleration technologies.Key ResponsibilitiesDesign and implement high-performance matrix multiplication and low-latency interconnects for our next-generation AI accelerators.Develop optimized Verilog/SystemVerilog RTL for performance, timing, and area...
-
Platform Engineer
2 weeks ago
Greater Bengaluru Area, India Kluisz Full time ₹ 15,00,000 - ₹ 20,00,000 per yearAbout Us is building the future of intelligent cloud infrastructure—autonomous, secure, and GPU-optimized by design. We are on a mission to redefine how cloud, AI, and GPU-native workloads are built, deployed, and scaled—across private, hybrid, and sovereign environments. Our next-gen platform powers secure AI workloads, real-time inferencing, and...