AI Infrastructure Engineer GPU

3 days ago


Greater Visakhapatnam Area, India LEO DOES IT INC Full time ₹ 15,00,000 - ₹ 28,00,000 per year

AI Infrastructure Engineer (GPU & Server Specialist) – Onsite

We're looking for an
AI Infrastructure Engineer
to help us build a world-class
GPU server environment
for training large language models (GPT-style AI). This role is
onsite
and hands-on — setting up the latest GPUs, servers, and high-performance clusters.

What You'll Do

  • Deploy and configure
    GPU servers
    (NVIDIA H100/A100 or AMD MI300).
  • Set up
    server clusters
    with high-speed networking (InfiniBand, NVLink).
  • Manage
    storage systems
    (NVMe, Lustre, BeeGFS) for AI training data.
  • Optimize environments for
    PyTorch, TensorFlow, and Hugging Face models
    .
  • Monitor and maintain
    system health and performance
    .

What We're Looking For

  • 5+ years of experience in
    HPC, GPU servers, or AI infrastructure
    .
  • Strong knowledge of
    Linux, CUDA, drivers, and GPU optimization
    .
  • Experience with
    cluster management
    (Kubernetes/Docker).
  • Familiarity with
    distributed AI training frameworks
    (DeepSpeed, Horovod, Megatron-LM).

Nice to Have

  • Experience training or supporting
    large language models (LLMs)
    .
  • Background in
    liquid cooling / advanced data center systems
    .
  • Knowledge of
    MLOps practices
    for scaling AI workloads.

Tech Stack

  • GPUs:
    NVIDIA H100/A100, AMD Instinct MI300
  • Servers:
    NVIDIA DGX, Supermicro, Dell, Lambda Labs
  • Networking:
    InfiniBand, NVSwitch, RoCE
  • Software:
    PyTorch, TensorFlow, Hugging Face, DeepSpeed


  • Greater Hyderabad Area, India beBeeEngineer Full time ₹ 2,00,00,000 - ₹ 2,50,00,000

    Senior Design Engineer for High-Performance GPU ArchitectureWe are seeking an experienced design engineer to lead the development of high-performance matrix multiplication, low-latency interconnects, and power-efficient AI acceleration solutions for GPUs.Key Responsibilities:Design IP blocks for GPU cores, including systolic arrays, vector units, and memory...


  • Greater Hyderabad Area, India Mulya Technologies Full time

    Principal IP/RTL Design Engineer for TPU / GPU Hyderabad / Bangalore Founded by highly respected Silicon Valley veterans - with its design centers established in Santa Clara, California. / Hyderabad/ Bangalore Our pay comprehensively beats "ALL" Semiconductor product players in the Indian market. Position Overview Seeking an IP/RTL Design Engineer with...


  • Greater Hyderabad Area, India Mulya Technologies Full time ₹ 1,04,000 - ₹ 1,30,878 per year

    IP/RTL Design Architect for GPUHyderabadFounded by highly respected Silicon Valley veterans - with its design centers established in Santa Clara, California. / Hyderabad/ BangaloreOur pay comprehensively beats "ALL" Semiconductor product players in the Indian market.Position OverviewSeeking an IP/RTL Design Engineer with 8+ years of experience to design...


  • Greater Hyderabad Area, India Mulya Technologies Full time

    Principal IP/RTL Design Engineer for TPU / GPU Hyderabad / BangaloreFounded by highly respected Silicon Valley veterans - with its design centers established in Santa Clara, California. / Hyderabad/ BangaloreOur pay comprehensively beats "ALL" Semiconductor product players in the Indian market. Position OverviewSeeking an IP/RTL Design Engineer with 5+ years...

  • Software Engineer

    2 weeks ago


    Greater Bengaluru Area, India Eridu AI Full time US$ 1,50,000 - US$ 2,00,000 per year

    About Eridu AIEridu AI India Private Limited, a wholly owned subsidiary of Eridu Corporation, Saratoga, California, USA, is looking to hire highly motivated and talented professionals for its R&D center in Bengaluru to join our world-class team.Eridu AI is a Silicon Valley-based hardware startup pioneering infrastructure solutions that accelerate training...


  • Greater Hyderabad Area, India beBeeArchitecture Full time

    We are seeking a Senior AI Architecture Engineer to join our team.Job DescriptionAs a Senior AI Architecture Engineer, you will design and develop high-performance matrix multiplication units, low-latency interconnects, and power-efficient AI acceleration solutions for TPUs and GPUs.Design IP blocks for TPU cores, including systolic arrays, vector units, and...


  • Greater Hyderabad Area, India beBeeVerification Full time ₹ 13,50,000 - ₹ 2,51,64,000

    Job Title: Verification Engineering LeadLead the charge in developing cutting-edge AI models for audio and video applications, focusing on inference efficiency and performance optimization across NPUs, GPUs, and CPUs. In this pivotal role, you will spearhead verification efforts for complex SoCs/IPs, collaborating with cross-functional teams to bring...


  • Greater Bengaluru Area, India Valiance Solutions Full time US$ 1,25,000 - US$ 1,75,000 per year

    About the Role:We are seeking an experienced MLOps Engineer to lead the deployment, scaling, and performance optimization of open-source Generative AI models on cloud infrastructure. You'll work at the intersection of machine learning, DevOps, and cloud engineering to help productize and operationalize large-scale LLM and diffusion models.Key...


  • Greater Hyderabad Area, India beBeeEngineer Full time ₹ 1,50,00,000 - ₹ 2,00,00,000

    We are seeking a highly skilled Senior Design Engineer to lead the development of cutting-edge AI acceleration technologies.Key ResponsibilitiesDesign and implement high-performance matrix multiplication and low-latency interconnects for our next-generation AI accelerators.Develop optimized Verilog/SystemVerilog RTL for performance, timing, and area...

  • Platform Engineer

    2 weeks ago


    Greater Bengaluru Area, India Kluisz Full time ₹ 15,00,000 - ₹ 20,00,000 per year

    About Us is building the future of intelligent cloud infrastructure—autonomous, secure, and GPU-optimized by design. We are on a mission to redefine how cloud, AI, and GPU-native workloads are built, deployed, and scaled—across private, hybrid, and sovereign environments. Our next-gen platform powers secure AI workloads, real-time inferencing, and...