GPU Engineer
2 weeks ago
Role & responsibilities
Job Summary
We are seeking a highly skilled GPU Infrastructure Engineer to join our team. This role focuses on the design, implementation, and management of enterprise network and cloud-based infrastructure to support evolving Azure cloud needs. The ideal candidate will have a strong background in software, network, or systems engineering, along with hands-on experience in managing large-scale cloud and data center operations.
Responsibilities
- Respond to incidents during regular on-call rotations and resolve issues efficiently to minimize downtime.
- Design and plan scalable GPU infrastructure solutions to meet organizational capacity and performance needs.
- Collaborate with cross-functional teams to define and implement GPU infrastructure architecture that aligns with business objectives.
- Evaluate GPU technologies and recommend the best hardware and software configurations.
- Configure and deploy GPU servers, including installation and setup of hardware, software, and networking components.
- Coordinate with vendors for procurement and installation of GPUs and related infrastructure.
- Implement and manage GPU clustering setups for compute-intensive tasks.
- Utilize monitoring tools to assess GPU performance metrics and system health.
- Conduct benchmarking tests and analyze the results to identify performance bottlenecks.
- Optimize workload distribution across GPU resources to ensure maximum efficiency.
- Provide expert troubleshooting support for reporting and resolving GPU-related issues experienced by team members.
- Maintain incident response protocols to address hardware and software failures swiftly and effectively.
- Develop FAQs and knowledge base articles to streamline support processes for internal users.
- Infrastructure Maintenance:
- Schedule and perform routine maintenance, including updates to software, firmware, and drivers related to GPU systems.
- Plan and execute capacity upgrades and expansions as needed, ensuring minimal disruption to services.
- Conduct post-mortem analyses on significant incidents to improve overall system reliability.
- Write scripts for automation of deployment, configuration management, and system monitoring tasks (e.g., Python, Bash).
- Develop tools that increase productivity for engineering and data science teams using GPUs.
- Implement Infrastructure as Code (IaC) practices for efficient and repeatable deployments.
Requirements
- Bachelors or Masters Degree in Computer Science, Information Technology, or a related field.
Technical Experience:
- Proven expertise in software engineering, network engineering, or systems administration.
- Hands-on experience with managing and debugging cloud backend server and networking infrastructure and services.
- Strong understanding of enterprise network and cloud-based architectures, including experience working with Cisco and Azure.
- Experience with cloud platforms providing GPU services (e.g., AWS, Google Cloud, Azure).
- Understanding virtualization technologies (e.g., Docker, Kubernetes) and server orchestration tools.
- Knowledge of network configurations and storage solutions used in GPU environments.
- Strong understanding of GPU architectures (NVIDIA CUDA, AMD ROCm, etc.).
- Experience with AI/ML workloads, HPC, or rendering applications.
- Familiarity with PCIe, memory subsystems (DDR, HBM), and high-speed I/O.
- Understanding of Azure Pipeline , Azure DevOps.
- Demonstrated knowledge in deploying servers and network infrastructure equipment at scale.
Specialized Skills:
- Experience working with GPU hardware or related system engineering.
- Experience with:
- Data center architecture and cloud infrastructure.
- Network infrastructure design and management in hybrid environments.
- Certifications in relevant technologies such as:
- Cisco (e.g., CCNA /CCNP).
- AZ900(Manadatory) , AZ104 (Optional).
- OCI Foundations Associate (Optional)
- ITIL or equivalent certifications (Optional).
-
GPU Developer/ Engineer
2 days ago
Hyderabad, Telangana, India Mirafra Full time ₹ 2,00,000 - ₹ 12,00,000 per yearTitle: GPU Developers/ GPU validation Engineer/ LeadsLocation: Hyderabad or BangaloreDescription:C++ programmingExperience in GPU Architectures, GPU Pipelines, GPU game processing, GPU rendering image processingExperience in OpenCL, Open GL, Vulkan and profilingGFX testing, Sanity/Stability/regression and performance testing
-
Gpu Validation Engineer
2 days ago
Hyderabad, Telangana, India BITSILICA Full time ₹ 9,00,000 - ₹ 12,00,000 per yearCompany DescriptionBITSILICA is a leading Semiconductor Design Services company offering Concept-to-Silicon-to-Software solutions worldwide. With a strong engineering team of over 500 members across India, Singapore, USA, Malaysia, and Vietnam, the company drives innovation in semiconductor and embedded software domains. BITSILICA's expertise includes low...
-
GPU Engineer
1 week ago
Hyderabad, Telangana, India Mirafra Full time ₹ 1,50,000 - ₹ 28,00,000 per yearDescription:BE/BTech/ME/MTech in Computer Science or Electronics or Electrical • Masters/ Bachelors in EE/EC/CS experience in IP/SoC/ASIC Verification.Experience in writing tests for Programmable Architectures like GPU/RISC/CPU or DSP. Experience in profiling.Experience and C++ and scripting languages.Understanding of GPU/AI/ML Processor architecture
-
GPU Compiler
2 weeks ago
Hyderabad, Telangana, India Qualcomm Full time ₹ 15,00,000 - ₹ 25,00,000 per yearGeneral Summary:Our power efficient GPU solution is fundamental to enable new exciting markets like VR, IoT, AI, drone, autonomous driving etc. GPU compiler is a key component of graphics solution. We are looking for talented, self-motivated engineers to create world class GPU compiler products to enable high performance graphics and compute with low power...
-
GPU Functional Verification Sr Lead Engineer
5 days ago
Hyderabad, Telangana, India Qualcomm Full time ₹ 12,00,000 - ₹ 36,00,000 per yearCompanyQualcomm India Private LimitedJob AreaEngineering Group, Engineering Group > Hardware EngineeringGeneral SummaryMinimum Qualifications:Bachelor's degree in Computer Science, Electrical/Electronics Engineering, Engineering, or related field and 4+ years of Hardware Engineering or related work experience.ORMaster's degree in Computer Science,...
-
PyCUDA Engineer
1 day ago
Hyderabad, Telangana, India nugget Full time ₹ 12,00,000 - ₹ 36,00,000 per yearRole DescriptionThis is a full-time on-site role for a PyCUDA Engineer, located in Hyderabad. The PyCUDA Engineer will be responsible for writing and optimizing CUDA code, integrating CUDA functionalities into Python applications, and developing GPU-accelerated algorithms. Daily tasks include conducting performance testing, debugging GPU-related issues, and...
-
Sr Infrastructure automation engineer
1 week ago
Hyderabad, Telangana, India Adroit Innovative Solutions Inc Full time ₹ 15,00,000 - ₹ 25,00,000 per yearWe're Hiring: Senior Infrastructure Automation Engineer (Zero-Touch GPU Cloud Build & Upgrade)If you're a senior engineer with 10+ years in infra automation and want to push the boundaries ofZero-Touch, this role is for you.We're looking for aSenior Infrastructure Automation Engineerto lead the design and implementation of a fullyZero-Touch GPU Cloud Build &...
-
SW Lead Engineer
2 weeks ago
Hyderabad, Telangana, India Quest Global Full time ₹ 20,00,000 - ₹ 25,00,000 per yearJob Requirements Deep Knowledge of C/C++ and Python programmingExperience with Linux Commands is mustExperience with Scripting language like bash/powershellUnderstanding of various python ML frameworks like Pytorch, Transformers etcUnderstanding of various language and compiler for writing highly efficient custom Deep-Learning GPU Kernels. like...
-
AI Software System Engineer
1 week ago
Hyderabad, Telangana, India AMD Full time ₹ 12,00,000 - ₹ 36,00,000 per yearWHAT YOU DO AT AMD CHANGES EVERYTHINGAt AMD, our mission is to build great products that accelerate next-generation computing experiences - from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create...
-
Senior Customer Success Engineer
3 days ago
Hyderabad, Telangana, India DigitalOcean Full time ₹ 12,00,000 - ₹ 36,00,000 per yearDive in and do the best work of your career at DigitalOcean. Journey alongside a strong community of top talent who are relentless in their drive to build the simplest scalable cloud. If you have a growth mindset, naturally like to think big and bold, and are energized by the fast-paced environment of a true industry disruptor, you'll find your place here....