Senior Researcher – GPU Performance

1 week ago


Bengaluru, Karnataka, India Microsoft Full time ₹ 8,00,000 - ₹ 24,00,000 per year
Generative AI is transforming how people create, collaborate, and communicate - redefining productivity across Microsoft 365 and our customers globally. At Microsoft, we run the biggest platform for collaboration and productivity in the world with hundreds of millions of consumer/enterprise users. Tackling AI efficiency challenges is crucial for delivering these experiences at scale.


Within our Microsoft wide Systems Innovation initiative, we are working to advance efficiency across AI systems, where we look at novel designs and optimizations across AI stacks: models, AI frameworks, cloud infrastructure, and hardware. We are an Applied Research team driving mid- and long-term product innovations. We closely collaborate with multiple research teams and product groups across the globe who bring a multitude of technical expertise in cloud systems, machine learning and software engineering. We communicate our research both internally and externally through academic publications, open-source releases, blog posts, patents, and industry conferences. Further, we also collaborate with academic and industry partners to advance the state of the art and target material product impact that will affect 100s of millions of customers.

We are looking for a Senior Researcher – Hardware/Software Codesign researcher to explore hardware/kernel-level optimizations to deliver significant efficiency gains for Large Language Models and Generative AI experiences.

The ideal candidate will have a strong background in GPU architecture, accelerator design, machine learning, or systems research and the ambition to apply them to large scale production systems. This role combines deep technical expertise in GPU architecture with practical implementation skills to create efficient, scalable computational kernels. Further, the ideal candidate must have demonstrated a history of solving hard technical problems and is motivated to tackle the hardest problems in building a full end-to-end AI stack. An entrepreneurial approach and ability to take initiative and move fast are essential.

Have a look at this link for reading: Efficient AI - Microsoft Research

Microsoft's mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.



  • Bengaluru, Karnataka, India Microsoft Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Generative AI is transforming how people create, collaborate, and communicate - redefining productivity across Microsoft 365 and our customers globally. At Microsoft, we run the biggest platform for collaboration and productivity in the world with hundreds of millions of consumer/enterprise users. Tackling AI efficiency challenges is crucial for delivering...


  • Bengaluru, Karnataka, India Careernet Full time ₹ 12,00,000 - ₹ 24,00,000 per year

    Key Skills: Triton, C++, GPU Runtime Optimization, Multi-GPU Systems, TVM, XLA, MLIR, ROCm, Transformer Inference.Roles & Responsibilities:Architect high-performance inference runtimes, kernel dispatchers, and memory planners for large diffusion and transformer workloads.Lead investigations into cross-GPU performance bottlenecks, communication overheads, and...


  • Bengaluru, Karnataka, India Qualcomm Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Company:Qualcomm India Private LimitedJob Area:Engineering Group, Engineering Group > Systems EngineeringGeneral Summary:Job DescriptionResponsibilities:This position will be responsible for research, analysis and improvement of Qualcomm's Adreno GPU compiler and system performance to our world wide customers. From the analyses and experiments on GPU shaders...


  • Bengaluru, Karnataka, India Imagination Technologies Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    The roleYou will be part of a diverse and distributed team of engineers who maintain and develop our GPU compiler software, supporting a range of graphics and compute APIs while targeting multiple GPU generations with varying ISAs. The GPU compiler is a central part of the drivers that we develop for these APIs. As such, they are critical to achieving...


  • Bengaluru, Karnataka, India Norwin Technologies Full time ₹ 6,00,000 - ₹ 18,00,000 per year

    Title: GPU + Kubernetes ExpertLocation: BangaloreExperience: 8+ YearsJob Description:Were looking for an experienced Infrastructure Engineer with a strong background in Kubernetes (K8s), GPU-based workloads, and scaling large distributed systems. We need builders, not just maintainers. Ideal candidates will have hands-on experience developing and...


  • Bengaluru, Karnataka, India Norwin Technologies Full time ₹ 20,00,000 - ₹ 25,00,000 per year

    Dear Candidate,Were looking for an experiencedInfrastructure Engineerwith a strong background inKubernetes (K8s),GPU-based workloads, andscaling large distributed systems. We need builders, not just maintainers. Ideal candidates will have hands-on experiencedeveloping and stress-testing infrastructure at scale, not just reporting bottlenecks, but solving...


  • Bengaluru, Karnataka, India Qualcomm Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Company:Qualcomm India Private LimitedJob Area:Engineering Group, Engineering Group > Hardware EngineeringGeneral Summary: As a leading technology innovator, Qualcomm pushes the boundaries of what's possible to enable next-generation experiences and drives digital transformation to help create a smarter, connected future for all. As a Qualcomm Hardware...


  • Bengaluru, Karnataka, India Qualcomm Full time ₹ 8,00,000 - ₹ 12,00,000 per year

    Company:Qualcomm India Private LimitedJob Area:Engineering Group, Engineering Group > Hardware EngineeringGeneral Summary:As a leading technology innovator, Qualcomm pushes the boundaries of what's possible to enable next-generation experiences and drives digital transformation to help create a smarter, connected future for all. As a Qualcomm Hardware...


  • Bengaluru, Karnataka, India Plural Hire Full time ₹ 5,00,000 - ₹ 15,00,000 per year

    Hiring for a VC Backed Deep Tech StartupAbout This RoleWe are seeking a highly motivated HPC Performance Profiling Intern to join our High-PerformanceComputing (HPC) team. The intern will focus on CPU/MPI/GPU performance profiling andoptimization for our advanced HPC simulation and optimization frameworks. This role is critical toaddressing current...


  • Bengaluru, Karnataka, India Qualcomm Full time ₹ 9,00,000 - ₹ 12,00,000 per year

    Company:Qualcomm India Private LimitedJob Area:Engineering Group, Engineering Group > Systems EngineeringGeneral Summary:Analyze and evaluate GPU architecture/microarchitecture and workload for performance and power optimizationsExperiance in Artificial intelligenceGPU power modeling and estimation for projection and correlationGPU workload analysis,...