Senior Researcher – GPU Performance

1 week ago


Bangalore, Karnataka, India Microsoft Full time

Generative AI is transforming how people create, collaborate, and communicate, redefining productivity across Microsoft 365 and for our customers globally. At Microsoft, we run the biggest platform for collaboration and productivity in the world, with hundreds of millions of consumer and enterprise users. Tackling AI efficiency challenges is crucial for delivering these experiences at scale.

Within our Microsoft-wide Systems Innovation initiative, we are working to advance efficiency across AI systems, where we look at novel designs and optimizations across AI stacks: models, AI frameworks, cloud infrastructure, and hardware. We are an Applied Research team driving mid- and long-term product innovations. We collaborate closely with multiple research teams and product groups across the globe, who bring a multitude of technical expertise in cloud systems, machine learning, and software engineering. We communicate our research both internally and externally through academic publications, open-source releases, blog posts, patents, and industry conferences. We also collaborate with academic and industry partners to advance the state of the art and target material product impact that will affect hundreds of millions of customers.

We are looking for a Senior Researcher - Hardware/Software Codesign to explore hardware kernel-level optimizations that deliver significant efficiency gains for Large Language Models and Generative AI experiences. The ideal candidate will have a strong background in GPU architecture, accelerator design, machine learning, or systems research, and the ambition to apply it to large-scale production systems. This role combines deep technical expertise in GPU architecture with practical implementation skills to create efficient, scalable computational kernels. The ideal candidate must also have a demonstrated history of solving hard technical problems and be motivated to tackle the hardest problems in building a full end-to-end AI stack. An entrepreneurial approach and the ability to take initiative and move fast are essential.

Microsoft's mission is to empower every person and every organization on the planet to achieve more. As employees, we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

Responsibilities

  • Design, implement, and optimize GPU kernels for complex computational workloads such as AI inferencing.
  • Research and develop novel optimization techniques for generation of GPU kernels.
  • Profile and analyze kernel performance using advanced diagnostic tools.
  • Generate automated solutions for kernel optimization and tuning.
  • Collaborate with other researchers to improve model performance.
  • Document optimization strategies and maintain performance benchmarks.
  • Contribute to the development of internal GPU computing frameworks.

Qualifications

Required Qualifications

  • Doctorate in a relevant field OR equivalent experience.
  • Solid understanding of GPU architecture, memory hierarchies, parallel computing, and algorithm optimization.
  • Hands-on experience in GPU programming, including performance profiling and optimization tools.
  • Advanced C programming skills.

Other Requirements

Ability to meet Microsoft, customer, and/or government security screening requirements is required for this role. These requirements include, but are not limited to, the following specialized security screenings:

  • Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.

Preferred Qualifications

  • 5 years of experience in GPU programming and optimization; expert knowledge of CUDA, ROCm, Triton, PTX, CUTLASS, or similar GPU programming frameworks.
  • Experience with machine learning frameworks (PyTorch, TensorFlow).
  • Familiarity with compiler optimization techniques and a background in auto-tuning and automated code generation.
  • Publication record in relevant conferences or journals (MLSys, NeurIPS, ICML, ICLR, AISTATS, ACL, EMNLP, NAACL, ISCA, MICRO, ASPLOS, HPCA, SOSP, OSDI, NSDI, etc.).

Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations, and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request via the M365Core M365Research Research



  • Bangalore, Karnataka, India Intel Full time

    Job Details: Performs functional verification of graphics logic components, including 3D graphics, media, and display, to ensure the design will meet specification requirements. Defines and develops scalable and reusable IP verification plans, test benches, and architecture for the verification environment to ensure coverage and conformance to graphics microarchitecture...


  • bangalore, India Best NanoTech Full time

    About the Company: Undisputed leader in AI computing. Our client is the world's leading pioneer in accelerated computing. Originally known for inventing the GPU and revolutionizing gaming, they are now the primary force powering the AI era, providing the infrastructure for everything from self-driving cars to ChatGPT. You will be joining a trillion-dollar...


  • Bengaluru, Karnataka, India Advanced Micro Devices, Inc Full time

    Overview: **WHAT YOU DO AT AMD CHANGES EVERYTHING** We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences - the building blocks for the data center, artificial intelligence, PCs, gaming and embedded....


  • GPU Compiler Expert

    4 days ago


    bangalore, India beBeeCompiler Full time

    We are seeking a seasoned expert in GPU compiler development to join our esteemed team of engineers. This is an exceptional opportunity to leverage your technical expertise and contribute to the development of cutting-edge tools that enable the creation of high-performance applications and libraries for HPC, DL, and Autonomous Driving domains. About the...


  • bangalore, India beBeeArchitecture Full time

    AI GPU Software Architect. Job Description: As a Senior AI GPU Software Architect, you will lead the design and development of the complete software ecosystem for our novel GPU backend. This includes compilers, drivers, runtimes, cloud integration, and AI frameworks designed to support next-generation AI workloads and graphics. You will work closely with hardware...


  • Bangalore, Karnataka, India Qualcomm Full time

    Company: Qualcomm India Private Limited. Job Area: Engineering Group, Engineering Group > Hardware Engineering. General Summary: Qualcomm is a company of inventors that unlocked 5G, ushering in an age of rapid acceleration in connectivity and new possibilities. But this is just the beginning. It takes inventive minds with diverse skills, backgrounds, and cultures to...