AI Inference Kernel Engineer

6 days ago


Bangalore, India Phinity Full time

We look forward to when AI can discover the next quantum AI accelerator, or when AI can make RL much more compute-efficient. We want to enable AI to bootstrap its own intelligence and discover new computational paradigms. Just as AlphaEvolve discovered a 23% speedup in Gemini's critical kernels and achieved 32.5% improvements in FlashAttention, we're building the infrastructure that will enable every AI model to optimize its own compute stack.

Of course, to automate algorithm and hardware discovery, we need to break the data barrier. CUDA is a low-resource language, and kernel optimization depends heavily on context and hardware details that models are simply not trained on.

Phinity is building the canonical training data infrastructure that will enable agentic hardware engineering and optimization, which will fuel algorithmic discovery. We are building environments for agents to learn to write kernels from a spec and optimize them on specific hardware, and eventually to discover new hardware breakthroughs. Our customers include one of the largest frontier model labs.

We're seeking top engineers for a contractor role who can optimize hardware for model training and inference workloads and bake their industry experience into a model. This is a hybrid Systems Engineer/AI research role in which you will review and debug model reasoning traces and design the optimal CUDA problems to teach unreleased models to automate your work in industry.

Please do not apply unless you have optimized kernels before.

Skill requirements:

  • Languages: CUDA, C++, Python
  • Frameworks: JAX/XLA, PyTorch, TensorFlow (at the C++ level), Pallas
  • Libraries: cuBLAS, cuDNN, CUTLASS, CUB, Thrust
  • Compiler Tools: NVCC, PTX assembly, MLIR/XLA understanding
  • Hardware Knowledge: SM architecture, tensor cores, memory hierarchies (HBM, L2, shared, registers)

Apply if you have:

  • Achieved >10x speedups on production ML workloads
  • Written kernels that outperform vendor libraries
  • Optimized attention, GEMM, or convolution at the assembly level
  • Built custom fusions that beat XLA/Triton compiler output
  • Published papers or open-source kernels used in production





  • Bangalore, India Turiyam AI Full time

    Company Overview: At TuriyamAI, we are pioneering world-leading GenAI semiconductor solutions from India, for India and the World. Our breakthrough solutions are set to redefine the future of AI computing, driving unparalleled efficiency, performance, and accessibility for enterprises worldwide. Job Description: We are looking for exceptional engineers to...


  • Bangalore, India Turiyam AI Full time

    Company Overview: At TuriyamAI, we are pioneering world-leading GenAI semiconductor solutions from India, for India and the World. Our breakthrough solutions are set to redefine the future of AI computing, driving unparalleled efficiency, performance, and accessibility for enterprises worldwide. Job Description: We are looking for exceptional engineers to...


  • Bangalore Division, India Turiyam AI Full time

    Company Overview: At TuriyamAI, we are pioneering world-leading GenAI semiconductor solutions from India, for India and the World. Our breakthrough solutions are set to redefine the future of AI computing, driving unparalleled efficiency, performance, and accessibility for enterprises worldwide. Job Description: We are looking for exceptional engineers to...


  • Bangalore District, India Turiyam AI Full time

    Company Overview: At TuriyamAI, we are pioneering world-leading GenAI semiconductor solutions from India, for India and the World. Our breakthrough solutions are set to redefine the future of AI computing, driving unparalleled efficiency, performance, and accessibility for enterprises worldwide. Job Description: We are looking for exceptional engineers to...

  • AI Kernel Developer

    1 week ago


    Bangalore, India Intel Corporation Full time

    Job Details: Job Description: We are looking for dynamic and passionate contributors to work on Intel's next-generation GPUs. Works with internal engineering teams and stakeholders to deliver optimized complex math kernel operators for Intel GPUs. Partners with AI framework and workload engineers as needed to optimize end-to-end AI models on Intel hardware...

  • Kernel Engineer

    14 hours ago


    Bangalore, India People's Growth HR Solutions Full time

    Job Title: Kernel Engineer. Location: Bangalore, India. Experience: 4+ Years. Key Responsibilities: Design, develop, and maintain the core operating system of our products. Collaborate with the software development team to integrate new features and functionalities into the kernel. Identify and troubleshoot any issues related to the kernel and provide timely solutions...


  • Bangalore, India Sustainability Economics.ai Full time

    Location: Bengaluru, Karnataka. About the Company: Sustainability Economics.ai is a global organization, pioneering the convergence of clean energy and AI, enabling profitable energy transitions while powering end-to-end AI infrastructure. By integrating AI-driven cloud solutions with sustainable energy, we create scalable, intelligent ecosystems that...


  • Bangalore, India Sustainability Economics.ai Full time

    Location: Bengaluru, Karnataka. About the Company: Sustainability Economics.ai is a global organization, pioneering the convergence of clean energy and AI, enabling profitable energy transitions while powering end-to-end AI infrastructure. By integrating AI-driven cloud solutions with sustainable energy, we create scalable, intelligent ecosystems...


  • Bangalore, India Linux Kernel & LDD Full time

    Role Description: We're seeking an enthusiastic Software Engineering Intern for a 6-month full-time hybrid position in Bengaluru. This internship offers hands-on experience in: Linux kernel basics and device driver fundamentals; practical embedded systems development; real-world project implementation. Learning Outcomes: During this internship, you will:...