AI Inference Kernel Engineer

4 weeks ago

Delhi, India Phinity Full time

We look forward to when AI can discover the next quantum AI accelerator, or when AI can make RL much more compute-efficient. We want to enable AI to bootstrap its own intelligence, to discover new computational paradigms. Just as AlphaEvolve discovered a 23% speedup in Gemini's critical kernels and achieved 32.5% improvements in FlashAttention, we're building the infrastructure that will enable every AI model to optimize its own compute stack. Of course, to automate algorithm and hardware discovery, we need to break the data barrier. CUDA is a low-resource language, and kernel optimization depends a lot on context and hardware that models simply are not trained on. Phinity is building the canonical training data infrastructure that will enable agentic hardware engineering and optimization, which will fuel algorithmic discovery. We are building environments for agents to learn to write kernel from a spec and optimize them on specific hardware, and eventually, to discover new hardware breakthroughs. Our customers include one of the largest frontier model labs. We're seeking top engineers for a contractor role who can optimize hardware for model training and inference workloads, who can bake their industry experience into a model. This is a hybrid Systems Engineer/AI research role where you will be looking through and debugging model reasoning traces and designing the optimal CUDA problems to teach unreleased models to automate your work in industry. Please do not apply unless you have optimized kernels before. Skill requirements: Languages: CUDA, C++, Python, Frameworks: JAX/XLA, PyTorch, TensorFlow (at the C++ level), Pallas Libraries: cuBLAS, cuDNN, CUTLASS, CUB, Thrust Compiler Tools: NVCC, PTX assembly, MLIR/XLA understanding Hardware Knowledge: SM architecture, tensor cores, memory hierarchies (HBM, L2, shared, registers) Apply if you have: Achieved >10x speedups on production ML workloads Written kernels that outperform vendor libraries Optimized attention, GEMM, or convolution at the assembly level Built custom fusions that beat XLA/Triton compiler output Published papers or open-source kernels used in production

AI Inference Kernel Engineer

4 weeks ago

Delhi, India Phinity Full time

We look forward to when AI can discover the next quantum AI accelerator, or when AI can make RL much more compute-efficient. We want to enable AI to bootstrap its own intelligence, to discover new computational paradigms. Just as AlphaEvolve discovered a 23% speedup in Gemini's critical kernels and achieved 32.5% improvements in FlashAttention, we're...
AI Accelerator Kernel Engineer

3 weeks ago

New Delhi, India Turiyam AI Full time

Company Overview:At TuriyamAI, we are pioneering world leading GenAI semiconductor solutions from India, for India and the World. Our breakthrough solutions are set to redefine the future of AI computing, driving unparalleled efficiency, performance, and accessibility for enterprises worldwide.Job Description:We are looking for an exceptional engineers to...
AI Accelerator Kernel Engineer

1 week ago

New Delhi, India Turiyam AI Full time

Company Overview: At TuriyamAI, we are pioneering world leading GenAI semiconductor solutions from India, for India and the World. Our breakthrough solutions are set to redefine the future of AI computing, driving unparalleled efficiency, performance, and accessibility for enterprises worldwide.Job Description: We are looking for an exceptional engineers to...
AI Accelerator Kernel Engineer

3 weeks ago

New Delhi, India Turiyam AI Full time

Company Overview: At TuriyamAI, we are pioneering world leading GenAI semiconductor solutions from India, for India and the World. Our breakthrough solutions are set to redefine the future of AI computing, driving unparalleled efficiency, performance, and accessibility for enterprises worldwide.Job Description: We are looking for an exceptional engineers to...
AI Accelerator Kernel Engineer

29 minutes ago

New Delhi, India Turiyam AI Full time

Company Overview:At TuriyamAI, we are pioneering world leading GenAI semiconductor solutions from India, for India and the World. Our breakthrough solutions are set to redefine the future of AI computing, driving unparalleled efficiency, performance, and accessibility for enterprises worldwide.Job Description:We are looking for an exceptional engineers to...
Research Engineer – Generative AI

2 weeks ago

Delhi, India Abacus.AI Full time

Research Engineer – Generative AI (LLMs)Location:RemoteAbacus.AI is a leading Generative AI company building a future where AI assists and automates most work and business processes for enterprises and professionals.We are looking for a Research Engineer to help design, train, and optimize large language models and high‑performance inference systems.What...
Research Engineer – Generative AI

1 week ago

Delhi, India Abacus.AI Full time

Research Engineer – Generative AI (LLMs) Location: Remote Abacus.AI is a leading Generative AI company building a future where AI assists and automates most work and business processes for enterprises and professionals. We are looking for a Research Engineer to help design, train, and optimize large language models and high‑performance inference systems....
Research engineer – generative ai

3 days ago

Delhi, India Abacus.AI Full time

Research Engineer – Generative AI (LLMs)Location:RemoteAbacus. AI is a leading Generative AI company building a future where AI assists and automates most work and business processes for enterprises and professionals.We are looking for a Research Engineer to help design, train, and optimize large language models and high‑performance inference...
Research Engineer – Generative AI

1 week ago

New Delhi, India Abacus.AI Full time

Research Engineer – Generative AI (LLMs) Location:RemoteAbacus.AI is a leading Generative AI company building a future where AI assists and automates most work and business processes for enterprises and professionals. We are looking for a Research Engineer to help design, train, and optimize large language models and high‑performance inference...
Research Engineer – Generative AI

1 week ago

New Delhi, India Abacus.AI Full time

Research Engineer – Generative AI (LLMs) Location:RemoteAbacus.AI is a leading Generative AI company building a future where AI assists and automates most work and business processes for enterprises and professionals. We are looking for a Research Engineer to help design, train, and optimize large language models and high‑performance inference...

Americas

Europe

Asia / Oceania

Africa

AI Inference Kernel Engineer