LLM Systems Performance Engineer
2 weeks ago
We look forward to when AI can discover the next quantum AI accelerator, or when AI can make RL much more compute-efficient. We want to enable AI to bootstrap its own intelligence, to discover new computational paradigms. Just as AlphaEvolve discovered a 23% speedup in Gemini's critical kernels and achieved 32.5% improvements in FlashAttention, we're building the infrastructure that will enable every AI model to optimize its own compute stack. Of course, to automate algorithm and hardware discovery, we need to break the data barrier. CUDA is a low-resource language, and kernel optimization depends a lot on context and hardware that models simply are not trained on.Phinity is building the canonical training data infrastructure that will enable agentic hardware engineering and optimization, which will fuel algorithmic discovery. We are building environments for agents to learn to write kernel from a spec and optimize them on specific hardware, and eventually, to discover new hardware breakthroughs. Our customers include one of the largest frontier model labs.We're seeking top engineers for a contractor role who can optimize hardware for model training and inference workloads, who can bake their industry experience into a model. This is a hybrid Systems Engineer/AI research role where you will be looking through and debugging model reasoning traces and designing the optimal CUDA problems to teach unreleased models to automate your work in industry. Please do not apply unless you have optimized kernels before.Skill requirements:Languages: CUDA, C++, Python,Frameworks: JAX/XLA, PyTorch, TensorFlow (at the C++ level), PallasLibraries: cuBLAS, cuDNN, CUTLASS, CUB, ThrustCompiler Tools: NVCC, PTX assembly, MLIR/XLA understandingHardware Knowledge: SM architecture, tensor cores, memory hierarchies (HBM, L2, shared, registers)Apply if you have:- Achieved >10x speedups on production ML workloads - Written kernels that outperform vendor libraries - Optimized attention, GEMM, or convolution at the assembly level - Built custom fusions that beat XLA/Triton compiler output - Published papers or open-source kernels used in production
-
Distinguished LLM Engineer
2 weeks ago
New Delhi, India Trident Consulting Full timeTrident Consulting is looking for a " Distinguished LLM Engineer - Chennai/ Tirunelveli/ Coimbatore" .Role: Distinguished LLM Engineer Location: Chennai/ Tirunelveli/ Coimbatore Type: Fulltime Salary:Depends on your experience and the current market rateDo you want to use your AI expertise to drive real-world impact? We’re hiring aDistinguished LLM...
-
Senior LLM Engineer
3 weeks ago
New Delhi, India RingCentral Full timeJob Description: We are seeking an experienced AI Engineer with a strong background in Natural Language Understanding (NLU) who is passionate about pushing the boundaries of Conversational AI. In this role, you will design, develop, and deploy scalable AI solutions leveraging LLMs, Retrieval-Augmented Generation (RAG), and prompt engineering techniques to...
-
Senior LLM Engineer
2 weeks ago
New Delhi, India RingCentral Full timeJob Description:We are seeking an experienced AI Engineer with a strong background in Natural Language Understanding (NLU) who is passionate about pushing the boundaries of Conversational AI. In this role, you will design, develop, and deploy scalable AI solutions leveraging LLMs, Retrieval-Augmented Generation (RAG), and prompt engineering techniques to...
-
AI Platform Engineer
1 day ago
New Delhi, India Brainwave Science Full timeJob Title: AI Platform Engineer – Cognitive Health & LLM Systems Location: Delhi | Employment Type: Full-Time | Work Type: Onsite Welcome to Basil Health — where Neuroscience meets Artificial Intelligence to redefine mental wellness. About Basil Health Basil Health is an applied AI startup transforming healthcare through intelligent...
-
AI Platform Engineer – Cognitive Health
1 day ago
new delhi, India Brainwave Science Full timeJob Title: AI Platform Engineer – Cognitive Health & LLM Systems Location: Delhi | Employment Type: Full-Time | Work Type: Onsite Overview Welcome to Basil Health — where Neuroscience meets Artificial Intelligence to redefine mental wellness. About Basil Health Basil Health is an applied AI startup transforming healthcare through...
-
AI Platform Engineer – Cognitive Health
2 days ago
new delhi, India Brainwave Science Full timeJob Title: AI Platform Engineer – Cognitive Health & LLM Systems Location: Delhi | Employment Type: Full-Time | Work Type: Onsite Overview Welcome to Basil Health — where Neuroscience meets Artificial Intelligence to redefine mental wellness. About Basil Health Basil Health is an applied AI startup transforming healthcare through intelligent...
-
AI Platform Engineer – Cognitive Health
3 days ago
New Delhi, India Brainwave Science Full timeJob Title: AI Platform Engineer – Cognitive Health & LLM Systems Location: Delhi | Employment Type: Full-Time | Work Type: Onsite Overview Welcome to Basil Health — where Neuroscience meets Artificial Intelligence to redefine mental wellness. About Basil Health Basil Health is an applied AI startup transforming healthcare through intelligent...
-
AI Platform Engineer – Cognitive Health
2 days ago
new delhi, India Brainwave Science Full timeJob Title: AI Platform Engineer – Cognitive Health & LLM Systems Location: Delhi | Employment Type: Full-Time | Work Type: Onsite Overview Welcome to Basil Health — where Neuroscience meets Artificial Intelligence to redefine mental wellness. About Basil Health Basil Health is an applied AI startup transforming healthcare through...
-
AI Platform Engineer – Cognitive Health
15 hours ago
New Delhi, India Brainwave Science Full timeJob Title: AI Platform Engineer – Cognitive Health & LLM Systems Location: Delhi | Employment Type: Full-Time | Work Type: Onsite Overview Welcome to Basil Health — where Neuroscience meets Artificial Intelligence to redefine mental wellness. About Basil Health Basil Health is an applied AI startup transforming healthcare through intelligent...
-
AI Platform Engineer – Cognitive Health
11 hours ago
New Delhi, India Brainwave Science Full timeJob Title: AI Platform Engineer – Cognitive Health & LLM Systems Location:Delhi |Employment Type:Full-Time |Work Type:OnsiteOverviewWelcome toBasil Health— whereNeuroscience meets Artificial Intelligenceto redefine mental wellness.About Basil HealthBasil Health is anapplied AI startuptransforming healthcare through intelligent technology. Our mission is...