LLM Systems Performance Engineer
6 days ago
We look forward to when AI can discover the next quantum AI accelerator, or when AI can make RL much more compute-efficient. We want to enable AI to bootstrap its own intelligence, to discover new computational paradigms. Just as AlphaEvolve discovered a 23% speedup in Gemini's critical kernels and achieved 32.5% improvements in FlashAttention, we're building the infrastructure that will enable every AI model to optimize its own compute stack. Of course, to automate algorithm and hardware discovery, we need to break the data barrier. CUDA is a low-resource language, and kernel optimization depends a lot on context and hardware that models simply are not trained on.Phinity is building the canonical training data infrastructure that will enable agentic hardware engineering and optimization, which will fuel algorithmic discovery. We are building environments for agents to learn to write kernel from a spec and optimize them on specific hardware, and eventually, to discover new hardware breakthroughs. Our customers include one of the largest frontier model labs.We're seeking top engineers for a contractor role who can optimize hardware for model training and inference workloads, who can bake their industry experience into a model. This is a hybrid Systems Engineer/AI research role where you will be looking through and debugging model reasoning traces and designing the optimal CUDA problems to teach unreleased models to automate your work in industry. Please do not apply unless you have optimized kernels before.Skill requirements:Languages: CUDA, C++, Python,Frameworks: JAX/XLA, PyTorch, TensorFlow (at the C++ level), PallasLibraries: cuBLAS, cuDNN, CUTLASS, CUB, ThrustCompiler Tools: NVCC, PTX assembly, MLIR/XLA understandingHardware Knowledge: SM architecture, tensor cores, memory hierarchies (HBM, L2, shared, registers)Apply if you have:Achieved >10x speedups on production ML workloadsWritten kernels that outperform vendor librariesOptimized attention, GEMM, or convolution at the assembly levelBuilt custom fusions that beat XLA/Triton compiler outputPublished papers or open-source kernels used in production
-
Senior LLM Engineer
4 days ago
bangalore district, India RingCentral Full timeJob Description: We are seeking an experienced AI Engineer with a strong background in Natural Language Understanding (NLU) who is passionate about pushing the boundaries of Conversational AI. In this role, you will design, develop, and deploy scalable AI solutions leveraging LLMs, Retrieval-Augmented Generation (RAG), and prompt engineering techniques to...
-
Generative AI Engineer
4 days ago
bangalore, India BigRio Full timeJob Title: Generative AI Engineer (LLM Expert – AWS Focus) Location: Remote Employment Type: Ongoing Contract About BigRio BigRio is a Boston-based, remote-first technology consulting firm specializing in advanced data, cloud, and software engineering solutions . We partner with forward-thinking organizations to deliver scalable, secure, and...
-
Generative AI Engineer
5 days ago
bangalore, India BigRio Full timeJob Title: Generative AI Engineer (LLM Expert – AWS Focus) Location: Remote Employment Type: Ongoing Contract About BigRio BigRio is a Boston-based, remote-first technology consulting firm specializing in advanced data, cloud, and software engineering solutions . We partner with forward-thinking organizations to deliver scalable, secure, and...
-
Generative AI Engineer
6 days ago
bangalore, India BigRio Full timeJob Title: Generative AI Engineer (LLM Expert – AWS Focus)Location: Remote Employment Type: Ongoing ContractAbout BigRioBigRio is a Boston-based, remote-first technology consulting firm specializing in advanced data, cloud, and software engineering solutions. We partner with forward-thinking organizations to deliver scalable, secure, and high-performance...
-
Generative AI Engineer
4 days ago
bangalore, India BigRio Full timeJob Title: Generative AI Engineer (LLM Expert – AWS Focus)Location: RemoteEmployment Type: Ongoing ContractAbout BigRioBigRio is a Boston-based, remote-first technology consulting firm specializing in advanced data, cloud, and software engineering solutions. We partner with forward-thinking organizations to deliver scalable, secure, and high-performance...
-
Senior LLM Engineer
3 days ago
bangalore, India IdeaSouq Full timeAbout Us At IdeaSouq, we are building the "AI operating system" to transform how private market investors and funds discover, evaluate, and manage their opportunities. Traditional investment workflows are drowning in data silos, manual screening, and overwhelming deal flow. We're a startup building the solution: an AI analyst that turns this deal flow chaos...
-
Agentic AI Developer – LLM Systems
1 week ago
bangalore, India AIMLEAP Full timeAgentic AI Developer – LLM Systems & AutomationExperience: 3–5 YearsLocation: Remote (WFH)Mode of Engagement: Full-timeNo of Positions: 4Educational Qualification: B.E./B.Tech/M.E./M.Tech in Computer Science, AI/ML, or relatedIndustry: IT – AI/ML & Automation ServicesNotice Period: Immediate JoinerWhat We Are Looking ForAI & LLM...
-
Agentic AI Developer – LLM Systems
1 week ago
bangalore, India AIMLEAP Full timeAgentic AI Developer – LLM Systems & Automation Experience: 3–5 Years Location: Remote (WFH) Mode of Engagement: Full-time No of Positions: 4 Educational Qualification: B.E./B.Tech/M.E./M.Tech in Computer Science, AI/ML, or related Industry: IT – AI/ML & Automation Services Notice Period: Immediate Joiner What We Are Looking For AI & LLM...
-
Senior LLM Engineer
3 days ago
bangalore, India IdeaSouq Full timeAbout UsAt IdeaSouq, we are building the "AI operating system" to transform how private market investors and funds discover, evaluate, and manage their opportunities.Traditional investment workflows are drowning in data silos, manual screening, and overwhelming deal flow. We're a startup building the solution: an AI analyst that turns this deal flow chaos...
-
.NET Developer with LLM
1 week ago
bangalore, India APPIT Software Inc Full timeFREELANCEJob Title: .NET Developer with LLM ExperienceJob Description:We are seeking a highly skilled .NET Developer with at least 8 years of experience in .NET development and a minimum of 3 years of hands-on experience with LLMs (Large Language Models). The ideal candidate will have a deep understanding of both .NET technologies and modern AI frameworks,...