Ai inference kernel engineer

6 days ago


Rajahmundry, India Phinity Full time

We look forward to when AI can discover the next quantum AI accelerator, or when AI can make RL much more compute-efficient. We want to enable AI to bootstrap its own intelligence, to discover new computational paradigms. Just as Alpha Evolve discovered a 23% speedup in Gemini's critical kernels and achieved 32.5% improvements in Flash Attention, we're building the infrastructure that will enable every AI model to optimize its own compute stack. Of course, to automate algorithm and hardware discovery, we need to break the data barrier. CUDA is a low-resource language, and kernel optimization depends a lot on context and hardware that models simply are not trained on. Phinity is building the canonical training data infrastructure that will enable agentic hardware engineering and optimization, which will fuel algorithmic discovery. We are building environments for agents to learn to write kernel from a spec and optimize them on specific hardware, and eventually, to discover new hardware breakthroughs. Our customers include one of the largest frontier model labs. We're seeking top engineers for a contractor role who can optimize hardware for model training and inference workloads, who can bake their industry experience into a model. This is a hybrid Systems Engineer/AI research role where you will be looking through and debugging model reasoning traces and designing the optimal CUDA problems to teach unreleased models to automate your work in industry. Please do not apply unless you have optimized kernels before. Skill requirements: Languages: CUDA, C++, Python, Frameworks: JAX/XLA, Py Torch, Tensor Flow (at the C++ level), Pallas Libraries: cu BLAS, cu DNN, CUTLASS, CUB, Thrust Compiler Tools: NVCC, PTX assembly, MLIR/XLA understanding Hardware Knowledge: SM architecture, tensor cores, memory hierarchies (HBM, L2, shared, registers) Apply if you have: Achieved >10x speedups on production ML workloads Written kernels that outperform vendor libraries Optimized attention, GEMM, or convolution at the assembly level Built custom fusions that beat XLA/Triton compiler output Published papers or open-source kernels used in production



  • Rajahmundry, India S2T AI - AI-Powered Investigations Full time

    We're seeking a forward-thinkingWeb Scraping Engineerwho leverages AI tools to accelerate development and streamline data extraction processes. Join our India team and work at the intersection of traditional scraping expertise and cutting-edge AI-powered development.The Role: Design and implement scalable data extraction solutions using AI to rapidly...


  • Rajahmundry, India Socratix AI Full time

    About Socratix AI Socratix AI builds AI coworkers for fraud and risk teams—agents that investigate alerts in real time and replace manual workflows with fast, explainable decisions. Our agents integrate seamlessly into existing workflows and reason across signals like human analysts, helping teams cut fraud losses, reduce false positives, and scale...

  • Agentic ai engineer

    6 days ago


    Rajahmundry, India Intellectt Inc Full time

    Agentic AI Engineer (100% Remote) Company: Intellectt Experience: 8+ Years Job Description: Intellectt is seeking a highly experienced Agentic AI Engineer to lead the design and deployment of intelligent, autonomous AI systems. The ideal candidate will have hands-on expertise in LLMs, Lang Chain, Lang Graph, RAG , and multi-agent architectures , with the...

  • AI Engineer

    2 days ago


    Rajahmundry, India Uplevyl Full time

    Job Title: AI Engineer Location: Onsite – Noida Type: Full-time About Us At Uplevyl, we're redefining what intelligent communities look like. Through our AI-powered agents, we’re building scalable, agentic community systems that reduce manual work and increase member engagement—especially for women-centric organizations. We’re looking for...

  • AI/ML Engineer

    3 weeks ago


    Rajahmundry, India People Prime Worldwide Full time

    About Client: A leading global information technology, consulting, and business process services organization, the company delivers innovative solutions that enable clients across industries to thrive in the digital era. With a strong focus on technology-driven transformation, it helps enterprises harness the power of cloud, AI, automation, and analytics to...

  • AI Engineer Intern

    2 days ago


    Rajahmundry, India Alchemic (previously Echo) Full time

    Role: AI Engineer Intern (Full-time Internship, Remote)This is a remote full-time paid internship for an AI Engineer. You will help us push the boundaries of what LLMs can do by designing, testing, and optimizing prompts, building multi-step prompt pipelines, writing scaffolding code around LLM calls, benchmarking outputs, and integrating AI features into...

  • AI Engineer

    2 weeks ago


    Rajahmundry, India Clustrex Data Private Limited Full time

    Hiring: AI Engineer – AI + Robotics Location: Madipakkam, Chennai Experience: 0-3 YearsWe’re building next-gen AI + Robotics systems and are looking for a passionate AI Engineer with strong math, deep learning, RL, and computer vision skills.What you’ll work on:Deep learning for perceptionReinforcement learning for robotic controlGenerative AI +...

  • Software Engineer

    1 week ago


    Rajahmundry, India Generative Futures Full time

    🚀 Software Engineer (Golang & Python) 🌍Are you ready for a unique opportunity to join an exciting AI startup based in the US? We’re on the hunt for a Software Engineer with strong experience in Golang and Python to help our client build cutting-edge AI infrastructure.This is the perfect chance to dive into the world of AI, work on innovative...

  • Software Engineer

    1 week ago


    Rajahmundry, India Generative Futures Full time

    🚀 Software Engineer (Golang & Python) 🌍Are you ready for a unique opportunity to join an exciting AI startup based in the US? We’re on the hunt for a Software Engineer with strong experience in Golang and Python to help our client build cutting-edge AI infrastructure.This is the perfect chance to dive into the world of AI, work on innovative...

  • Technical Co Founder

    2 weeks ago


    Rajahmundry, India GoodSpace AI Full time

    🚀 Technical Co-Founder – BOSS (Business Operating System Stack) Location: Noida or Remote Compensation: Equity Only (No Salary) Experience: 5+ YearsBOSS is a wholly owned subsidiary of Goodspace.ai, building a cognitive operating system for businesses. I’m looking for a hands-on Technical Co-Founder who can help architect and build the platform from...