Ai inference kernel engineer

4 weeks ago


Vellore, India Phinity Full time

We look forward to when AI can discover the next quantum AI accelerator, or when AI can make RL much more compute-efficient. We want to enable AI to bootstrap its own intelligence, to discover new computational paradigms. Just as Alpha Evolve discovered a 23% speedup in Gemini's critical kernels and achieved 32.5% improvements in Flash Attention, we're building the infrastructure that will enable every AI model to optimize its own compute stack. Of course, to automate algorithm and hardware discovery, we need to break the data barrier. CUDA is a low-resource language, and kernel optimization depends a lot on context and hardware that models simply are not trained on. Phinity is building the canonical training data infrastructure that will enable agentic hardware engineering and optimization, which will fuel algorithmic discovery. We are building environments for agents to learn to write kernel from a spec and optimize them on specific hardware, and eventually, to discover new hardware breakthroughs. Our customers include one of the largest frontier model labs. We're seeking top engineers for a contractor role who can optimize hardware for model training and inference workloads, who can bake their industry experience into a model. This is a hybrid Systems Engineer/AI research role where you will be looking through and debugging model reasoning traces and designing the optimal CUDA problems to teach unreleased models to automate your work in industry. Please do not apply unless you have optimized kernels before. Skill requirements: Languages: CUDA, C++, Python, Frameworks: JAX/XLA, Py Torch, Tensor Flow (at the C++ level), Pallas Libraries: cu BLAS, cu DNN, CUTLASS, CUB, Thrust Compiler Tools: NVCC, PTX assembly, MLIR/XLA understanding Hardware Knowledge: SM architecture, tensor cores, memory hierarchies (HBM, L2, shared, registers) Apply if you have: Achieved >10x speedups on production ML workloads Written kernels that outperform vendor libraries Optimized attention, GEMM, or convolution at the assembly level Built custom fusions that beat XLA/Triton compiler output Published papers or open-source kernels used in production


  • AI Engineer

    3 weeks ago


    Vellore, India SwarmLens Full time

    AI Engineer/Senior AI EngineerWe are redefining the future of autonomous intelligence and we’re looking for an exceptional Agentic AI Engineer to help us build it.Role OverviewAs an Agentic AI Engineer on our core team, you will architect, build, and ship sophisticated autonomous multi-agent systems that push beyond traditional LLM workflows. Your work...

  • AI/ML Engineer

    3 weeks ago


    Vellore, India Edstem Technologies Full time

    We are seeking an AI/ML Engineer with over 5 years of expertise in conventional machine learning and experience or interest in generative AI to develop Sports Companion GPT - Aiko for sports applications, specifically targeting Cricket and Football. The ideal candidate will have extensive experience building and optimizing ML models and working with large...


  • Vellore, India Innova ESI Full time

    Role - AI/ML Engineer Must Hv - RAG, Airflow, LangChain Exp- 7-12 Yrs Location - Pune/ BNG/Gurugram JD: Must-Have Skills - Expert in CV, NLP, speech models, and multimodal transformer models. - Strong understanding of GPU optimization, distributed training (DeepSpeed, Horovod). - Experience with real-time/near-real-time video processing. - Hands-on with: -...

  • AI Tech Architect

    3 weeks ago


    Vellore, India Recro Full time

    AI Tech Architect (7–10 yrs) — Agentic & Gen AI Platforms Location: Bengaluru / Gurugram Team: AI Platforms & Architecture Employment: Full-time Key Skills:Python, FastAPI, AWS (EKS, Bedrock, OpenSearch, S3, RDS), GenAI & RAG Architecture, Agent Frameworks (Semantic Kernel, LangGraph, AutoGen), Vector Databases, Observability (OpenTelemetry, Datadog),...


  • vellore, India beBeeArtificialIntelligence Full time

    Job Title: AI/ML Specialist for Sports Companion GPTJob OverviewWe are seeking a skilled AI/ML specialist to develop Sports Companion GPT, a cutting-edge platform for sports applications. The ideal candidate will have expertise in designing and developing AI models for Sports Companion GPT, leveraging conventional machine learning techniques and generative...

  • Ai engineer

    2 weeks ago


    Vellore, India Sutra.AI Full time

    Role: AI Engineer About Sutra. AI Sutra. AI is a rapidly growing AI Enterprise Saa S Platform company focused on building data-to-decision automation at scale . Our mission is to help enterprises transform raw data into intelligent, actionable insights through AI, automation, and decision intelligence. Role Summary We’re seeking an AI Engineer who is...

  • Ai Engineer

    3 weeks ago


    Vellore, India Whatjobs IN C2 Full time

    Location: Role is remote, however you must be based within a City location in India Our client who provides world’s leading Agent and Agentic solution platform designed for teams looking to streamline operational efficiency and scale productivity without needing to add headcount is currently hiring for an AI Engineer. What You'll Do AI/ML Engineering Build...


  • Vellore, India Debales AI Full time

    About Debales AI  Debales AI builds autonomous AI Agents that seamlessly integrate into existing systems — no new dashboards, no added workflow overhead. With 100+ integrations and 80+ specialized AI Agents, we streamline high-volume operations across Logistics, E-Commerce, and Education, helping teams scale efficiency without scaling headcount. Role...

  • Senior Ai Engineer

    3 weeks ago


    Vellore, India Whatjobs IN C2 Full time

    Company Description Salesmantu is a revenue operations advisory and a technology company. We are looking to support a client to fill a role for them in AI space. The Role We’re looking for a Senior AI Engineer to design and build Nova’s intelligence core - the models, pipelines, and adaptive agents that drive candidate-role matching, behavioural scoring,...


  • Vellore, India Whatjobs IN C2 Full time

    About Albertsons Companies Inc. : As a leading food and drug retailer in the United States, Albertsons Companies, Inc. operates over 2,200 stores across 35 states and the District of Columbia. Our well-known banners across the United States, including Albertsons, Safeway, Vons, Jewel-Osco and others, serve more than 36 million U.S customers each week. We...