Full Stack LLM Engineer

1 week ago


Bengaluru Karnataka India, Karnataka Cerebras Systems Full time

Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include global corporations across multiple industries, national labs, and top-tier healthcare systems. In January, we announced a multi-year, multi-million-dollar partnership with Mayo Clinic, underscoring our commitment to transforming AI applications across various fields. In August, we launched Cerebras Inference, the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. About The RoleThis teams' principal responsibility is to rapidly bring up state-of-the-art open-source models, frameworks and data engineering. Success in this role requires a system-minded generalist who thrives in fast-paced bringup environments and is comfortable working across the entire software stack. Your work will play a critical role in achieving unprecedented levels of performance, efficiency, and scalability for AI applications.ResponsibilitiesContribute to the end-to-end bring up of frameworks for RL, inference serving, ML models on Cerebras CSX systems.Work across the stack: model architecture translation, graph lowering, compiler optimizations, runtime integration, and performance tuning.Debug performance and correctness issues spanning model code, compiler IRs, runtime behavior, and hardware utilization.Propose and prototype improvements across tools, APIs, or automation flows to accelerate future bring ups.Skills & QualificationsBachelor’s, Master’s, or PhD in Computer Science, Engineering, or a related field with 8 to 12 years’ experience.Comfort navigating the full AI toolchain: Python modelling code, compiler IRs, performance profiling, etc.Strong debugging skills across performance, numerical accuracy, and runtime integration.Experience with deep learning frameworks (e.g., PyTorch, TensorFlow) and familiarity with model internals (e.g., attention, MoE, diffusion).Proficiency in C/C++ programming and experience with low-level optimization.Strong background in optimization techniques, particularly those involving NP-hard problems.What We OfferCompetitive salary and benefits package.Opportunities for professional growth and career advancement.A dynamic and innovative work environment.The chance to work on cutting-edge technologies and make a significant impact on the future of AI. Why Join CerebrasPeople who are serious about software make their own hardware. At Cerebras we have built a breakthrough architecture that is unlocking new opportunities for the AI industry. With dozens of model releases and rapid growth, we’ve reached an inflection point in our business. Members of our team tell us there are five main reasons they joined Cerebras:Build a breakthrough AI platform beyond the constraints of the GPU.Publish and open source their cutting-edge AI research.Work on one of the fastest AI supercomputers in the world.Enjoy job stability with startup vitality.Our simple, non-corporate work culture that respects individual beliefs.Read our blog: Five Reasons to Join Cerebras in 2025.Apply today and become part of the forefront of groundbreaking advancements in AICerebras Systems is committed to creating an equal and diverse environment and is proud to be an equal opportunity employer. We celebrate different backgrounds, perspectives, and skills. We believe inclusive teams build better products and companies. We try every day to build a work environment that empowers people to do their best work through continuous learning, growth and support of those around them.This website or its third-party tools process personal data. For more details, click here to review our CCPA disclosure notice.


  • Full Stack Engineer

    2 weeks ago


    Bengaluru, Karnataka, India, Karnataka Recro Full time

    Who You AreYou're a seasoned Full Stack Engineer with 5+ years of experience who thrives on buildingproducts from zero to one. You're a technical leader who can own an entire product vertical,make architectural decisions, and mentor junior engineers. You understand that in anearly-stage startup, you need to balance technical excellence with pragmatic choices...

  • Full Stack Engineer

    2 weeks ago


    Bengaluru, Karnataka, India, Karnataka Recro Full time

    You're a Full Stack Engineer with 2+ years of experience who enjoys building products thatcreate real-world impact. You're comfortable working across the entire technology stack andeager to take on new challenges that accelerate your growth. You understand that in anearly-stage startup, you need to be versatile, move quickly, and contribute to products that...

  • Full Stack Engineer

    1 week ago


    Bengaluru, Karnataka, India, Karnataka Recro Full time

    About the jobYou’re a senior full stack engineer (2+ years) who thrives in fast-paced, early-stage environments. You’re eager to build products from scratch, own an entire product line, and lead a small but growing engineering team. You balance technical rigor with the pragmatism required to ship quickly, and you’re comfortable making architectural...


  • Bengaluru, Karnataka, India, Karnataka Falcon Reality Full time

    We are seeking a skilled Full-Stack AI Engineer to join our dynamic team. The ideal candidate will have hands-on experience with LangChain, large language models (LLMs), Fine Tuning, and API integration. You will play a key role in developing AI agents, implementing Retrieval-Augmented Generation (RAG), and managing data scraping and structuring...

  • Full Stack Engineer

    1 week ago


    Bengaluru, Karnataka, India, Karnataka TechConnexions - Startup Hiring Specialists Full time

    Job Title: Full-Stack Engineer Company: [Funded Startup in Observability space]Location: Bangalore, IndiaType: Full-Time | On-site (Hybrid) Experience:2-4 yrsBudget: 14-15LPA NP: Max 30 daysAbout Us:We’re a leading technology firm delivering innovative software solutions in the Observability space, revolutionizing how businesses gain insights into their...


  • Bengaluru, Karnataka, India, Karnataka Droisys Full time

    About Company,Droisys is an innovation technology company focused on helping companies accelerate their digital initiatives from strategy and planning through execution. We leverage deep technical expertise, Agile methodologies, and data-driven intelligence to modernize systems of engagement and simplify human/tech interaction.Amazing things happen when we...

  • Full Stack Engineer

    1 week ago


    Bengaluru, Karnataka, India TechConnexions - Startup Hiring Specialists Full time ₹ 15,00,000 - ₹ 18,00,000 per year

    Job Title: Full-Stack EngineerCompany: [Funded Startup in Observability space]Location: Bangalore, IndiaType: Full-Time | On-site (Hybrid)Experience:2-4 yrsBudget: 14-15LPANP: Max 30 daysAbout Us:We're a leading technology firm deliveringinnovative software solutions in theObservability space,revolutionizing how businesses gain insights into their systems...


  • Bengaluru, Karnataka, India, Karnataka Luxoft Full time

    Project descriptionWe need a Python Developer to work for a leading investment bank client.ResponsibilitiesDesign, develop, and maintain full-stack Python applications with modern frontend frameworksBuild and optimize RAG (Retrieval-Augmented Generation) systems for AI applicationsCreate and implement efficient vector databases and knowledge storesDevelop...

  • LLM Engineer

    3 days ago


    Bengaluru, Karnataka, India Net Connect Full time ₹ 20,00,000 - ₹ 25,00,000 per year

    Notice Period: Immediate to 15 DaysRole Overview:We are seeking a skilled and innovative LLM Engineer to design, implement, and optimize advanced GenAI solutions across drug discovery, clinical development, and manufacturing. The ideal candidate will have hands-on experience with LLMs, deep learning, MLOps, and agentic frameworks, with a strong focus on...

  • Full Stack

    1 day ago


    Bengaluru, Karnataka, India PostQode Full time ₹ 20,00,000 - ₹ 25,00,000 per year

    Location: HyderabadExperience: 6-7 years (Full Stack, AI/ML, and Agentic Application Expertise)Why PostQode?Step into the future of software engineering at PostQode, where intelligent agents redefine the entire testing lifecycle—from APIs to end-to-end application workflows. As pioneers in Agentic AI for continuous testing, PostQode empowers Engineering...