Senior LLM Engineer

6 days ago


bangalore district, India RingCentral Full time

Job Description: We are seeking an experienced AI Engineer with a strong background in Natural Language Understanding (NLU) who is passionate about pushing the boundaries of Conversational AI. In this role, you will design, develop, and deploy scalable AI solutions leveraging LLMs, Retrieval-Augmented Generation (RAG), and prompt engineering techniques to power intelligent products and services. As part of our ML/AI team, you’ll own the full lifecycle of model development, from data preparation and fine-tuning to inference optimization and deployment in production environments.

Responsibilities:

  • Design, fine-tune, and deploy LLM-based applications for Conversational AI use cases
  • Build scalable retrieval-augmented generation (RAG) pipelines that combine LLMs with structured and unstructured data sources
  • Develop prompt engineering strategies, templates, and evaluation frameworks for LLM-driven workflows
  • Collaborate with cross-functional teams to identify and implement AI-driven solutions to business problems
  • Optimize models for low-latency inference using quantization, distillation, and other model optimization techniques (e.g., ONNX, TensorRT)
  • Build robust data processing, labeling, and augmentation pipelines to improve model performance
  • Implement monitoring and evaluation systems for deployed LLMs, ensuring reliability, fairness, and safety
  • Stay current with emerging trends in LLMs, retrieval systems, and generative AI frameworks

Requirements:

  • 5-8 years of hands-on experience in NLU
  • Strong proficiency in Python, PyTorch, and related frameworks (such as Hugging Face Transformers and Sentence Transformers)
  • Proven experience developing and deploying NLP or LLM pipelines in production environments at scale
  • Solid understanding of transformer architectures and attention mechanisms
  • Proficiency in using LLM provider APIs such as OpenAI and Gemini, including prompt design, fine-tuning, and evaluation
  • Experience with model optimization techniques such as quantization, pruning, ONNX, TensorRT, or model distillation
  • Bachelor’s or Master’s degree in Computer Science, Artificial Intelligence, or a related field

Nice to Have:

  • Hands-on experience with RAG and vector databases (e.g., FAISS, Qdrant, pgvector)
  • Prior work on LLM fine-tuning, alignment, or evaluation
  • Experience with LLM orchestration frameworks such as LlamaIndex or similar tools
  • Familiarity with multi-provider LLM orchestration, integrating APIs from OpenAI, Gemini, and others for fallbacks, routing, or ensemble strategies
  • Knowledge of MLOps for LLMs, including model serving and monitoring
  • Understanding of embedding models, context management, and token optimization for scalable LLM applications


  • Senior LLM Engineer

    4 days ago


    bangalore district, India IdeaSouq Full time

    About Us At IdeaSouq, we are building the "AI operating system" to transform how private market investors and funds discover, evaluate, and manage their opportunities. Traditional investment workflows are drowning in data silos, manual screening, and overwhelming deal flow. We're a startup building the solution: an AI analyst that turns this deal flow chaos...

  • Senior LLM Engineer

    5 days ago


    bangalore, India IdeaSouq Full time

    About Us At IdeaSouq, we are building the "AI operating system" to transform how private market investors and funds discover, evaluate, and manage their opportunities. Traditional investment workflows are drowning in data silos, manual screening, and overwhelming deal flow. We're a startup building the solution: an AI analyst that turns this deal flow chaos...

  • Senior LLM Engineer

    4 days ago


    bangalore, India IdeaSouq Full time

    About Us At IdeaSouq, we are building the "AI operating system" to transform how private market investors and funds discover, evaluate, and manage their opportunities. Traditional investment workflows are drowning in data silos, manual screening, and overwhelming deal flow. We're a startup building the solution: an AI analyst that turns this deal flow chaos...


  • pune district, India Rapid7 Full time

    Principal LLM Engineer Join Rapid7: Secure the Future with AI Are you ready to lead the charge in integrating cutting-edge Large Language Models (LLMs) into world-class Cyber Security products? Rapid7 is looking for a Principal LLM Engineer with a rare combination of deep Data Science expertise, mastery of production MLOps, and 13+ years of experience. You...


  • Bangalore, India ANSR Full time

    ANSR is hiring for one of its clients. About 4flow: Headquartered in Berlin, Germany, 4flow provides consulting, software and services for logistics and supply chain management. More than 1300 team members leverage their supply chain expertise and IT know-how to best serve their customers at 20+ locations around the world. 4flow develops and implements lean,...


  • bangalore, India BigRio Full time

    Job Title: Generative AI Engineer (LLM Expert – AWS Focus) Location: Remote Employment Type: Ongoing Contract About BigRio BigRio is a Boston-based, remote-first technology consulting firm specializing in advanced data, cloud, and software engineering solutions. We partner with forward-thinking organizations to deliver scalable, secure, and high-performance...


  • bangalore, India BigRio Full time

    Job Title: Generative AI Engineer (LLM Expert – AWS Focus) Location: Remote Employment Type: Ongoing Contract About BigRio BigRio is a Boston-based, remote-first technology consulting firm specializing in advanced data, cloud, and software engineering solutions. We partner with forward-thinking organizations to deliver scalable, secure, and high-performance...

  • Data Integration

    6 days ago


    chennai district, India Chargebee Full time

    About the Role We are seeking a highly motivated Software Engineer with a strong foundation in Java (Spring Boot), data integration, and a growing expertise in Large Language Models (LLMs). This role is ideal for engineers who enjoy working at the intersection of scalable data systems and AI-driven applications, building robust pipelines while also...


  • bangalore, India BigRio Full time

    Job Title: Generative AI Engineer (LLM Expert – AWS Focus) Location: Remote Employment Type: Ongoing Contract About BigRio BigRio is a Boston-based, remote-first technology consulting firm specializing in advanced data, cloud, and software engineering solutions. We partner with forward-thinking organizations to deliver scalable, secure, and...

