LLM Application

20 hours ago


India AI Guru Full time

Job Description

Company Description

AI Guru builds innovative AI augmentation tools that significantly enhance the capabilities of elite professionals. Our mission is to democratize the use of AI superpowers, ensuring every ambitious professional can benefit from the advanced capabilities traditionally reserved for top consulting firms and Fortune 500 companies. Headquartered by industry veterans who have designed leading AI systems at esteemed organizations like Bloomberg, AWS, and Cerebras, AI Guru's tools are trusted by over 20,000 professionals globally, driving substantial career advancements and operational efficiencies.

Role Description

Design and implement the application layer that connects large language models (LLMs) to real-world data pipelines. You will build and maintain the orchestration logic that retrieves relevant context, feeds it to LLMs, and returns reliable, structured outputs for production systems.

Key Responsibilities

- Architect and maintain the end-to-end LLM orchestration pipeline (retrieval prompt construction model call post-processing).
- Create reusable prompt templates and dynamic context builders for diverse data sources.
- Develop deterministic post-processing and validation layers (schema enforcement, range/regex checks).
- Integrate LLM outputs into backend APIs and user-facing applications.
- Monitor and optimize LLM performance for latency, accuracy, and cost.
- Collaborate with backend, data, and QA teams to improve accuracy and robustness.
- Implement safeguards such as rate limiting, fallback strategies, and prompt versioning.

Required Skills & Experience

- Strong programming skills in Python or TypeScript/Node.js for production services.
- Hands-on experience with LLM frameworks (e.g., LangChain, LlamaIndex, or similar orchestration tools).
- Expertise in prompt engineering and structured output handling (e.g., JSON schemas).
- Familiarity with vector databases (Pinecone, Weaviate, pgvector, etc.) and retrieval strategies.
- Knowledge of CI/CD pipelines, containerization (Docker/Kubernetes), and cloud deployment (AWS/GCP/Azure).
- Strong testing habits for data- and prompt-driven applications.

Nice to Have

- Experience with unstructured data (documents, email, audio, etc.) or information extraction.
- Background in evaluation metrics for retrieval and generation (recall@k, F1, nDCG).
- Understanding of event-driven architectures and message queues (Kafka/SQS).

======

No head hunters please


  • Senior AI Engineer

    2 weeks ago


    India beBeeArtificial Full time US$ 1,04,000 - US$ 1,30,878

    Job DescriptionWe are seeking an experienced engineer to join our team and work on the development of Large Language Model (LLM) applications.The successful candidate will be responsible for translating raw model capability into lean, reliable, and user-ready features. This includes building MCP servers, architecting RAG pipelines, automating LLM...

  • LLM Backend Engineer

    2 weeks ago


    India Sparsa AI Full time

    Sparsa AI is a Singapore and Germany based Industrial-AI Startup, building the next generation of agentic AI platform to transform how physical industries—such as manufacturing and logistics—make decisions and optimize their operations. Our AI agents orchestrate complex workflows across business functions and enterprise applications including ERP, MES,...


  • india FlashIntel Full time

    Role Overview FlashIntel is seeking a dedicated and innovative Research Engineer with a focus on Multimodal Large Language Models (m-LLMs), Text-to-Speech (TTS) technologies, and agentic workflows. This position offers a unique opportunity to engage in cutting-edge research and development of AI solutions that integrate various data modalities and enhance...


  • India FlashIntel Full time

    Role Overview FlashIntel is seeking a dedicated and innovative Research Engineer with a focus on Multimodal Large Language Models (m-LLMs), Text-to-Speech (TTS) technologies, and agentic workflows. This position offers a unique opportunity to engage in cutting-edge research and development of AI solutions that integrate various data modalities and enhance...


  • India FlashIntel Full time

    Role Overview FlashIntel is seeking a dedicated and innovative Research Engineer with a focus on Multimodal Large Language Models (m-LLMs), Text-to-Speech (TTS) technologies, and agentic workflows. This position offers a unique opportunity to engage in cutting-edge research and development of AI solutions that integrate various data modalities and enhance...


  • India FlashIntel Full time

    Role Overview FlashIntel is seeking a dedicated and innovative Research Engineer with a focus on Multimodal Large Language Models (m-LLMs), Text-to-Speech (TTS) technologies, and agentic workflows. This position offers a unique opportunity to engage in cutting-edge research and development of AI solutions that integrate various data modalities and enhance...


  • India FlashIntel Full time

    Role OverviewFlashIntel is seeking a dedicated and innovative Research Engineer with a focus on Multimodal Large Language Models (m-LLMs), Text-to-Speech (TTS) technologies, and agentic workflows. This position offers a unique opportunity to engage in cutting-edge research and development of AI solutions that integrate various data modalities and enhance...


  • India Umbrella Micro Enterprises Full time

    Fullstack Product Engineer – AI-Powered Products (Langchain/ RAG /LLM applications, Ruby, React)We're looking for a high-ownership Fullstack Product Engineer to join our team in building and shipping cutting-edge AI-powered products at serious speed. Must-Haves:- Product-driven mindset: You make tactical decisions fast and care deeply about product...

  • Data Scientist

    3 days ago


    india C5i Full time

    T Job description We are seeking a highly skilledData Scientistwith a passion for AI, machine learning, and deep learning to join our dynamic team. The ideal candidate will have experience in generative models, LLMs, and advanced AI techniques, and will contribute to solving complex business challenges.Job Title:Data Scientist (Generative AI...


  • India Umbrella Micro Enterprises Full time

    Fullstack Product Engineer – AI-Powered Products (Langchain/ RAG /LLM applications, Ruby, React) We’re looking for a high-ownership Fullstack Product Engineer to join our team in building and shipping cutting-edge AI-powered products at serious speed. ✅ LLM / Langchain Equivalent experience : You've built real features or products using Langchain...