
LLM Application
20 hours ago
Job Description
Company Description
AI Guru builds innovative AI augmentation tools that significantly enhance the capabilities of elite professionals. Our mission is to democratize the use of AI superpowers, ensuring every ambitious professional can benefit from the advanced capabilities traditionally reserved for top consulting firms and Fortune 500 companies. Headquartered by industry veterans who have designed leading AI systems at esteemed organizations like Bloomberg, AWS, and Cerebras, AI Guru's tools are trusted by over 20,000 professionals globally, driving substantial career advancements and operational efficiencies.
Role Description
Design and implement the application layer that connects large language models (LLMs) to real-world data pipelines. You will build and maintain the orchestration logic that retrieves relevant context, feeds it to LLMs, and returns reliable, structured outputs for production systems.
Key Responsibilities
- Architect and maintain the end-to-end LLM orchestration pipeline (retrieval prompt construction model call post-processing).
- Create reusable prompt templates and dynamic context builders for diverse data sources.
- Develop deterministic post-processing and validation layers (schema enforcement, range/regex checks).
- Integrate LLM outputs into backend APIs and user-facing applications.
- Monitor and optimize LLM performance for latency, accuracy, and cost.
- Collaborate with backend, data, and QA teams to improve accuracy and robustness.
- Implement safeguards such as rate limiting, fallback strategies, and prompt versioning.
Required Skills & Experience
- Strong programming skills in Python or TypeScript/Node.js for production services.
- Hands-on experience with LLM frameworks (e.g., LangChain, LlamaIndex, or similar orchestration tools).
- Expertise in prompt engineering and structured output handling (e.g., JSON schemas).
- Familiarity with vector databases (Pinecone, Weaviate, pgvector, etc.) and retrieval strategies.
- Knowledge of CI/CD pipelines, containerization (Docker/Kubernetes), and cloud deployment (AWS/GCP/Azure).
- Strong testing habits for data- and prompt-driven applications.
Nice to Have
- Experience with unstructured data (documents, email, audio, etc.) or information extraction.
- Background in evaluation metrics for retrieval and generation (recall@k, F1, nDCG).
- Understanding of event-driven architectures and message queues (Kafka/SQS).
======
No head hunters please
-
Senior AI Engineer
2 weeks ago
India beBeeArtificial Full time US$ 1,04,000 - US$ 1,30,878Job DescriptionWe are seeking an experienced engineer to join our team and work on the development of Large Language Model (LLM) applications.The successful candidate will be responsible for translating raw model capability into lean, reliable, and user-ready features. This includes building MCP servers, architecting RAG pipelines, automating LLM...
-
LLM Backend Engineer
2 weeks ago
India Sparsa AI Full timeSparsa AI is a Singapore and Germany based Industrial-AI Startup, building the next generation of agentic AI platform to transform how physical industries—such as manufacturing and logistics—make decisions and optimize their operations. Our AI agents orchestrate complex workflows across business functions and enterprise applications including ERP, MES,...
-
AI LLM Research Engineer
3 days ago
india FlashIntel Full timeRole Overview FlashIntel is seeking a dedicated and innovative Research Engineer with a focus on Multimodal Large Language Models (m-LLMs), Text-to-Speech (TTS) technologies, and agentic workflows. This position offers a unique opportunity to engage in cutting-edge research and development of AI solutions that integrate various data modalities and enhance...
-
AI LLM Research Engineer
3 days ago
India FlashIntel Full timeRole Overview FlashIntel is seeking a dedicated and innovative Research Engineer with a focus on Multimodal Large Language Models (m-LLMs), Text-to-Speech (TTS) technologies, and agentic workflows. This position offers a unique opportunity to engage in cutting-edge research and development of AI solutions that integrate various data modalities and enhance...
-
AI LLM Research Engineer
4 days ago
India FlashIntel Full timeRole Overview FlashIntel is seeking a dedicated and innovative Research Engineer with a focus on Multimodal Large Language Models (m-LLMs), Text-to-Speech (TTS) technologies, and agentic workflows. This position offers a unique opportunity to engage in cutting-edge research and development of AI solutions that integrate various data modalities and enhance...
-
AI LLM Research Engineer
3 days ago
India FlashIntel Full timeRole Overview FlashIntel is seeking a dedicated and innovative Research Engineer with a focus on Multimodal Large Language Models (m-LLMs), Text-to-Speech (TTS) technologies, and agentic workflows. This position offers a unique opportunity to engage in cutting-edge research and development of AI solutions that integrate various data modalities and enhance...
-
AI LLM Research Engineer
4 days ago
India FlashIntel Full timeRole OverviewFlashIntel is seeking a dedicated and innovative Research Engineer with a focus on Multimodal Large Language Models (m-LLMs), Text-to-Speech (TTS) technologies, and agentic workflows. This position offers a unique opportunity to engage in cutting-edge research and development of AI solutions that integrate various data modalities and enhance...
-
Fullstack Product Engineer – AI, RAG, LLMs
3 weeks ago
India Umbrella Micro Enterprises Full timeFullstack Product Engineer – AI-Powered Products (Langchain/ RAG /LLM applications, Ruby, React)We're looking for a high-ownership Fullstack Product Engineer to join our team in building and shipping cutting-edge AI-powered products at serious speed. Must-Haves:- Product-driven mindset: You make tactical decisions fast and care deeply about product...
-
Data Scientist
3 days ago
india C5i Full timeT Job description We are seeking a highly skilledData Scientistwith a passion for AI, machine learning, and deep learning to join our dynamic team. The ideal candidate will have experience in generative models, LLMs, and advanced AI techniques, and will contribute to solving complex business challenges.Job Title:Data Scientist (Generative AI...
-
Fullstack Product Engineer
4 days ago
India Umbrella Micro Enterprises Full timeFullstack Product Engineer – AI-Powered Products (Langchain/ RAG /LLM applications, Ruby, React) We’re looking for a high-ownership Fullstack Product Engineer to join our team in building and shipping cutting-edge AI-powered products at serious speed. ✅ LLM / Langchain Equivalent experience : You've built real features or products using Langchain...