Apply Now Principal LLM

3 weeks ago


India Oracle Full time

Job Description We are looking for a senior engineer who specializes in LLM systems, prompt engineering, and agentic application deployment, combined with strong MLOps and cloud platform engineering experience. You will design, deploy, and scale Generative AI models, retrieval-augmented generation (RAG) pipelines, and autonomous agent frameworks on OCI. In this role, you'll work closely with data scientists, platform architects, and research teams to build production-grade AI systems, including: - LLM finetuning and adaptation - Prompt and prompt-chain optimization - Multi-agent orchestration frameworks - Automated evaluation and guardrail systems - Model + data drift monitoring and continuous retraining workflows You will be a key contributor to defining our AI platform architecture, ensuring operational scale, efficiency, security, and reliability. Your responsibilities will include: LLM & Agentic Development: - Design, evaluate, and optimize prompts, prompt chains, and agent behaviors. - Build and deploy RAG systems, vector search pipelines, and knowledge-grounding layers. - Develop agent orchestration workflows using frameworks like LangChain, LlamaIndex, Guidance, or AG2. - Integrate LLMs with external tools, APIs, and internal business systems. LLMOps & Platform Engineering: - Deploy and host open-source and proprietary LLMs on OCI (e.g., GPT, Llama, Mistral, Grok). - Implement automated evaluation frameworks to measure truthfulness, relevance, safety, latency, and cost. - Manage fine-tuning, LoRA adaptation, or embedding model selection. Data Pipeline & Quality: - Build pipelines that ensure data freshness, traceability, and semantic relevance for downstream LLM tasks. - Use data validation frameworks (e.g., Great Expectations, Evidently) to detect drift or knowledge degradation. Observability, Monitoring & Cost Optimization: - Track LLM system performance, token usage, latency, and operational anomalies. - Implement model guardrails, safety layers, and automated fallback behavior. Collaboration & Mentorship - Work directly with Data Science + Product to translate domain problems into LLM+Agent architectures. - Mentor engineers and scientists on LLM deployment, prompt strategy, and evaluation methods. - Work closely with architects, product teams, data engineers, and other stakeholders to deliver end-to-end AI solutions that address business needs. Technical Skills: - Strong Python engineering background. - Experience with LLMs, RAG pipelines, or agent frameworks (LangChain, LlamaIndex, Haystack, AG2, etc.). - Hands-on cloud infrastructure experience (OCI, AWS, GCP, or Azure). - Experience with vector databases (e.g., Chroma, Pinecone, Weaviate, Milvus, PGVector). - Experience with Kubernetes, Docker, and CI/CD automation. Nice to Have: - Experience fine-tuning or adapting LLMs (e.g., LoRA, QLoRA, RLHF, supervised finetuning). - Prompt evaluation and automated testing frameworks (e.g., RAGAS, TruLens, DeepEval). - Experience deploying microservices architectures in production environments. Qualifications: - 8+ years of experience in software engineering, machine learning engineering, or platform engineering, with at least 2+ years focused on ML/AI systems in production. - Hands-on experience developing or deploying Large Language Model (LLM) systems, including prompt engineering, RAG pipelines, agent-based workflows, or LLM fine-tuning. - Strong proficiency in Python and experience with one or more LLM/agent frameworks (e.g., LangChain, LlamaIndex, Haystack, Guidance, AG2). - Experience designing and operating cloud-native ML systems on OCI, AWS, GCP, or Azure. - Proficiency with Kubernetes, Docker, and CI/CD pipelines for deploying and scaling services. - Experience with data workflow orchestration (e.g., Airflow, Prefect, Dagster) and data validation frameworks (e.g., Great Expectations, Evidently). - Strong understanding of vector databases (e.g., Pinecone, Weaviate, Milvus, Chroma, Postgres + pgvector). - Demonstrated ability to build and maintain production monitoring, alerting, and observability dashboards (e.g., Prometheus, Grafana). - Excellent communication and collaboration skills with the ability to mentor and lead technical discussions. - Bachelor's or master's degree in computer science, engineering, or a related field, or equivalent practical experience. Career Level - IC4


  • Principal Engineer

    1 hour ago


    Hyderabad, India GrowthAXL Full time

    Job Description Principal Engineer Service Now Now Assist AI Agents Role Job Description - Principal Engineer for ServiceNow focusing on Now Assist AI Agents - Lead the development of modern Service Management and Automation solutions, primarily in Healthcare - Design, develop, and own AI agent platform components, including prompt engineering, integrations,...


  • India EPM Solutions Full time

    Principal Consultant - Solutions Architecture EPM Solutions | Remote | Immediate Start About EPM Solutions EPM specialist and Jedox Partner of the Year 2025 (Asia). We deliver Enterprise Performance Management solutions to major enterprises with a proven track record and a growing client base. The Opportunity Principal Consultant opportunity for an...


  • Bengaluru, India Oracle Full time

    Job Description Job Description Oracle Cloud Infrastructure blends the speed of a startup with the scale of an enterprise leader. Our Generative AI Solutions team builds advanced AI solutions that run on powerful cloud infrastructure tackling real-world, global challenges. As part of this team, you'll contribute to large-scale cloud solutions utilizing...


  • Bengaluru, India Microsoft Full time

    Job Description Within AI Platform, theGenAI Systemsteam inAzure AIis seeking aPrincipal Applied Scientist to lead the development ofcustomized LLM solutionsfor diverse customer needs. This role focuses onfine-tuning, distillation, synthetic data generation, and custom model development, enabling enterprises to build optimized, scalable AI systems. Why Join...


  • Bengaluru, India Microsoft Full time

    Job Description Microsoft Ads powers one of the world's largest digital advertising ecosystems, delivering billions of recommendations every day. We are seeking a Principal Applied Scientist to define and advance R&D of retrieval, matching, ranking, and generation algorithms across our ad recommendation systems. In this role, you will shape the science and...


  • Bengaluru, India Microsoft Full time

    Job Description Overview Microsoft Ads powers one of the world's largest digital advertising ecosystems, delivering billions of recommendations every day. We are seeking a Principal Applied Scientist to define and advance R&D of retrieval, matching, ranking, and generation algorithms across our ad recommendation systems. In this role, you will shape the...


  • Hyderabad, India Microsoft Full time

    Job Description Overview Security represents the most critical priorities for our customers in a world awash in digital threats, regulatory scrutiny, and estate complexity. Microsoft Security aspires to make the world a safer place for all. We want to reshape security and empower every user, customer, and developer with a security cloud that protects them...


  • India SalaryGuide Full time

    🚀 About Us We’re building the career intelligence platform for modern professionals — starting with marketing. Today, job seekers rely on outdated and biased tools to evaluate employers. We’re changing that by collecting and structuring messy public data — jobs, salaries, org structures, tech stacks, and growth patterns — into clear, data-driven...


  • Pune, India Vsynergize AI Full time

    Job Description Job Title : Senior AI Engineer Location : Pune, Maharashtra Experience : 3+ Years Employment Type : Full-Time About The Role We are seeking a highly skilled and motivated Senior AI Engineer to join our innovative technology team. This role offers the opportunity to work on next-generation Artificial Intelligence and Machine Learning...


  • India People Prime Worldwide Full time

    Important Note (Please Read Before Applying) 🚫 Do NOT apply if: • You have less than 8 years or more than 12 years of total experience. • You do not have hands-on experience in ML/AI application development. • You have no experience in leading teams or production deployments. • You are not familiar with LLMs / Generative AI concepts (RAG,...