GenAI Developer – LLMs, RAG

7 days ago


Cochin, Kerala, India Milestone Technologies, Inc. Full time ₹ 12,00,000 - ₹ 36,00,000 per year

GenAI Developer – LLMs, RAG & Knowledge Graphs (Neo4j)

Experience: 5–8 years total (2+ years in Generative AI)
Location: Kochi
Job Type: Full-time

Role Overview

We're looking for a hands-on GenAI Developer to design, build, and productionize LLM-powered applications—retrieval-augmented generation (RAG), conversational agents, document understanding, and decision support. You'll combine AWS Bedrock/SageMaker with open-source stacks and integrate a Neo4j knowledge graph to deliver reliable, explainable AI in a product environment.

What You'll Do

  • Ship end-to-end GenAI features: data ingestion → chunking/embedding → RAG pipelines → prompt orchestration → evals → deployment → monitoring.
  • Build robust RAG systems: hybrid retrieval (vector + keyword), query re-writing, reranking, citations, and grounding on enterprise data.
  • Model orchestration on AWS: Bedrock model selection (Claude, Llama, etc.), prompt flows, guardrails, and cost/perf tuning; SageMaker for custom training/fine-tuning (LoRA/QLoRA/PEFT).
  • Knowledge Graph (Neo4j): design schema/ontologies, build entity/relation extractors, create KG-augmented RAG, and run graph reasoning with Cypher & GDS.
  • Eval & quality: define automatic evals (faithfulness, answer relevance, toxicity, latency), A/B tests, and red-teaming; maintain prompt and dataset versioning.
  • Ops & reliability: implement LLMOps (observability, tracing, guardrails, rate/cost controls), CI/CD, IaC, and runtime monitoring.
  • APIs & apps: expose services via REST/GraphQL; build lightweight UIs or chat backends, integrate auth, and log interactions for continuous improvement.
  • Security & compliance: PII handling, prompt injection defense, output filtering, and enterprise IAM hygiene.

Core Skills & Tech Stack

Languages & Frameworks

  • Python (FastAPI), (nice-to-have), SQL
  • GenAI frameworks: LangChain / LlamaIndex / Haystack
  • Prompt tooling & observability: Langfuse, Weights & Biases (or equivalents)

LLMs & Modeling

  • Retrieval & embeddings: Bedrock embeddings, sentence-transformers, Cohere/HF embeddings
  • Fine-tuning/distillation: LoRA/QLoRA/PEFT, bitsandbytes, Hugging Face ecosystem
  • Rerankers & rewriters: cross-encoders, T5/FLAN or similar
  • Safety/guardrails: content filters, groundedness checks, regex/policy engines

AWS (Focus)

  • Bedrock (model access, guardrails), SageMaker (training, endpoints), ECR/ECS/EKS, Lambda, API Gateway, S3, CloudWatch, Step Functions
  • OpenSearch / Kendra for retrieval; RDS/ Aurora / DynamoDB for metadata/state
  • IAM, Secrets Manager, VPC for security & networking

Knowledge Graphs

  • Neo4j (schema/ontology design, Cypher, APOC), Graph Data Science (GDS)
  • Entity/relation extraction pipelines; KG-RAG patterns; graph-based reasoning and path explanations

Vector/Search

  • OpenSearch vector, pgvector, Milvus, or Pinecone (any 1–2 strongly)

MLOps / LLMOps

  • MLflow or SageMaker Model Registry; Docker; CI/CD (GitHub Actions/GitLab)
  • Tracing/telemetry, dataset & prompt versioning, cost/latency dashboards

Qualifications

  • Bachelor's/Master's in CS/EE/Math or equivalent practical experience
  • 5–8 years building data/ML systems; 2+ years specifically with LLMs/GenAI
  • Proven production deployments on AWS and experience with Neo4j in real use cases

  • Python Developer

    4 days ago


    Cochin, Kerala, India 17e6bb59-0c56-4589-8f9c-2a0b6ef7c227 Full time ₹ 2,00,000 - ₹ 12,00,000 per year

    Hiring Python Developer (GenAI & Agentic Frameworks). Strong in Python, OOP, clean code, LangChain, RAG pipelines, embeddings, vector DB, LLM orchestration, LangGraph/CrewAI/AutoGen, CI/CD.


  • Cochin, Kerala, India Focaloid Technologies Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Key Responsibilities:Design and develop GenAI applications leveraging LLMs (OpenAI, Anthropic, LLaMA, Mistral, etc.) and frameworks.Build and optimize RAG pipelines using vector databases (Pinecone, Weaviate, FAISS, Milvus, Chroma).Implement prompt engineering, fine-tuning, and model evaluation to improve accuracy and business relevance.Develop and deploy AI...


  • Cochin, Kerala, India Voxtron Solutions Full time ₹ 38,40,000 - ₹ 57,60,000 per year

    Job Title: Python DeveloperLocation: REMOTEAbout the Role:We are seeking an experienced Python Developer with deep expertise in AI workflows, AI Agents, and Voice AI Agents. The ideal candidate will design, develop, and deploy intelligent conversational systems capable of handling complex, multi-step business processes across multiple channels. The role...


  • Cochin, Kerala, India -a41e-4acb-b08e-809d83e8df2e Full time ₹ 6,00,000 - ₹ 12,00,000 per year

    Full Stack with Python,LangChain, AutoGen,Llama Index,LLM APIs,prompt engineering,microservices and RESTfulAPIs,,,or Angular, Azure,containerization,SQL/NoSQL,data pipelines,vector db,embedding models,AI agent frameworks,RAG

  • AI Developer

    7 days ago


    Cochin, Kerala, India Pearlsoft Technologies LLP Full time ₹ 96,00,000 per year

    We are looking for an experienced Python AI Developer with 4–7 years of hands-on experience in real-world AI/ML solutions. You will be responsible for building, deploying, and maintaining scalable AI systems in production, with a focus on Machine Learning, Deep Learning, LLMs, Agents, and NLP. This role demands strong software engineering fundamentals,...


  • Cochin, Kerala, India SayOne Technologies Full time ₹ 12,00,000 - ₹ 24,00,000 per year

    About Us:SayOne is a fast growing information technology and digital services company headquartered in India. We help our clients to become future ready by harnessing the power of new and emerging technologies. SayOne has delivered over 400+ projects to 75+ Start-ups and SMEs from North America, EU, Australia and the Middle East. Please feel free to know...


  • Cochin, Kerala, India NviSust Innovations Pvt. Ltd. Full time ₹ 1,20,000 - ₹ 2,40,000 per year

    WE ARE HIRING..Position: AI/ML & Automation Intern (Generative AI Focus)We're seeking passionate AI enthusiasts to join our team and work on cutting edge Generative AI and automation projects. This internship offers hands-on experience in developing LLMs, building automation workflows, and applying advanced ML concepts to real-world business challenges....


  • Cochin, Kerala, India SS Consulting Kochi Full time ₹ 10,00,000 - ₹ 25,00,000 per year

    We're Hiring: Mid-Level AI/LLM EngineerIf building the future of AI excites you, this role will feel like home.We're looking for someone who lives and breathesLLMs, LangChain, CrewAI, RAG, vector DBs—and wants to turn cutting-edge ideas into real-world impact.What makes this role exciting?You'll be working on: Building next-gen LLM applications Designing...


  • Cochin, Kerala, India Linnk Group Full time ₹ 5,00,000 - ₹ 12,00,000 per year

    Job Title: Python Developer – AI Platform (FastAPI & Vector DBs)Location: KochiType: Full-timeWe're looking for aSenior Python Developerto join our engineering team and help power real-time AI features within our SaaS platform. You'll play a key role in building scalable backend services usingFastAPI, integrating withvector databaseslike Pinecone or FAISS,...

  • Lead AI Engineer

    2 days ago


    Cochin, Kerala, India Experion Technologies Full time ₹ 15,00,000 - ₹ 20,00,000 per year

    Job Location: KochiRemoteTrivandrumExperience5+ YearsJob PurposeWork as Lead AI Engineer for US-based customer, focusing on AI agent development, LLM fine-tuning, and deploying scalable AI solutions on Google Cloud and Kubernetes infrastructure.Job Description / Duties and ResponsibilitiesDesign and deploy AI agents capable of task execution and...