LLM Engineer

20 hours ago


Pune Maharashtra India, Maharashtra Bajaj Broking Full time

Why this role matters

We’re building production-grade GenAI systems at the core of trading, risk, and enterprise decision-making. This is not a demo lab or a prompt-only role—you’ll design and ship real LLM platforms used by traders, compliance teams, and leadership every day.

If you enjoy solving hard problems at the intersection of LLMs, distributed systems, security, and real-world constraints, this role gives you scale, ownership, and impact.


What you’ll build

As a Senior LLM Engineer, you’ll own the end-to-end lifecycle of Large Language Model–powered systems—from architecture to production operations.


You’ll work on:

  • LLM-native platforms for trading intelligence, risk signals, compliance automation, fraud detection, and enterprise knowledge systems
  • RAG and agentic workflows that integrate market data, internal systems, and secure tools
  • High-performance, cost-aware inference stacks running in hybrid and multi-cloud environments
  • Evaluation, safety, and governance frameworks that make LLMs reliable in regulated financial environments


What you’ll do

Architecture & Engineering

  • Design and ship end-to-end LLM systems: prompt orchestration, RAG pipelines, agents/tool-use, and workflow automation
  • Build low-latency, scalable APIs and microservices for inference and orchestration
  • Implement LLMOps/MLOps pipelines: model versioning, prompt CI/CD, experiment tracking, and automated A/B testing

Data & Retrieval Systems

  • Engineer secure ingestion pipelines for market data, trades, compliance records, and voice/text transcripts
  • Design vector search systems with smart chunking, hybrid retrieval (BM25 + embeddings), re-ranking, and auditability

Evaluation, Safety & Model Risk

  • Define how “good” looks: hallucination detection, task-level metrics, adversarial testing, and red-teaming
  • Build human-in-the-loop validation, bias/fairness checks, and explainability where required
  • Help formalize model risk management for LLMs in production

Security & Compliance (Done Right)

  • Work with Security and Risk teams to embed Zero Trust, least-privilege access, PII controls, and audit trails
  • Implement guardrails: content filters, policy enforcement, safe tool invocation, and full traceability

Observability & Cost Engineering

  • Instrument everything: latency, throughput, token usage, prompt drift, errors, and SLOs
  • Actively optimize cost using model selection, quantization, caching, batching, and autoscaling

Technical Leadership

  • Partner with Trading, Risk, Compliance, Infra, and Platform teams
  • Mentor engineers on LLM patterns, prompt engineering, and production best practices
  • Contribute to internal platforms, reusable components, and engineering standards


What we’re looking for

Experience

  • 5–10+ years in ML / platform / backend engineering
  • 3+ years building and operating production LLM systems


Strong hands-on skills in:

  • Languages: Python (primary), plus TypeScript / Go / Java
  • LLM frameworks: LangChain, LlamaIndex, Semantic Kernel, DSPy (or similar)
  • RAG systems: FAISS, Milvus, Pinecone; hybrid search; cross-encoder re-ranking
  • Model ecosystems: OpenAI / Azure OpenAI, Anthropic, Vertex AI; open-source (Llama, Mistral, Phi)
  • Inference optimization: vLLM, Triton, quantization (GPTQ/AWQ), batching, caching
  • LLMOps/MLOps: MLflow or W&B, model registries, CI/CD, feature stores
  • Cloud & infra: Docker, Kubernetes, Terraform, event-driven systems
  • Observability: Prometheus, Grafana, OpenTelemetry (metrics, logs, traces)
  • Security fundamentals: OAuth/OIDC, RBAC, secrets management, encryption, data governance


Why you’ll enjoy working here

  • Real production impact—no toy demos
  • Hard engineering problems with clear ownership and scale
  • Freedom to choose the right models and architectures, not just the trendy ones
  • A chance to define how GenAI is done responsibly in financial services


  • LLM Engineer

    3 weeks ago


    Mumbai, Maharashtra, India, Maharashtra Dimensionless Technologies Full time

    Company DescriptionDimensionless Technologies, established in 2016 in Mumbai, India, is a global IT consultancy and AI services provider. Committed to innovation, we leverage the power of artificial intelligence to transform businesses across industries, enhancing operational efficiency and driving growth. Our expert team of data scientists, machine learning...

  • LLM Ops Engineer

    2 days ago


    Pune, Maharashtra, India Pattern Full time

    Monitor, evaluate, and optimize AI/LLM workflows in production environments. Ensure reliable, efficient, and high-quality AI system performance by building out an LLM Ops platform that is self-serve for the engineering and data science departments. Key Responsibilities:-Collaborate with data scientists and software engineers to integrate an LLM Ops platform...

  • LLM Ops Engineer

    2 weeks ago


    Pune, Maharashtra, India Pattern® Full time ₹ 7,50,000 - ₹ 22,50,000 per year

    Monitor, evaluate, and optimize AI/LLM workflows in production environments. Ensure reliable, efficient, and high-quality AI system performance by building out an LLM Ops platform that is self-serve for the engineering and data science departments.Key Responsibilities:-Collaborate with data scientists and software engineers to integrate an LLM Ops platform...

  • Senior LLM Engineer

    4 days ago


    Pune, Maharashtra, India Infocusp Innovations Full time

    About the roleThe Senior LLM Engineer will lead the design, development, and deployment of complex agentic AI solutions within production environments, working with multi-agent frameworks and orchestrating large language models. The position is for scalable applied AI, with a focus on delivering robust and fair agentic systems that deliver organizational...

  • AI Engineer

    3 weeks ago


    Mumbai, Maharashtra, India, Maharashtra Kayana | Ordering & Payment Solutions Full time

    Job Title: AI Engineer (LLMs, Agentic Systems & Model Training)Location: MumbaiEmployment Type: Full-TimeExperience Level: Mid–SeniorAbout the RoleWe are seeking a highly skilled AI Engineer with deep expertise in Large Language Models (LLMs), AI Agents, and advanced retrieval and fine-tuning techniques. The ideal candidate has hands-on experience training...

  • GTM Engineer

    3 weeks ago


    Pune, Maharashtra, India, Maharashtra nRev Full time

    Why this role mattersNRev is in true zero-to-one territory: we’re building an AI-powered Revenue Orchestration platform that lets GTM teams spin up custom agents to automate, enrich, and accelerate every step of the enterprise sales cycle. We have early revenue, rabidly enthusiastic design-partners, and a awesome product.About the role:We are looking for a...

  • AI Tech Lead

    20 hours ago


    Pune, Maharashtra, India, Maharashtra Bright Matrix Global Full time

    Role OverviewWe are seeking an AI Tech Lead to architect and implement real-time AI systems, including LLM pipelines, voice automation, and knowledge-enhanced applications. The candidate should have strong hands-on AI/LLM experience and the ability to define best practices, scalable architectures, and system KPIs.Key ResponsibilitiesLead design of...


  • Pune, Maharashtra, India, Maharashtra LTIMindtree Full time

    Job Title: AI Engineer Experience: 5-8 YearsLocation: PuneNotice Period: Immediate to 15 DaysJob Role:Minimum 4 years IT experience Out of which minimum of two years' experience in Data Science and Azure Should have worked in at least one GenAI customer project or PoC with a recent LLM model Flexible to work in the shiftsPrimary skillsHandson in Prompt...


  • Pune, Maharashtra, India, Maharashtra AcquireX Full time

    Location: Viman Nagar, PuneAbout AcquireXBe part of the AcquireX team that unleashes the power of leading-edge technologies to help improve e-commerce processes in the e-commerce world.PurposeOwn our Generative AI technical vision. You will rapidly prototype and lead a dedicated team of two engineers to launch our company's first intelligent search and...


  • Pune, Maharashtra, India, Maharashtra Response Informatics Full time

    JD :Responsible for building agentic workflows using modern LLM orchestration framework to automate and optimize xomplex business process in the Travel domain.Individual contributor (IC), owning end to end development of intelligent agents and services that power customer experiences, recommendations and backend automation.Design and implement agentic and...