- AI Evals Engineer

2 weeks ago


Bengaluru, Karnataka, India Pibit Full time ₹ 12,00,000 - ₹ 36,00,000 per year

Description :

About :

is transforming the underwriting landscape with Generative AI.

Our SaaS solutions help US-based insurance companies make smarter, faster decisions by optimizing underwriting processes, reducing risk, and improving premiums.

Were hiring an AI Evals Engineer to lead the systems that measure and maintain our AIs clarity, accuracy, and trustworthinesswhile directly connecting insights from real customer use.

Youll build gold-standard test sets, automate both offline and online evaluations, trace customer interactions end-to-end, and integrate quality signals into our product and release pipelinesenabling us to move quickly while preserving trust.

Position Overview :

As an AI Evals Engineer, youll build the evaluation, monitoring, and quality infrastructure that ensures our AI systems stay accurate, reliable, and customer-trusted.

Youll collaborate closely with ML engineers, product, and customer teams to design gold-standard test sets, automate eval pipelines, trace customer queries, and wire quality signals into our release process.

This role is ideal for someone who wants to grow as an applied ML/LLM engineer with a deep focus on evaluation, observability, and continuous improvement.

Key Responsibilities :

- Collaborate with ML and product engineers to design and implement evaluation and observability systems for AI models.

- Build automated o?ine and online eval pipelines for key use cases (RAG, agents, chat, extraction).

- Develop and maintain gold-standard datasets, synthetic/adversarial test cases, and regression suites in CI/CD.

- Define and track LLM quality metrics such as factuality, grounding precision/recall, latency, and cost.

- Instrument end-to-end tracing of customer queries across retrieval, inference, and post-processing to debug and improve quality.

- Partner with Customer Success and Support teams to translate feedback into structured QA signals and test updates.

- Run and analyze A/B tests and model/prompt experiments, ensuring statistical rigor and measurable improvements.

- Integrate evaluation and monitoring signals into deployment pipelines to prevent regressions and enforce release quality gates.

- Build dashboards and visibility tools that surface model performance trends by feature, prompt, and version.

- Contribute to documentation, evaluation governance, and best practices for model updates and prompt changes.

Technical Requirements :

- LLM & ML : GPT, Claude, Gemini, Mixtral, Llama, Hugging Face OSS models

- LLMOps & Evaluation : OpenAI Evals, LangSmith, LangChain, LangGraph, MLflow, LangFuse, DeepEval, LlamaIndex, SageMaker, AWS Bedrock, Azure AI

- Databases : PostgreSQL, MongoDB, Pinecone, ChromaDB

- Cloud : AWS, Azure

- DevOps & Monitoring : Kubernetes, Docker, OpenTelemetry, Datadog, Honeycomb

- Languages : Python, SQL, JavaScript

- Certifications (Bonus) : AWS Machine Learning Specialty, AWS Solutions Architect Professional, Azure Solutions Architect Expert

What You'll Do :

- Build, automate, and maintain LLM evaluation pipelines and gold datasets for our AI products.

- Establish quantitative quality metrics and acceptance criteria for production LLM systems.

- Implement observability and tracing across AI workflows to detect regressions and ensure reliability.

- Work on real-world generative AI and NLP applications, particularly in high-trust domains.

- Collaborate with data, product, and engineering teams to close the loop between customer feedback and model quality.

- Gain hands-on experience with cloud ML infrastructure and modern LLMOps tooling.

- Contribute to improving model accuracy, safety, and trustworthiness through experimentation and data-driven evaluation.

What You Need to Succeed :

- Bachelors or Masters degree in Computer Science, Machine Learning, or a related field.

- Minimum 2 years of experience in ML, data science, or evaluation/QA roles.

- Strong understanding of ML fundamentals, deep learning, and LLM-based systems.

- Proficiency in Python and SQL for data analysis, model evaluation, and automation.

- Familiarity with LLMOps tools (LangChain, MLflow, SageMaker, etc.) and basic DevOps (Docker, Kubernetes).

- Curiosity, ownership, and a problem-solving mindsetyou thrive in ambiguous, high-impact environments.

- Excellent communication and collaboration skills to work cross-functionally and drive quality improvements end-to-end.

Why Join Us :

- Work directly with experienced founders and senior engineers.

- Get hands-on mentorship in advanced ML and LLMOps.

- Be part of a high-energy team that values learning and innovation.

- Contribute to building AI-first products shaping the future of insurance tech.

- Enjoy a culture that celebrates both hard work and growth


  • AI Evals Engineer

    4 days ago


    Bengaluru, Karnataka, India Docket Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    About The RoleWe're hiring anAI Evals Engineerto own the evaluation and observability systems that keep our AI clear, accurate, and trustworthywhile closing the loop with customers. You'll design gold‑standard test sets, automate offline/online evaluation,trace customer queries end‑to‑end, and wire quality signals into our product and release process...


  • Bengaluru, Karnataka, India eeKee AI Full time ₹ 8,00,000 - ₹ 24,00,000 per year

    Company DescriptionEekee AI is an AI-driven life coach designed to help employees feel grounded at work and build a lasting sense of purpose. Rooted in Ikigai and Viktor Frankl's logotherapy, Eekee uses daily conversations and psychometric signals to identify strengths and values, suggesting resources like books, courses, and team rituals. Eekee supports HR...

  • AI Engineer

    1 week ago


    Bengaluru, Karnataka, India Plum Benefits Private Limited Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    AI Automation Engineer (Internal AI Lead)Location: Bengaluru (India) About PlumPlum is re-imagining employee healthcare & insurance benefits for fast-growing Indian businesses. We combine modern insurance products, primary, preventive care and data-driven claims to protect 5,000+ companies and 1 million+ lives today. Our next milestone—10 million lives by...


  • Bengaluru, Karnataka, India Quash Full time ₹ 20,00,000 - ₹ 25,00,000 per year

    About the RoleWe're looking for a hands-on Applied AI Engineer who lives and breathes LLMs. This isn't a research role — you'll be solving real-world problems, shipping features, and building production-ready systems that make our agent smarter every week.What You'll DoBuild LLM-powered systems: prompt chains, retrieval pipelines, eval frameworksWork...

  • Platform Tech Lead

    2 weeks ago


    Bengaluru, Karnataka, India Sarvam AI Full time ₹ 20,00,000 - ₹ 25,00,000 per year

    Platform Tech LeadCompany Overview: is a pioneering generative AI startup headquartered in Bengaluru, India. Our mission is to make generative AI accessible and impactful for Bharat. Founded by a team of AI experts, is dedicated to developing cost-effective, high-performance AI agents tailored for the Indian market, enabling enterprises to tap into new...


  • Bengaluru, Karnataka, India blue yonder Full time ₹ 20,00,000 - ₹ 25,00,000 per year

    Job Description Scope: We are seeking a highly skilled AI/Prompt Engineer to design, implement, and maintain artificial intelligence (AI) and machine learning (ML) solutions for our organization. The ideal candidate will have a deep understanding of AI and ML technologies, as well as experience with data analysis, software development, and cloud...

  • AI Engineer

    3 days ago


    Bengaluru, Karnataka, India Weekday AI Full time ₹ 6,00,000 - ₹ 8,00,000

    This role is for one of the Weekday's clientsSalary range: Rs Rs ie INR 6-8 LPA)Min Experience: 0 yearsLocation: BangaloreJobType: full-timeWe are looking for a passionate and motivated AI Engineer to join our growing team. This role is ideal for individuals who are eager to build a career in artificial intelligence and machine learning. You will work...

  • AI Engineer

    3 days ago


    Bengaluru, Karnataka, India Weekday AI Full time ₹ 60,00,000 - ₹ 80,00,000 per year

    This role is for one of the Weekday's clientsSalary range: Rs Rs ie INR 6-8 LPA)Min Experience: 0 yearsLocation: BangaloreJobType: full-timeWe are looking for a passionate and motivated AI Engineer to join our growing team. This role is ideal for individuals who are eager to build a career in artificial intelligence and machine learning. You will work...

  • AI Engineer

    2 days ago


    Bengaluru, Karnataka, India Weekday AI Full time ₹ 6,00,000 - ₹ 8,00,000 per year

    This role is for one of the Weekday's clientsSalary range: Rs Rs ie INR 6-8 LPA)Min Experience: 0 yearsLocation: BangaloreJobType: full-time We are looking for a passionate and motivated AI Engineer to join our growing team. This role is ideal for individuals who are eager to build a career in artificial intelligence and machine learning. You will work...

  • AI Engineer

    2 weeks ago


    Bengaluru, Karnataka, India Amogha AI Full time ₹ 9,00,000 - ₹ 12,00,000 per year

    About Amogha AIAt Amogha AI, we are building India's first voice-led conversational AI app specifically for mental health, therapy, and emotional well-being. Our mission is to provide a supportive, empathetic listener in your pocket, 24/7. We are creating a next-generation product that understands user context, provides therapy-grade support, and guarantees...