AI Evals

2 weeks ago

New Delhi, India BharatGen Full time

Job Summary: We are looking for an AI Evaluation & Test Engineer to join our growing team to ensure that our generative AI models and applications are safe, accurate, trustworthy, and deliver an elegant user experience. You will serve as the first customer of our AI systems. This role is ideal for product-minded engineers who obsess over product quality and customer-centricity, and are passionate about shaping the behavior of AI systems in the real world.

Key Responsibilities:
- Build and maintain AI evaluation pipelines to test, measure, and evaluate the behavior and performance of AI systems.
- Implement traces, spans, and session tracking for observability, and identify error propagation in multi-step pipelines.
- Define AI quality metrics and KPIs around factuality, faithfulness, toxicity, grounding precision/recall, latency, cost, etc., with clear acceptance bars.
- Implement evaluation and testing automation to enable end-to-end system and regression testing at scale.
- Define criteria for and implement release gates in the CI/CD pipeline.
- Find creative ways to break products.
- Assist in root cause analysis and troubleshooting of bugs and field issues.
- Collaborate with cross-functional teammates from product, engineering, linguistics, and customer support to shape human-AI interaction paradigms and ensure that our AI models and applications deliver the desired outcome and user experience.

Minimum Qualifications and Experience: Bachelor’s or Master’s degree in CS/CE/IT/EE/E&TC or related fields with 5+ years of experience in manual and automation testing of software products, with at least 2 years in evaluating and testing AI/ML products.

Required Expertise:
- Strong software testing fundamentals and expertise in writing test plans, executing test cases, and generating detailed reports and dashboards.
- Strong analytical and debugging skills, and attention to detail.
- Proficiency in Python, scripting, and software testing automation frameworks and tools such as Pytest, Selenium, Robot Framework, etc.
- Working knowledge of generative AI models, AI agents, and related concepts such as retrieval augmented generation (RAG), prompt engineering, context engineering, explainability, traceability, observability, guard rails, reasoning, specificity, etc.
- Sound understanding of the fundamental differences in the approach for testing conventional software versus evaluating generative AI systems.
- Team player with excellent interpersonal skills and the ability to collaborate effectively with remote and cross-functional team members.
- Go-getter attitude and ability to flourish in a fast-paced, startup environment.

Experience in any of the following would be a big plus:
- AI evaluation frameworks such as Arize, Braintrust, DeepEval, LangSmith, Ragas.
- AI safety and red teaming experience, e.g., prompt injection, jailbreak, adversarial and stress testing.
- Different types of AI evaluation methods, e.g., Human-in-the-loop, LLM-as-a-Judge.

Senior AI Developer

4 weeks ago

New Delhi, India SoluLab Full time

Founding AI Engineer (Senior) About the Role: We’re a stealth AI startup in Ahmedabad. As Founding AI Engineer, you will design and ship LLM-powered agents and RAG pipelines with rigorous evals, safety, and reliability, then shape the platform architecture across BE/FE and help hire the AI team. Read this first: This is a startup role. In intense weeks,...
Generative AI Senior Engineer

3 weeks ago

New Delhi, India ACL Digital Full time

Hi All, we are hiring for a Senior GEN AI Engineer role for the Bangalore location. Work Experience: 6 to 10 years. - Design, develop, and productionize GenAI applications focused on solving API integration challenges, encompassing all phases of development. - Ideation and data-driven experimentation & evaluation - Design, develop, and implement...
Generative AI Senior Engineer

3 weeks ago

New Delhi, India ACL Digital Full time

Hi All, we are hiring for a Senior GEN AI Engineer role for the Bangalore location. Work Experience: 6 to 10 years. - Design, develop, and productionize GenAI applications focused on solving API integration challenges, encompassing all phases of development. - Ideation and data-driven experimentation & evaluation - Design, develop, and implement microservices...
Engineering Manager

3 weeks ago

New Delhi, India Taggd Full time

Why this role: We’re building an enterprise‑grade Agentic AI platform & applications for recruitment, from sourcing and screening to interview assistance and offer orchestration. You’ll lead a small, high‑leverage team that ships fast, measures rigorously, and scales responsibly. What you’ll do: Own delivery end‑to‑end: backlog, execution, quality,...
Lead Applied AI Engineer

2 weeks ago

New Delhi, India Taggd Full time

Lead Applied AI Engineer. Location: Gurgaon. Function: Engineering (Applied AI). Reports to: CTO. Team: 2-3 AI engineers. Experience: 8–10 years (majority in Applied AI/LLMs; solid traditional ML). Why this role: We’re building agentic AI for recruitment workflows: sourcing, screening, interview assistance, and offer orchestration. You’ll own LLM/agent design,...
Lead Applied AI Engineer

1 week ago

New Delhi, India Taggd Full time

Lead Applied AI Engineer. Location: Gurgaon. Function: Engineering (Applied AI). Reports to: CTO. Team: 2-3 AI engineers. Experience: 8–10 years (majority in Applied AI/LLMs; solid traditional ML). Why this role: We’re building agentic AI for recruitment workflows: sourcing, screening, interview assistance, and offer orchestration. You’ll own LLM/agent design,...
Lead Applied AI Engineer

3 weeks ago

New Delhi, India Taggd Full time

Lead Applied AI Engineer. Location: Gurgaon. Function: Engineering (Applied AI). Reports to: CTO. Team: 2-3 AI engineers. Experience: 8–10 years (majority in Applied AI/LLMs; solid traditional ML). Why this role: We’re building agentic AI for recruitment workflows: sourcing, screening, interview assistance, and offer orchestration. You’ll own LLM/agent design,...
Lead Applied AI Engineer

3 weeks ago

New Delhi, India Taggd Full time

Why this role: We’re building agentic AI for recruitment workflows: sourcing, screening, interview assistance, and offer orchestration. You’ll own LLM/agent design, retrieval, evaluation, safety, and targeted traditional ML models where they outperform or complement LLMs. What you’ll do: Hands-on AI (70–80%): design & build agent workflows (tool use,...
Lead QA – Automated Testing

1 week ago

New Delhi, India RapidBrains Full time

Lead QA – Automated Testing & AI Validation. Experience: 6+ Years. Employment Type: Contract. Notice Period: Immediate. Role Overview: We are seeking a Lead QA – Automated Testing & AI Validation with strong expertise in Python automation, LLM evaluation, RAG pipelines, observability, adversarial testing, and Azure monitoring. This role will lead quality...
Agentic AI Engineer

3 weeks ago

New Delhi, India Intellectt Inc Full time

Agentic AI Engineer. Location: Hyderabad – Onsite. Notice Period: Immediate Joiners Preferred. Role Overview: Design, build, and operationalize agentic AI systems for enterprise, healthcare-focused applications using LangChain, LangGraph, and multi-agent architectures. Build autonomous agents for decision-support, RAG workflows, and intelligent orchestration in...
