▷ [Immediate Start] LLM Reliability & Evaluation Engineer

17 hours ago

Mohali, Punjab, India XenonStack Full time

ABOUT XENONSTACK

XenonStack is the fastest-growing Data and AI Foundry for Agentic Systems, enabling enterprises to gain real-time and intelligent business insights. We deliver innovation through Agentic Systems for AI Agents, Vision AI Platform, and Inference AI Infrastructure for Agentic Systems. Our mission is to accelerate the world's transition to AI + Human Intelligence by making AI agents reliable, explainable, and enterprise-ready.

THE OPPORTUNITY

We are seeking an LLM Reliability & Evaluation Engineer to ensure that large language models (LLMs) and agentic AI systems meet enterprise-grade standards of accuracy, safety, and trustworthiness. This role focuses on evaluating, benchmarking, and stress-testing LLMs in real-world workflows, building frameworks for reliability, robustness, and continuous improvement. If you thrive at the intersection of AI research, applied testing, and responsible deployment, this is the role for you.

KEY RESPONSIBILITIES

Evaluation Frameworks
- Design and implement LLM evaluation pipelines covering accuracy, robustness, safety, and bias.
- Develop automated systems for benchmarking models on enterprise-relevant tasks.

Reliability Engineering
- Conduct stress tests, adversarial testing, and edge-case evaluations.
- Build tools to measure latency, consistency, and error recovery in multi-turn interactions.

Metrics & Monitoring
- Define KPIs such as factual accuracy, hallucination rate, toxicity, and compliance alignment.
- Establish real-time monitoring for drift, anomalies, and performance regressions.

Collaboration & Alignment
- Partner with ML engineers, product managers, and domain experts to align evaluation with business objectives.
- Work with Responsible AI teams to implement ethical, explainable, and compliant evaluation practices.

Continuous Improvement
- Feed insights from evaluation into fine-tuning, RLHF/RLAIF pipelines, and model selection.
- Maintain a central repository of test cases, benchmarks, and evaluation results.

Research & Innovation
- Stay current with state-of-the-art LLM evaluation techniques, from academic benchmarks to applied enterprise metrics.
- Explore automated evaluation using agentic test harnesses and synthetic data generation.

SKILLS & QUALIFICATIONS

Must-Have
- 3-6 years in AI/ML, NLP, or applied model evaluation.
- Strong understanding of LLM architectures, prompt engineering, and failure modes.
- Hands-on with evaluation frameworks (Eval harnesses, Ragas, OpenAI Evals, DeepEval).
- Proficiency in Python and libraries like LangChain, LangGraph, LlamaIndex, and Hugging Face.
- Experience with vector databases, RAG pipelines, and knowledge graph integration.
- Familiarity with bias/fairness testing and Responsible AI frameworks.

Good-to-Have
- Experience with reinforcement learning (RLHF, RLAIF) and reward modeling.
- Exposure to agentic evaluation frameworks (multi-agent stress testing, synthetic user simulators).
- Knowledge of compliance and safety requirements for BFSI, GRC, or SOC use cases.
- Contributions to open-source evaluation libraries or research papers.

WHY SHOULD YOU JOIN US?

- Agentic AI Product Company: Ensure reliability in cutting-edge AI platforms that are redefining enterprise adoption.
- A Fast-Growing Category Leader: Be part of one of the fastest-growing AI Foundries, powering Fortune 500 enterprises with trustworthy AI.
- Career Mobility & Growth: Grow into roles such as AI Systems Architect, Responsible AI Engineer, or Reliability Engineering Lead.
- Global Exposure: Work on enterprise-scale evaluation challenges across BFSI, Healthcare, Telecom, and GRC.
- Create Real Impact: Your evaluations will directly shape production-grade AI agents used in mission-critical systems.
- Culture of Excellence: Our values (Agency, Taste, Ownership, Mastery, Impatience, and Customer Obsession) empower you to innovate fearlessly.
- Responsible AI First: Join a company that prioritizes trustworthy, explainable, and compliant AI.

XENONSTACK CULTURE - JOIN US & MAKE AN IMPACT

At XenonStack, we believe in shaping the future of intelligent systems. We foster a culture of cultivation built on bold, human-centric leadership principles, where deep work, simplicity, and adoption define everything we do.

Our Cultural Values
- Agency - Be self-directed and proactive.
- Taste - Sweat the details and build with precision.
- Ownership - Take responsibility for outcomes.
- Mastery - Commit to continuous learning and growth.
- Impatience - Move fast and embrace progress.
- Customer Obsession - Always put the customer first.

Our Product Philosophy
- Obsessed with Adoption - Making AI accessible, reliable, and enterprise-ready.
- Obsessed with Simplicity - Turning complex evaluation challenges into seamless, automated frameworks.

Be part of our mission to accelerate the world's transition to AI + Human Intelligence by making AI agents not just powerful, but trustworthy and reliable.

LLM Programmer

2 weeks ago

Mohali, Punjab, India Antier Solutions Full time

HR54 FULL-TIME MOHALI 2-4 YEARS **Key Responsibilities**: - Research, develop, and fine-tune large language models (e.g., GPT, BERT, T5) to solve complex NLP tasks, such as text generation, sentiment analysis, question answering, and more. - Experiment with different model architectures, hyperparameters, and training techniques to optimize model...
Evaluator

6 days ago

Mohali, India CM AUTO SALES PVT LTD Full time

**Job description** Job Responsibilities: Technically evaluate cars/old cars. Submit a report for each car according to the defined process. Estimate the repair work needed by the used cars. Estimate the market price of the cars based on the inspection. **We are looking for**: **Minimum 3-10 yrs of relevant experience (car driven skills+DL...
Immediate Start! Data Analyst

2 weeks ago

Mohali, Punjab, India Skilllabs Full time

Roles and Responsibilities. Data Collection and Organization: Collect and store data on sales numbers, market research, logistics, linguistics, or other behaviors. Data Analysis: Analyze data using statistical techniques and machine learning exercises to identify trends, patterns, and correlations. Reporting and Visualization: Generate reports and present findings to...
Prompt Engineer

1 week ago

Mohali, Punjab, India Webdigitalblog Full time ₹ 12,00,000 - ₹ 36,00,000 per year

We are seeking a Prompt Engineer to design, test, and refine prompts that enhance the performance of Large Language Models (LLMs) such as OpenAI GPT, Anthropic Claude, and Google Gemini. The ideal candidate will combine analytical thinking with creativity to build effective, reliable, and engaging AI interactions across our products and workflows. Key...
Fresher ! Graduates ! Immediate Joiners!

2 days ago

Mohali, Punjab, India BSCJ Enterprises Full time

Hiring for Freshers ! Graduates ! Immediate Joiners Required International Voice Process Rotational Shifts # US Shifts **Qualifications**: Graduation or Undergraduate 3 Years Diploma in any field **Requirements**: Excellent Communication Skills ( Fluent English) Knowledge of BPO Freshers / Experienced both can apply References will be...
[Immediate Start] Staff Nurse

17 hours ago

Mohali, Punjab, India Max Healthcare Full time

JOB DESCRIPTION I JOB DETAILS Job Title Staff Nurse Administrative Reporting CNO Functional Reporting Respective Nursing Heads Direct Reports II JOB PURPOSE To assist in delivering high quality nursing care in the hospital III KEY RESPONSIBILITIES Core Responsibilities - Nursing Knowledge Adaptability Awareness of the departmental vision mission objectives...
Tele Caller- Immediate Joining

6 days ago

Mohali, Punjab, India hightech batteries pvt ltd Full time

**Immediate Joining** - **Freshers and experienced candidates are both welcome** - **Extra earning** - **Office location: Sector 70** No money needs to be deposited. Starting salary from 10000 to 15000. **Job Types**: Full-time, Permanent. Pay: ₹10,000.00 - ₹15,000.00 per month. Work Location: In person
▷ (Immediate Start) Consultant

4 weeks ago

Mohali, Punjab, India PHFI Full time

The Public Health Foundation of India (PHFI) is working towards building a healthier India. It is helping to address the limited institutional and systems capacity in India by strengthening education and training, advancing research and technology, and facilitating policy and practice in the area of Public Health. PHFI is headquartered in New Delhi with national...
Used Car Evaluator

2 weeks ago

Ludhiana, Punjab, India Adyan Consultants Pvt Ltd Full time

A used car evaluator inspects and assesses used cars, and determines their market value. They also prepare documentation and reports, and collaborate with sales and marketing teams. Responsibilities - Examine the condition of used cars, including the engine, body, interiors, and structural damage - Determine the market value of used cars - Prepare evaluation...
Immediate Hiring Freight Broker

2 weeks ago

Mohali, India immensity logistics Full time

Immensity Logistics is hiring for the below-mentioned position: Freight Broker. We require a minimum of 3-6 months of relevant experience for the above-mentioned profile. Immediate joiners are preferred. References are most welcome. **Salary**: ₹35,000.00 - ₹85,000.00 per month **Benefits**: - Health insurance Schedule: - Night shift Supplemental...
