
AI Agent Evaluation Analyst
2 hours ago
At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI.
The Mindrift platform, launched and powered by Toloka, connects domain experts with cutting-edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real-world expertise from across the globe.
Who we're looking for:
We’re looking for curious and intellectually proactive contributors, the kind of person who double-checks assumptions and plays devil’s advocate.
Are you comfortable with ambiguity and complexity? Does an async, remote, flexible opportunity sound exciting? Would you like to learn how modern AI systems are tested and evaluated?
This is a flexible, project-based opportunity well-suited for:
- Analysts, researchers, or consultants with strong critical thinking skills.
- Students (senior undergrads / grad students) looking for an intellectually interesting gig.
- People open to a part-time and non-permanent opportunity.
About the project:
We’re looking for QA contributors for a new project on autonomous AI agents, focused on validating and improving complex task structures, policy logic, and agent evaluation frameworks. Throughout the project, you’ll balance quality assurance, research, and logical problem-solving. This opportunity is ideal for people who enjoy looking at systems holistically and thinking through scenarios, implications, and edge cases.
You do not need a coding background, but you must be curious, intellectually rigorous, and capable of evaluating the soundness and consistency of complex setups. If you’ve ever excelled in things like consulting, CHGK, Olympiads, case solving, or systems thinking, you might be a great fit.
What you’ll be doing:
- Reviewing evaluation tasks and scenarios for logic, completeness, and realism.
- Identifying inconsistencies, missing assumptions, or unclear decision points.
- Helping define clear expected behaviors (gold standards) for AI agents.
- Annotating cause-effect relationships, reasoning paths, and plausible alternatives.
- Thinking through complex systems and policies as a human would to ensure agents are tested properly.
- Working closely with QA, writers, or developers to suggest refinements or edge case coverage.
How to get started:
Apply to this post, qualify, and get the chance to contribute to a project aligned with your skills, on your own schedule. Shape the future of AI while building tools that benefit everyone.
Requirements
- Excellent analytical thinking: Can reason about complex systems, scenarios, and logical implications.
- Strong attention to detail: Can spot contradictions, ambiguities, and vague requirements.
- Familiarity with structured data formats: Can read, though not necessarily write, JSON/YAML.
- Can assess scenarios holistically: What's missing, what’s unrealistic, what might break?
- Good communication and clear writing (in English) to document your findings.
We also value applicants who have:
- Experience with policy evaluation, logic puzzles, case studies, or structured scenario design.
- Background in consulting, academia, Olympiads (e.g., logic/math/informatics), or research.
- Exposure to LLMs, prompt engineering, or AI-generated content.
- Familiarity with QA or test-case thinking (edge cases, failure modes, “what could go wrong”).
- Some understanding of how scoring or evaluation works in agent testing (precision, coverage, etc.).
Benefits
- Get paid for your expertise, with rates that can go up to $38/hour depending on your skills, experience, and project needs.
- Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments.
- Participate in an advanced AI project and gain valuable experience to enhance your portfolio.
- Influence how future AI models understand and communicate in your field of expertise.