Mathematical Reasoning Evaluator

3 days ago


Remote, India | Weekday | Full time | US$ 12,400 - US$ 20,800 per year

This role is for one of our clients

We are seeking highly qualified mathematicians to join a pioneering AI research initiative in partnership with a leading artificial intelligence lab. As a Mathematical Reasoning Evaluator, you'll leverage your deep understanding of mathematics to assess, refine, and enhance the reasoning accuracy of AI systems tackling complex quantitative and theoretical problems.

This opportunity is ideal for PhD scholars, postdoctoral researchers, or advanced Master's graduates who are passionate about applying their analytical expertise to advance the capabilities of next-generation AI models.

Focus Areas

We're looking for experts across a range of mathematical and applied disciplines, including:

Pure Mathematics: Real Analysis, Complex Analysis, Functional Analysis, Algebra, Geometry, Number Theory, Combinatorics

Applied Mathematics: Differential Equations (ODEs, PDEs), Dynamical Systems, Mathematical Physics, Optimization, Numerical Methods

Statistical & Computational Fields: Probability Theory, Stochastic Processes, Graph Theory, Data-driven Modeling

Key Responsibilities

Assess the accuracy, rigor, and logical flow of AI-generated mathematical reasoning and problem solutions.

Evaluate structured tasks and datasets based on predefined rubrics to ensure conceptual clarity and precision.

Verify correctness of intermediate steps, proofs, and analytical arguments.

Identify reasoning errors, gaps, or ambiguities and provide constructive, evidence-based feedback.

Collaborate asynchronously with a global community of experts using dedicated evaluation tools.

Contribute to defining benchmarks that measure AI model performance in mathematical reasoning and abstraction.

Qualifications

PhD (completed or in progress) or Master's degree in Mathematics, Applied Mathematics, or a closely related STEM discipline.

Strong command of graduate-level mathematics and logical reasoning.

Excellent analytical, problem-solving, and written communication skills.

Ability to evaluate complex mathematical content with precision and clarity.

Self-motivated and comfortable working in a remote, asynchronous environment.

Engagement Details

Type: Part-time (approx. 20 hours/week)

Location: Fully remote

Schedule: Flexible and asynchronous

Compensation

Hourly Rate: USD $20–$30/hour (based on expertise and experience)

Contract Type: Independent contractor

Payments: Processed weekly via Stripe Connect

Key Skills

Mathematical Analysis | Logical Evaluation | Proof Verification | Data Annotation | Research & Documentation | AI Model Assessment

We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are made by humans. If you would like more information about how your data is processed, please contact us.


