🌿 Back to all jobs

🥝 Remote

24-MAG | New York, United States | Posted June 18, 2026

Job Description

We are sharing a specialised part-time consulting opportunity for professors, PhD students, and advanced academic researchers experienced in domain-specific problem design, Python-based evaluation, benchmark task development, and structured reasoning assessment.

This role supports current and upcoming remote consulting opportunities focused on academic benchmark task design, Python-based evaluation workflows, domain-specific problem development, golden solution preparation, model behavior analysis, and high-quality project execution. Selected professionals will apply their academic expertise to create challenging real-world tasks, define precise expected outputs, develop executable tests, and evaluate reasoning or problem-solving performance across advanced subject areas.

Key Responsibilities

Professionals in this role may contribute to:

Academic Task Design & Development

  • ...

Apply for This Position

Submit Application