Heartune / ROBOTheory-79k
The official repository of ROBOTheory-79k. ROBOTheory-79k is a large-scale, expert-curated dataset that contains 79,239 expert-level questions spanning 4 core domains (Mathematics Foundation, Mechanical Systems, Perception & Control, Electrical & Programming) and 24 sub-fields, available in Chinese, English, and French.
ROBOTheory-79k is an academic research project that tests whether AI models understand robotics engineering theory or only memorize patterns. It contains 79,239 expert-level questions spanning mathematics, mechanical systems, perception and control, and electrical and programming topics, along with scripts to evaluate any AI model on these questions. Researchers use it to measure AI capabilities, identify weaknesses, and track progress over time. Scoring relies on AI judges that evaluate complex answers, and the results are reported in detail and compared across models.
How It Works
The dataset is a collection of 79,239 expert-level robotics engineering questions that researchers created to test AI understanding.
The questions cover four core domains (Mathematics Foundation, Mechanical Systems, Perception & Control, Electrical & Programming) and 24 specialized sub-fields.
You choose which AI model you want to evaluate: any model that can chat, whether hosted by a cloud service or running on your own computer.
Connect to an AI service online and let it answer thousands of questions automatically.
Alternatively, use your own GPU to run open models and generate answers without an internet connection.
The chosen AI works through every robotics question, from multiple-choice to complex proofs and programming challenges.
Another AI acts as an expert teacher, carefully evaluating whether the answers are correct, partially correct, or need improvement.
You get a complete breakdown showing scores by topic, by question type, and overall, plus how your model compares to others on the leaderboard.
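The last two steps (judge verdicts, then a breakdown by topic and question type) can be illustrated with a minimal sketch. This is not the repository's evaluation script. The verdict labels, the credit assigned to each verdict, the record field names, and the sample records are all assumptions made for illustration.

```python
from collections import defaultdict

# Assumed credit per judge verdict; the project's actual rubric may differ.
VERDICT_CREDIT = {"correct": 1.0, "partial": 0.5, "needs_improvement": 0.0}


def summarize(records, key):
    """Return the average credit per group, grouping by the field `key`."""
    totals = defaultdict(float)
    counts = defaultdict(int)
    for rec in records:
        totals[rec[key]] += VERDICT_CREDIT[rec["verdict"]]
        counts[rec[key]] += 1
    return {group: totals[group] / counts[group] for group in totals}


def overall(records):
    """Return the average credit across all judged answers."""
    return sum(VERDICT_CREDIT[r["verdict"]] for r in records) / len(records)


# Hypothetical judged answers; field names are illustrative only.
records = [
    {"domain": "Mechanical Systems", "qtype": "multiple-choice", "verdict": "correct"},
    {"domain": "Mechanical Systems", "qtype": "proof", "verdict": "partial"},
    {"domain": "Perception & Control", "qtype": "programming", "verdict": "needs_improvement"},
    {"domain": "Perception & Control", "qtype": "multiple-choice", "verdict": "correct"},
]

by_domain = summarize(records, "domain")
by_type = summarize(records, "qtype")
overall_score = overall(records)

print(by_domain)
print(by_type)
print(overall_score)
```

Grouping by a field name keeps the same function usable for domains, sub-fields, or question types, so one pass over the judged records per grouping is enough to build the report.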