BrainBench is a dataset of brainteaser questions and evaluation tools that test large language models on commonsense reasoning failures humans rarely make.
How It Works
You stumble upon BrainBench, a clever set of brainteasers designed to reveal where smart AI assistants trip up on everyday reasoning that humans handle easily.
You download the collection of 100 tricky questions in English or Chinese, complete with answers and category explanations.
You choose which popular AI thinkers like Claude or GPT to test by linking them up simply.
With one go, you run the full test, letting each AI tackle the puzzles several times while a fair judge checks their answers.
You watch as the tests complete, building up results safely without overwhelming your setup.
Beautiful charts and tables appear, ranking AIs by accuracy and reliability, highlighting tough categories like hidden constraints.
You now have clear insights into AI reasoning strengths and gaps, perfect for sharing in reports or discussions.
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.