InteractiveBench provides interactive benchmarks to evaluate AI models on math reasoning, situation puzzles, trust games, and poker through simulated competitions.
How It Works
You find this fun collection of AI challenges on a code sharing site, perfect for testing how smart different AIs are at puzzles, math, games, and trust dilemmas.
Link up your favorite AI models from a service so they can join the challenges and show their skills.
Choose what to test them on, like tricky math problems, brain-teaser situations, poker showdowns, or trust games.
Hit start and watch the AIs compete head-to-head, asking questions, solving puzzles, or bluffing in games.
Check live updates, scores, graphs, and stats as each AI battles it out round by round.
Review who won, their strategies, and insights into which AI thinks deepest in interactive scenarios.
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.