gameworld-project / gameworld
PublicGameWorld: Towards Standardized and Verifiable Evaluation of Multimodal Game Agents
GameWorld is a benchmark that tests AI agents on playing 34 browser-based games by analyzing screenshots and using keyboard or mouse controls.
How It Works
You hear about this fun benchmark where smart AIs try to play classic browser games like 2048, Flappy Bird, and Snake.
Create a cozy spot on your computer and link it to a few clever AI helpers so they can see and control the games.
Choose a simple game like 2048, pick an AI teammate, and see it slide tiles, merge numbers, and chase high scores right before your eyes.
Run tests on dozens of games at once, pitting different AIs against puzzles, runners, and platformers to find the champions.
Peek at a dashboard showing real-time scores, funny mistakes, and video replays of every dramatic moment.
Celebrate with charts and videos revealing which AI dominated the leaderboard and conquered the most games.
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.