hkust-nlp / LOCA-bench
PublicBenchmarking Language Agents Under Controllable and Extreme Context Growth
LOCA-bench is a testing playground that measures how well AI agents manage growing amounts of information across games, math, coding, and real-world tasks.
How It Works
You hear about a helpful tool that tests how well AI assistants handle really long chats and big piles of info without forgetting details.
Download the tool and set it up on your computer with a simple script that installs everything you need.
Connect your favorite AI service like a smart helper so it can join the tests.
Choose from easy short tests or super long ones up to giant novels worth of info to see how your AI copes.
Hit start and see your AI tackle puzzles, games, math problems, and real tasks while the info grows huge.
Check colorful charts and step-by-step replays showing exactly where your AI shines or struggles with length.
Now you know your AI's superpower for handling endless details, ready to build better smart helpers!
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.