stevibe / BenchLocal
BenchLocal is a desktop app for running, comparing, and managing LLM Bench Packs.
BenchLocal is a desktop application for running, comparing, and managing standardized benchmarks to evaluate large language models.
How It Works
You hear about a friendly desktop app that makes it easy to test and compare how well different AI chatbots perform on real tasks.
Download the app to your computer and launch it for the first time; it sets up your personal testing space.
Connect the AI chat services you use, so the app can talk to them during tests.
Browse and add ready-made test collections, like challenges for math, tools, or following instructions.
Choose AIs to compare, tweak settings if you like, and start. Watch live progress as they tackle each challenge.
Review detailed results, rankings, and logs to understand strengths and weaknesses of each AI.
Celebrate finding the top-performing AI for your needs, with history saved for future comparisons.