Messy repo filled with messy tests about hardware and LLMs. Built for me, public for you.
A public archive of messy benchmark results and analysis comparing cloud and local AI models on practical tasks like code auditing, financial memos, and business writing.
How It Works
You stumble upon this collection of real-world tests comparing different AI helpers on everyday jobs like reviewing code changes or writing business notes.
Skim the main guide and quick charts to see which AI works best for simple questions like 'which one for coding?'.
Check the head-to-head scores and see clear advice on picking the right AI for your tasks, like safer fact-checking or faster summaries.
Explore folders with full test results, like AI reviewing dozens of code updates or building investment reports.
See perfect audits from big online AIs.
Review tests on smaller AIs you run yourself.
Use the summary tables to decide the best AI for your needs, like reliable research or quick edits.
You now know exactly which AI fits your work, backed by real tests and easy guides.
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.