anadim / llm-benchmark-matrix
PublicCited 83-model x 49-benchmark LLM evaluation matrix with 18 matrix completion methods
A collection of AI model benchmark scores with a predictor tool to estimate missing results using advanced blending techniques.
How It Works
You hear about a free tool that gathers real test scores for dozens of AI models on challenges like math, coding, and reasoning.
Browse the list of popular AIs like GPT or Claude, and see tests covering knowledge, coding, math, and more.
Pick any model and test, like 'What would the latest GPT score on a coding challenge?', and instantly get a smart prediction.
See predictions for existing top models.
Input known results and unlock estimates for everything else.
Get a complete set of predicted scores, accurate to within a few points on average.
Now you can rank any model confidently, even with incomplete data, saving hours of research.
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.