LYiHub / Advanced-LLM-Tests
Public全维度的前沿大语言模型自动化评测套件。涵盖逻辑推理、智能体编程、网页特效代码生成以及百万Token级长文本解析(GPT-5.4 / Claude 4.7 / DeepSeek-V4 等)
This project provides automated scripts to evaluate and compare popular large language models on capabilities including logic reasoning, knowledge recall, creative writing, long-context understanding, web animation coding, and agent-based programming tasks.
How It Works
You find this handy collection of tests online that lets everyday folks see how different smart AI helpers stack up against each other.
You download the simple folder of test files to your desktop or documents, ready to explore at your own pace.
You add a quick note with details from your AI accounts, like a special passcode, so the tests can chat with them.
You choose what to test, like solving puzzles, writing stories, or creating cool web animations, feeling excited to see the results.
You start a test and see all the AIs tackle the same task one by one, generating answers, stories, or even playable web pages right before your eyes.
Open the new folders that appear, filled with easy-to-read summaries of each AI's performance on your chosen challenge.
Now you know exactly which AI shines brightest for writing, logic, or creative coding, making it simple to choose the best one for your needs.
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.