alexziskind1 / draftbench
PublicBenchmark tool for measuring speculative decoding speedups. Sweep draft/target model combinations and generate interactive charts.
draftbench is a benchmarking tool that automates testing combinations of large target AI models and smaller draft models to identify the optimal pairing for faster text generation using speculative decoding.
How It Works
You learn about a handy tool that tests small helper AIs with your big AI to find the fastest combo for quicker chats.
Download a few large main AIs and matching smaller helpers onto your computer.
Jot down a simple plan naming your main AIs, helpers, and test settings like how much text to generate.
Start the automatic run and watch it fire up each pair, measure speeds, and save results as it goes.
Give it time to finish all tests, with updates showing speeds and how well each helper predicts correctly.
Open beautiful interactive graphs highlighting the best pairs that boost your AI speed by up to 80%.
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.