vvt004 / speech-eval-arena
PublicA small CLI harness for evaluating speech LLMs and ASR models on standard benchmarks (LibriSpeech, FLEURS, VoxPopuli).
Speech Eval Arena is a command-line tool that lets researchers test how well AI speech recognition models transcribe audio by running them against standard benchmark datasets and measuring accuracy scores.
How It Works
You want to know which speech recognition AI does the best job on different types of speech, like reading books or news broadcasts.
You install the tool with a simple command, and everything is ready to go in seconds.
You pick a model like Whisper or Canary, and choose what kind of speech to test it on—English audio, Mandarin, or noisy recordings.
The tool plays through all the audio clips, asking the AI to write down what it hears, and saves each guess.
The tool compares the AI's guesses against the correct answers and calculates a score showing how accurate it was.
Run a second AI on the same test and see which one is more accurate with a clear comparison table.
Generate a report showing all your test results in a neat table you can share with others.
You now know exactly how well each speech AI performs, helping you pick the right one for your project.
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.