andraiming / speech-tokenizer-arena
A side-by-side benchmarking playground for discrete speech tokenizers (EnCodec, HuBERT-units, SpeechTokenizer, etc.).
Speech Tokenizer Arena is a benchmarking tool for speech compression methods. It runs the same audio through each method and measures quality, efficiency, and speech recognition accuracy, so researchers can choose the option that fits their needs.
How It Works
You've collected audio files and need to find the best way to compress them while keeping quality high.
You download Speech Tokenizer Arena, a tool that tests different compression methods side-by-side on your audio.
You choose from options like EnCodec, DAC, SpeechTokenizer, or HuBERT units based on what you want to compare.
The tool runs your audio through each compression method, measures the quality of the result, and tracks how much data each one uses.
See all methods compared in one chart with scores for quality, speed, and data usage.
View bar graphs and spectrograms showing how each method performed on your audio.
With clear numbers in hand, you can confidently choose the compression method that fits your quality needs and data budget.
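The comparison described above can be sketched in a few lines of Python. This is a minimal illustration and not the tool's own code or API: the function names, the `Result` record, and the numbers in the demo are assumptions made for the example. It uses the standard bitrate formula for a codebook-based tokenizer (frames per second × number of codebooks × bits per code) and a plain signal-to-noise ratio as a stand-in quality score; the tool itself may use different metrics.

```python
import math
from dataclasses import dataclass


@dataclass
class Result:
    name: str           # label for the method (hypothetical)
    bitrate_bps: float  # data usage in bits per second
    snr_db: float       # quality score in dB, higher is better


def bitrate_bps(frame_rate_hz, num_codebooks, codebook_size):
    """Bits per second emitted by a tokenizer with fixed-size codebooks."""
    return frame_rate_hz * num_codebooks * math.log2(codebook_size)


def snr_db(reference, reconstruction):
    """Signal-to-noise ratio between original and reconstructed samples."""
    signal = sum(x * x for x in reference)
    noise = sum((x - y) ** 2 for x, y in zip(reference, reconstruction))
    if noise == 0:
        return math.inf   # perfect reconstruction
    if signal == 0:
        return -math.inf  # silent reference, any error dominates
    return 10 * math.log10(signal / noise)


def pick_best(results, max_bitrate_bps):
    """Highest-quality result that stays within the data budget, or None."""
    within = [r for r in results if r.bitrate_bps <= max_bitrate_bps]
    return max(within, key=lambda r: r.snr_db) if within else None


if __name__ == "__main__":
    # Illustrative settings only; not measurements from the tool.
    original = [0.0, 0.5, 1.0, 0.5, 0.0, -0.5, -1.0, -0.5]
    close = [x * 0.98 for x in original]  # mild distortion
    rough = [x * 0.80 for x in original]  # heavier distortion
    results = [
        Result("method-a", bitrate_bps(75, 8, 1024), snr_db(original, close)),
        Result("method-b", bitrate_bps(50, 1, 1024), snr_db(original, rough)),
    ]
    for r in results:
        print(f"{r.name}: {r.bitrate_bps:.0f} bps, {r.snr_db:.1f} dB")
    best = pick_best(results, max_bitrate_bps=1000)
    print("best within budget:", best.name if best else "none")
```

Separating the measurement (bitrate and quality) from the selection step keeps the trade-off explicit: tightening the data budget changes which method wins without re-running any audio.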