OpenMOSS / MOSS-Audio-Tokenizer
MOSS-Audio-Tokenizer is a causal Transformer-based audio tokenizer built on the CAT architecture. Trained on 3M hours of diverse audio, it supports streaming and variable bitrates, delivering SOTA reconstruction and strong performance in generation and understanding. It serves as a unified interface for next-generation native audio language models.
This repository is the official code for MOSS-Audio-Tokenizer, a high-fidelity neural audio codec that compresses raw audio waveforms into discrete tokens and reconstructs audio from those tokens.
How It Works
MOSS Audio Tokenizer packs full sound clips into compact discrete codes while preserving as much detail as possible.
Download the ready-to-use files and set up a working folder on your computer.
Pick any audio file from your collection, such as a voice recording or a favorite tune.
Run the encoder to turn your audio into a short sequence of compact codes.
Feed those codes back into the decoder to produce a new audio file.
Play the rebuilt audio and compare it with the original: it should sound very close, ready for your next audio project.
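The encode-then-decode round trip described above can be illustrated with a toy stand-in. The sketch below does not use MOSS-Audio-Tokenizer or its API. It assumes a much simpler technique, 8-bit mu-law quantization, and the function names `encode` and `decode` are hypothetical. It only shows the general idea of turning samples into discrete integer tokens and rebuilding an approximate waveform from them.

```python
import math

MU = 255  # 8-bit mu-law: each token takes one of 256 values (toy assumption)


def encode(samples):
    """Map float samples in [-1, 1] to integer tokens in [0, 255]."""
    tokens = []
    for x in samples:
        x = max(-1.0, min(1.0, x))  # clip to the valid range
        # Compress the amplitude logarithmically, then quantize to 256 levels.
        y = math.copysign(math.log1p(MU * abs(x)) / math.log1p(MU), x)
        tokens.append(int(round((y + 1.0) / 2.0 * MU)))
    return tokens


def decode(tokens):
    """Map integer tokens back to approximate float samples."""
    samples = []
    for t in tokens:
        y = 2.0 * t / MU - 1.0  # undo the shift to [0, 255]
        # Invert the logarithmic compression.
        x = math.copysign(math.expm1(abs(y) * math.log1p(MU)) / MU, y)
        samples.append(x)
    return samples


# A 440 Hz sine sampled at 16 kHz stands in for a real recording.
sample_rate = 16000
original = [0.5 * math.sin(2 * math.pi * 440 * n / sample_rate) for n in range(1600)]

tokens = encode(original)
rebuilt = decode(tokens)
max_error = max(abs(a - b) for a, b in zip(original, rebuilt))
print(f"{len(tokens)} tokens, max reconstruction error {max_error:.4f}")
```

Even in this toy case the rebuilt signal is close to the original but not identical, because quantizing to a finite set of tokens discards some information. A learned codec such as the one in this repository replaces the fixed mu-law mapping with trained encoder and decoder networks.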