FireRedTeam / FireRedVAD
PublicA SOTA Industrial-Grade Voice Activity Detection & Audio Event Detection, supporting 100+ languages, outperforming Silero-VAD, TEN-VAD, FunASR-VAD and WebRTC-VAD
FireRedVAD is a high-performance open-source toolkit for detecting voice activity and audio events such as speech, singing, and music in audio files across over 100 languages.
How It Works
You hear about this handy tool that spots talking, singing, or music moments in any audio recording, working great in over 100 languages.
You create a fresh spot on your computer to play with this audio magic, keeping everything neat and simple.
You download the ready-to-use models that power the super-accurate spotting of voices and sounds – it's quick and easy!
You pick an audio file like a podcast or song and tweak it to the perfect format so it works smoothly.
You tell the tool to analyze your file, choosing to find speech, live streaming voice, or events like singing and music.
Instantly, you get a clear list of exact start and end times for talking or other sounds, making editing a breeze!
Now you can easily cut out silences, focus on voices, or mix tracks perfectly, saving tons of time.
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.