lifeiteng / OmniVAD-Kit
PublicCross-platform VAD & Audio Event Detection toolkit — Python (PyPI) + TypeScript (npm) + C API. DFSMN models ~2MB, 200x real-time. Runs everywhere: native, browser (WASM), Node.js.
OmniVAD-Kit is a cross-platform toolkit for voice activity detection and audio event detection, providing lightweight models that run in Python, JavaScript, and native applications.
How It Works
You hear about a simple tool that finds speech, singing, or music moments in your audio recordings.
Download it easily for your computer or web project with a quick install command.
Run it on audio files right from your command line.
Add it to your website or mobile app for live audio.
Upload or play your recording, and it listens carefully.
In seconds, it highlights exact start and end times of speech or music.
You now have clean clips ready for editing, transcribing, or sharing.
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.