Dicklesworthstone / franken_whisper
PublicAgent-first Rust ASR orchestration stack: Bayesian backend routing across whisper.cpp/insanely-fast-whisper/whisper-diarization, real-time NDJSON streaming, SQLite persistence, TTY audio transport, conformance harness. 107K lines, 2000+ tests, zero unsafe code.
A command-line tool that orchestrates multiple speech-to-text engines into a unified interface for transcribing audio files or live microphone input with streaming output, persistence, and adaptive selection.
How It Works
You hear about a simple tool that turns spoken words in audio files or recordings into readable text, perfect for meetings or podcasts.
Download and prepare the tool on your computer so it's all set to use.
Pick an audio file from your device, pipe in sound, or record live from your microphone.
Hit go and watch as it smartly picks the best way to convert speech to text quickly and accurately.
Get a clean written version of what was said right away.
See timestamps, speakers, and history for deeper insights.
Everything is automatically stored so you can review past transcriptions anytime.
Enjoy accurate, organized text from your audio, ready to read, share, or use in your projects.
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.