KV-cache compression for Whisper-family speech models. Drop-in patch, three eviction policies.
WhisperKV is a lightweight tool that reduces the memory needed by speech recognition models during long audio transcription, allowing users to process extended recordings on smaller computers without retraining the model.
How It Works
You need to convert hours of speech into text, but your computer runs out of memory partway through.
WhisperKV is a small helper that makes your speech recognition model use much less memory while aiming to keep accuracy intact.
A simple one-line installation adds the memory-saving feature to your existing setup.
You tell WhisperKV how much of the model's working memory to keep by choosing a simple setting such as 'keep the last 64 words plus the 32 most important ones'.
Your audio runs through the model, which now discards the less useful parts of its memory while keeping what matters for accuracy.
Your full audio is transcribed successfully using a fraction of the memory, even on a smaller computer.
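To make the 'keep the last 64 plus the 32 most important' setting concrete, here is a minimal sketch of that kind of selection rule in plain Python. It is an illustration only, not WhisperKV's actual code or API: the function name `select_kept_positions`, its parameters, and the made-up importance scores are all assumptions. In a real KV cache the entries correspond to tokens rather than words, and the importance score would typically come from attention weights.

```python
from typing import List, Sequence


def select_kept_positions(
    importance: Sequence[float],
    recent: int = 64,
    important: int = 32,
) -> List[int]:
    """Return the sorted cache positions to keep; all others would be evicted.

    Hypothetical helper for illustration; not part of WhisperKV.
    """
    n = len(importance)
    # Always keep the most recent `recent` positions.
    recent_start = max(0, n - recent)
    kept = set(range(recent_start, n))
    # Among the older positions, keep the `important` highest-scoring ones.
    older = range(recent_start)
    top_older = sorted(older, key=lambda i: importance[i], reverse=True)[:important]
    kept.update(top_older)
    return sorted(kept)


# Made-up importance scores for a cache holding 200 entries.
scores = [float(i % 7) for i in range(200)]
kept = select_kept_positions(scores)
print(len(kept), "of", len(scores), "entries kept")  # prints "96 of 200 entries kept"
```

With these defaults the cache never holds more than 96 entries, however long the recording runs, which is where the memory saving comes from.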