yashkc2025 / turboquant
PublicPython implementation of TurboQuant (arXiv 2504.19874). Data-oblivious, near-optimal 1–4 bit vector quantization for streaming KV-caches and databases.
TurboQuant provides smart methods to compress groups of related numbers far more efficiently than basic approaches, complete with tests showing big improvements in size and accuracy.
How It Works
You stumble upon TurboQuant, a clever trick for shrinking bundles of measurements into tiny sizes while keeping them mostly accurate.
You add this shrinking tool to your number-handling playground on your computer.
You run a simple comparison to see how it stacks up against a basic shrinking method.
You see charts proving it squeezes data 16 times smaller with way less fuzziness than the ordinary way!
You play with examples like finding closest matches in data or pretending to save memory for big lists.
You swap it in wherever you need to pack measurements tightly for speed and savings.
Now your measurements zip around super small and sharp, making everything quicker and easier.
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.