Vector compression with TurboQuant codecs for embeddings, retrieval, and KV caches. Up to 10x compression with a pure NumPy core and optional GPU acceleration via PyTorch (CUDA/MPS) or MLX (Metal).
Semafold is a compression toolkit for shrinking AI embeddings, vectors, and key-value caches with precise size tracking and quality guarantees.
How It Works
You need to shrink AI data (embeddings, vectors, or key-value caches) without losing the details that matter.
You add Semafold to your project; the core needs only NumPy.
You load the embeddings or KV-cache blocks that take up too much room.
You choose a compression level and run the codec; the data comes out 5 to 10 times smaller.
You get a report showing how much space was saved and how much quality was retained.
You swap the compressed data back into your AI pipeline.
Your project now uses less storage, which can also reduce load times and running costs.
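The compress, report, and restore steps above can be sketched in plain NumPy. This is not Semafold's API and not the TurboQuant codec: it uses simple uniform 4-bit scalar quantization as a stand-in, and the names quantize, dequantize, and report are hypothetical. It only illustrates how a size and quality report of this kind could be computed.

```python
import numpy as np

def quantize(x: np.ndarray, bits: int = 4):
    """Uniform per-vector scalar quantization (a stand-in, NOT TurboQuant)."""
    levels = 2 ** bits - 1
    lo = x.min(axis=1, keepdims=True)
    hi = x.max(axis=1, keepdims=True)
    scale = (hi - lo) / levels
    scale[scale == 0] = 1.0  # constant rows: avoid division by zero
    # Codes are held one per uint8 here for simplicity; a real codec
    # would bit-pack them. The report below counts the packed size.
    codes = np.round((x - lo) / scale).astype(np.uint8)
    return codes, lo.astype(np.float32), scale.astype(np.float32)

def dequantize(codes, lo, scale):
    """Map integer codes back to approximate float32 values."""
    return codes.astype(np.float32) * scale + lo

def report(x, x_hat, bits):
    """Size and quality summary for one round trip."""
    n, d = x.shape
    original_bytes = n * d * 4  # float32 input
    # Packed codes plus two float32 values (lo, scale) per vector.
    compressed_bytes = int(np.ceil(n * d * bits / 8)) + n * 2 * 4
    cos = np.sum(x * x_hat, axis=1) / (
        np.linalg.norm(x, axis=1) * np.linalg.norm(x_hat, axis=1)
    )
    return {
        "original_bytes": original_bytes,
        "compressed_bytes": compressed_bytes,
        "ratio": original_bytes / compressed_bytes,
        "mse": float(np.mean((x - x_hat) ** 2)),
        "mean_cosine": float(cos.mean()),
    }

# Synthetic data standing in for real embeddings.
rng = np.random.default_rng(0)
embeddings = rng.standard_normal((1000, 768)).astype(np.float32)

codes, lo, scale = quantize(embeddings, bits=4)
restored = dequantize(codes, lo, scale)
stats = report(embeddings, restored, bits=4)
print(stats)  # ratio is 3,072,000 / 392,000, about 7.8x
```

With 4 bits per coordinate the ratio lands inside the 5 to 10 times range mentioned above; fewer bits raise the ratio at the cost of higher reconstruction error, which is the trade-off the report is meant to expose.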