Dynamis-Labs / spectralquant
Public3% Is All You Need: Breaking TurboQuant's Compression Limit via Spectral Structure
SpectralQuant is an open-source research project providing code to compress AI model memory for faster inference while maintaining quality, demonstrated across multiple models and benchmarks.
How It Works
You find this project on GitHub and get excited about a smarter way to make AI chatbots run faster by compressing their memory.
Download the project to your computer and follow the easy setup guide to get everything ready with one simple command.
Open the welcoming instructions that explain how it finds hidden patterns in AI data to save space.
Feed it short sample texts for a quick 15-second learning step so it understands your AI's patterns.
Run tests on popular chat AIs like Qwen or Llama to see huge speedups and better quality in the colorful graphs.
See 2x faster responses with graphs zooming across lengths.
Notice perfect memory recall in needle tests.
Your chatbot now thinks quicker with less memory – ready for real chats or sharing discoveries!
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.