0xSero / moe-compress
PublicModel-agnostic MoE compression automation: build calibration bundles, run REAP/quantization/benchmark/publish stages, and render auditable reports.
A collection of automation tools that streamline shrinking large AI models by trimming unused sections, compressing data, measuring performance, and producing clear summary reports.
How It Works
You find a handy tool that helps shrink massive AI models to run faster while keeping their smarts.
You collect your big AI model files and some example conversations or texts to test with.
You jot down simple instructions on how much to trim, squeeze, test, and where to save the results.
With one go, it builds a test set, trims extra parts, squeezes the data, checks speeds, and creates a full summary.
You get easy-to-read reports with charts showing sizes, speeds, and quality checks for each version.
Your smaller, faster model is ready to use or share, saving time and resources every day.
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.