Qwen3.5-122B-A10B on DGX Spark: 28.3 → 51 tok/s (+80%)
Scripts and tweaks to speed up a massive AI language model on NVIDIA DGX Spark hardware from 28 to 51 tokens per second.
How It Works
You hear about a way to make a huge AI model run much faster on your special NVIDIA computer.
Download the simple files that make everything work.
Get the smart model files so your AI can think.
Click one button to prepare everything automatically with progress updates.
Your setup finishes, now supercharged for lightning responses.
Launch it and connect to chat right away.
Enjoy responses at 51 words per second — 80% faster than before!
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.