AEON-7 / Qwen3.6-NVFP4-DFlash
PublicQwen3.6-35B-A3B-heretic NVFP4 + DFlash speculative decoding on DGX Spark (GB10/sm_121a). Source-built vLLM image + 7 patches + comprehensive deployment guide.
This project offers a pre-configured package to run a highly optimized, quantized Qwen3.6 AI model with speed boosts for specific NVIDIA DGX Spark hardware.
How It Works
You hear about a special setup that makes AI conversations super speedy on powerful NVIDIA computers.
You confirm your computer is the right high-end NVIDIA type with plenty of memory and space.
You grab the ready-made package that has everything prepared for you.
You fetch the AI thinking files and place them in a simple folder.
With one easy command, you bring your turbo AI to life and it's ready to chat.
You ask something fun like 'What is 17 times 23?' and get a quick smart answer.
Now your AI handles tons of fast, smart conversations without any slowdowns.
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.