GB10 NVFP4 native MTP reproducibility pack for Qwen3.6-35B-A3B
This repository is a step-by-step guide for running a large AI assistant (Qwen3.6-35B) on NVIDIA GB10 graphics cards. The project includes a compatibility fix for the inference software, launch scripts, and testing tools. It uses special compression techniques to make the AI assistant run efficiently on high-end hardware while maintaining quality responses. The project does not include the AI model files itself—users download those separately—and provides clear documentation so anyone with the right hardware can reproduce the results.
How It Works
You discover a community member shared a working setup for running a powerful AI assistant on new high-end hardware.
Following the instructions, you download the AI assistant files separately from the official model library.
The project includes a small fix that makes everything work properly with your hardware and software setup.
With one simple command, your AI assistant starts up on your computer, ready to answer questions.
You run a quick test to make sure the AI assistant is thinking correctly and responding properly.
Ask one question and see how quickly it responds
Send several questions at once to see how it handles busy periods
Everything is working! Your AI assistant runs fast on your hardware, and you can start using it for real tasks.
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.