Entrpi / ds4-on-spark
antirez/ds4 (DwarfStar 4) on NVIDIA DGX Spark: install, benchmarks, and roofline analysis. Steady-state decode at ~95% of bandwidth ceiling; MTP and concurrency analyzed.
This repository provides a complete setup guide for running the DeepSeek-V4 AI model on NVIDIA DGX Spark hardware. It automates downloading the inference engine, building binaries optimized for the Spark's GPU, downloading the 81-gigabyte quantized model, and starting a server that answers questions like an AI assistant. The project includes detailed performance benchmarks showing that the model generates about 24-28 tokens per second in steady-state decoding, approximately 95% of the hardware's memory-bandwidth ceiling. It also documents a known issue with speculative decoding on CUDA that causes a small performance regression, along with the root cause and planned fix.
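The "95% of the ceiling" figure follows the usual roofline reasoning for bandwidth-bound decoding: if each generated token requires streaming a fixed number of bytes from memory, the token rate cannot exceed bandwidth divided by bytes per token. The sketch below shows that arithmetic only. The function names are hypothetical, the 273 GB/s bandwidth is an assumed nominal figure for DGX Spark, and the bytes-per-token value is picked for illustration; neither number is taken from the repository's measurements.

```python
def decode_ceiling_tps(bandwidth_bytes_per_s: float, bytes_read_per_token: float) -> float:
    """Upper bound on decode tokens/s when each token must stream
    bytes_read_per_token from memory (bandwidth-bound roofline)."""
    if bandwidth_bytes_per_s <= 0 or bytes_read_per_token <= 0:
        raise ValueError("inputs must be positive")
    return bandwidth_bytes_per_s / bytes_read_per_token


def roofline_fraction(measured_tps: float, ceiling_tps: float) -> float:
    """Fraction of the bandwidth ceiling that a measured rate achieves."""
    if ceiling_tps <= 0:
        raise ValueError("ceiling must be positive")
    return measured_tps / ceiling_tps


# Assumed inputs, for illustration only.
ASSUMED_BANDWIDTH = 273e9       # bytes/s; assumed nominal DGX Spark memory bandwidth
ASSUMED_BYTES_PER_TOKEN = 10e9  # hypothetical; not measured, not from the repository

ceiling = decode_ceiling_tps(ASSUMED_BANDWIDTH, ASSUMED_BYTES_PER_TOKEN)
# 26 tok/s is the midpoint of the reported 24-28 tok/s range.
fraction = roofline_fraction(26.0, ceiling)
```

To reproduce the repository's own figure, replace the assumed inputs with the measured bandwidth and the actual bytes read per token for the quantized model.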
How It Works
You learn about a project that lets you run a powerful AI model called DeepSeek-V4 on your NVIDIA DGX Spark computer, with detailed performance measurements.
You run a single installer command that automatically checks your hardware, downloads the AI engine, and gets everything ready to use.
The installer confirms your Spark computer has the right GPU and memory to run the AI model smoothly.
The 81-gigabyte AI model downloads piece by piece, and the inference engine compiles specifically for your hardware.
The installer asks the AI a simple question like 'What is the capital of France?' and verifies it answers correctly.
Launch your AI assistant that answers questions through a web interface, ready whenever you need it.
Measure how fast your Spark can process AI requests and compare against the theoretical limits.
Your DeepSeek-V4 AI is now running on your Spark, able to answer questions, write code, and help with complex reasoning tasks at impressive speeds.
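The smoke test in the steps above (asking "What is the capital of France?") can be sketched as a simple check on the returned text. How the installer actually verifies the answer is not stated here, so this is a minimal sketch under the assumption that a case-insensitive substring match is enough; `answer_mentions` is a hypothetical helper, not part of the project.

```python
def answer_mentions(answer: str, expected: str = "Paris") -> bool:
    """Return True if the model's answer contains the expected string,
    ignoring case. An assumed check, not the installer's real logic."""
    return expected.casefold() in answer.casefold()


# Example replies a model might give (made up for illustration).
ok_reply = "The capital of France is Paris."
bad_reply = "I am not sure."

passed = answer_mentions(ok_reply)
failed = answer_mentions(bad_reply)
```

A substring match tolerates extra wording around the answer, which suits free-form model output better than an exact string comparison.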