simchowitzlabpublic

A Minimalist, Batteries-included Repository for Advancing World Model Science.

AI Summary

A minimalist open-source toolkit for training diffusion-based video world models on robotics datasets like mazes and pushing tasks, with pretrained checkpoints and built-in planning.

How It Works

1. 🔍 Discover Nano World Model

You find this simple tool on GitHub that lets you train AI to predict future video frames from actions, perfect for robotics or games.

2. 📥 Set up your playground

Create a ready-to-use workspace with one command that grabs all the tools you need.

3. 📦 Add your videos

Point it at video clips of robots or games, and it handles the rest.

4. 🚀 Train your predictor

Hit go and watch it learn to imagine what happens next in your videos.

5. 🎥 See predictions

Generate future video frames and see realistic continuations.

6. 🧭 Plan ahead

Use predictions to find smart paths for robots to reach goals.
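The train-then-predict loop in steps 4 and 5 can be sketched on toy data. Here a linear next-frame predictor fit by least squares stands in for the diffusion model; every shape and name below is illustrative, not the repo's API:

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy dataset: flattened 4x4 "frames" whose next frame is a fixed linear
# function of the current frame (16 values) and a 2-D action.
W_true = rng.standard_normal((18, 16)) * 0.1
X = rng.standard_normal((500, 18))           # [frame (16) | action (2)]
Y = X @ W_true                               # ground-truth next frames

# "Training": a least-squares fit stands in for diffusion training.
W_hat, *_ = np.linalg.lstsq(X, Y, rcond=None)

# "Prediction": roll the model forward on a new frame/action pair.
x_new = rng.standard_normal(18)
pred = x_new @ W_hat
assert np.allclose(pred, x_new @ W_true, atol=1e-6)
```

The real pipeline replaces the linear map with a video diffusion model and the least-squares fit with gradient descent, but the interface is the same: (frame, action) in, next frame out.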

Master video foresight

Now you create lifelike simulations and clever plans effortlessly.

AI-Generated Review

What is nano-world-model?

Nano-world-model is a minimalist Python repository for training compact video world models that predict future frames from context frames and actions using diffusion-forcing. It addresses the pain of sprawling, dependency-heavy video-prediction setups with batteries-included pipelines for datasets like DINO-WM, CSGO, and RT-1, delivering long-horizon rollouts, planning, and video-to-3D within minutes of cloning. Users get pretrained checkpoints on Hugging Face, ready for advancing world model science without wrestling with configs.
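The diffusion-forcing idea mentioned above corrupts each frame of a clip at its own independent noise level, rather than one level for the whole sequence, which is what enables stable long-horizon rollouts. A minimal NumPy sketch of just the corruption step, with toy shapes and hypothetical names (not the repo's implementation):

```python
import numpy as np

rng = np.random.default_rng(0)

def corrupt_frames(frames, rng):
    """Diffusion-forcing-style corruption: each frame gets its own
    noise level t in [0, 1] instead of one shared level per clip."""
    T = frames.shape[0]
    t = rng.uniform(0.0, 1.0, size=(T, 1, 1))    # per-frame noise level
    eps = rng.standard_normal(frames.shape)      # Gaussian noise
    noisy = np.sqrt(1.0 - t) * frames + np.sqrt(t) * eps
    return noisy, t, eps

frames = rng.standard_normal((8, 16, 16))        # toy clip: 8 frames of 16x16
noisy, t, eps = corrupt_frames(frames, rng)
assert noisy.shape == frames.shape
```

The model is then trained to denoise each frame given its noise level, so at sampling time earlier frames can be kept nearly clean while later ones are generated from heavy noise.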

Why is it gaining traction?

Its hook is true minimalism: it strips diffusion training to the essentials with Hydra configs, instant conda environments, and one-line commands like `python src/main.py experiment=dino_wm_pusht`. Developers notice seamless scaling across environments, ablation-ready metrics (PSNR, FVD), and out-of-the-box applications like CEM planning and point-cloud reconstruction, beating more verbose alternatives on speed and transparency.
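Of the metrics mentioned above, PSNR is simple enough to sketch in a few lines; this is a generic definition, not necessarily the repo's implementation:

```python
import numpy as np

def psnr(pred, target, max_val=1.0):
    """Peak signal-to-noise ratio in dB for frames with values in [0, max_val]."""
    mse = np.mean((pred - target) ** 2)
    if mse == 0:
        return float("inf")                 # identical frames
    return 10.0 * np.log10(max_val ** 2 / mse)

target = np.zeros((16, 16))
pred = target + 0.1                         # uniform 0.1 error -> MSE = 0.01
print(round(psnr(pred, target), 1))         # -> 20.0
```

FVD, by contrast, needs a pretrained video feature extractor and distribution statistics, which is exactly the kind of dependency a minimalist repo has to package carefully.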

Who should use this?

RL researchers benchmarking world models on manipulation tasks (point maze, pusht, rope), vision researchers needing action-conditioned video diffusion baselines, and developers prototyping world model apps on custom videos. Ideal for academics running head-to-head ablations or engineers building MPC planners without repo bloat.
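For MPC-style planners like those mentioned above, the cross-entropy method (CEM) the review cites can be sketched against a toy 2-D point-mass standing in for the learned world model; all names, dynamics, and hyperparameters here are illustrative, not the repo's API:

```python
import numpy as np

def toy_rollout(state, actions):
    """Stand-in world model: a point mass that adds each action to its position."""
    traj = [state]
    for a in actions:
        traj.append(traj[-1] + a)
    return np.array(traj)

def cem_plan(state, goal, horizon=5, pop=256, elites=32, iters=5, rng=None):
    """Cross-entropy method: sample action sequences, keep the lowest-cost
    elites, refit the sampling distribution to them, and repeat."""
    rng = rng or np.random.default_rng(0)
    mu = np.zeros((horizon, 2))
    sigma = np.ones((horizon, 2))
    for _ in range(iters):
        cand = rng.normal(mu, sigma, size=(pop, horizon, 2))
        costs = [np.linalg.norm(toy_rollout(state, a)[-1] - goal) for a in cand]
        elite = cand[np.argsort(costs)[:elites]]
        mu, sigma = elite.mean(axis=0), elite.std(axis=0) + 1e-6
    return mu

plan = cem_plan(np.zeros(2), np.array([3.0, -1.0]))
final = toy_rollout(np.zeros(2), plan)[-1]
assert np.linalg.norm(final - np.array([3.0, -1.0])) < 0.5
```

Swapping the point mass for a learned video world model (and the distance cost for a goal-image cost) gives the usual world-model MPC loop.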

Verdict

Grab it if you're diving into nano world models: 304 stars show early buzz, and solid docs plus an MIT license make it approachable, despite a 1.0% credibility score signaling nascent maturity. Test with the pretrained checkpoints first; extend to production once evals stabilize.
