KlingAIResearch

DecMem: Towards Minute-Long Consistent World Generation with Decoupled Memory

10
0
100% credibility
Found Jun 01, 2026 at 10 stars -- GitGems finds repos before they trend. Get early access to the next one.
Sign Up Free
AI Analysis
Python
AI Summary

DecMem is an open-source research project that generates minute-long videos with consistent characters and scenes. It uses a clever two-part memory system: one part remembers the big picture for long-term consistency, while another part focuses on smooth short-term transitions. The system learns from video-action pairs (like gameplay recordings) and can extend videos block-by-block while keeping everything coherent. Built by researchers at Kuaishou Technology and the University of Hong Kong, this project is freely available for anyone to use and modify.

How It Works

1
🔍 Discover the project

You find DecMem through a research paper, project page, or word of mouth about generating minute-long videos with consistent characters and scenes.

2
📦 Download and set up

You download the code and pre-trained model weights from the official repository, creating your own local copy to work with.

3
🎮 Prepare your training data

You gather video clips paired with action data (like gameplay recordings with player movements) to teach the model what you want it to generate.

4
⚙️ Train the model

You run the training process where the model learns to understand how actions connect to video frames, building its memory of how to create smooth, consistent sequences.

5
Generate your video

You provide a starting video or action sequence, and the model creates new frames one block at a time, maintaining consistency with everything that came before.

6
🎬 Watch the result

The model outputs a complete video showing smooth, continuous motion with characters and scenes that stay consistent throughout the entire minute-long sequence.

Sign up to see the full architecture

4 more

Sign Up Free

Star Growth

See how this repo grew from 10 to 10 stars Sign Up Free
Repurpose This Repo

Repurpose is a Pro feature

Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.

Unlock Repurpose
AI-Generated Review

What is DecMem?

DecMem is a video generation system that produces minute-long consistent videos by combining a diffusion model backbone with a novel decoupled memory architecture. The core insight is that standard attention mechanisms struggle with long sequences, so DecMem splits memory into two components: a sparse global memory for efficient retrieval across the full history, and an anchored local memory that keeps short-term transitions smooth. The system trains on gameplay footage paired with action and pose data, enabling controllable long video generation where camera movements and character actions stay coherent across extended clips. It's built in Python on top of the Wan2.1 video diffusion model and uses custom CUDA kernels for the sparse attention operations.

Why is it gaining traction?

The hook is simple: open-source minute-long video generation with consistency is rare. Most diffusion-based video models handle short clips; DecMem specifically targets the long-horizon problem that causes flickering, drift, and broken physics in extended generations. The decoupled memory design is the technical differentiator--by separating what gets attended to globally versus locally, it avoids the quadratic cost explosion of dense attention while maintaining fidelity. The research team (Kling/Kuaishou) has a track record in video generation, which lends credibility even to this early release.

Who should use this?

This is for researchers and engineers working on world simulation, game content generation, or robotics planning where you need extended video sequences that maintain visual and physical consistency. If you're evaluating video generation for training data synthesis or creative tools, DecMem addresses a real gap that standard short-clip models leave unsolved. Be warned: the full pipeline requires H100/H200/H800 GPUs with custom CUDA kernels, so plan your hardware accordingly.

Verdict

DecMem solves a legitimate problem and ships with working inference code, but with 10 stars and a 1.0% credibility score, this is clearly early-stage research code rather than production-ready tooling. The setup is involved--multi-stage training, custom kernel compilation, specific hardware requirements--and documentation, while present, assumes familiarity with video diffusion systems. Worth exploring if you're pushing the boundaries of long video generation, but approach with realistic expectations about the maturity level.

Sign up to read the full AI review Sign Up Free

Similar repos coming soon.