z2tong

z2tong / SCOPE

Public

SCOPE: Simulating Cross-game Operations in Playable Environments for FPS World Models

47
3
100% credibility
Found May 28, 2026 at 47 stars -- GitGems finds repos before they trend. Get early access to the next one.
Sign Up Free
AI Analysis
Python
AI Summary

SCOPE is an interactive AI system that generates realistic video sequences for first-person shooter games. Given a starting image and a sequence of player actions (like joystick movements and button presses), it creates short videos showing what would happen in the game. The system was trained on footage from 7 different games and can generalize to create believable gameplay footage for games it has never seen before.

How It Works

1
🔍 You discover SCOPE

You come across a research project that can imagine what happens next in a video game when you press buttons or move a joystick.

2
đŸ“Ļ You download the AI brain

You grab the trained AI model from the internet - it's been taught using footage from 7 different shooter games.

3
🎮 You pick a game moment

You choose a screenshot from any game - maybe you're aiming down sights in Black Myth: Wukong or running through Genshin Impact.

4
đŸ•šī¸ You define what you'll do

You record your action plan: move the left stick forward, press the fire button, look around with the right stick.

5
⚡ You watch the magic happen

The AI takes your starting image and action plan, then generates a short video showing what your character would do.

đŸŽŦ You get your game video

Out comes a smooth 4-second clip showing your character moving, aiming, and firing - all generated to match your actions.

Sign up to see the full architecture

4 more

Sign Up Free

Star Growth

See how this repo grew from 47 to 47 stars Sign Up Free
Repurpose This Repo

Repurpose is a Pro feature

Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.

Unlock Repurpose
AI-Generated Review

What is SCOPE?

SCOPE is an interactive world model for first-person shooter games that generates playable video footage conditioned on player actions. Given a starting image and a sequence of controller inputs (joystick movement, firing, jumping), it produces short video clips showing what that gameplay would look like. The system handles 10-dimensional action inputs combining continuous joystick movement with discrete button presses, and can generate approximately 4-second clips at 480x832 resolution. It is built in Python on top of the Wan2.2 diffusion architecture and runs inference through a straightforward command-line interface.

Why is it gaining traction?

The standout capability is cross-game generalization. Trained on footage from 7 different FPS titles, the model can generate believable gameplay for entirely unseen games without any fine-tuning. This is unusual in a space where models typically overfit to their training domains. The hybrid action space also matters for realism - it simultaneously processes aiming, movement, and weapon actions rather than treating them in isolation. Developers interested in game AI research, procedural content generation, or simulating player behavior have a clear reason to experiment here.

Who should use this?

Game AI researchers working on world models or player simulation will find this most relevant. Researchers studying cross-domain generalization in video generation have a concrete benchmark. Simulation and synthetic data pipelines that need action-conditioned game footage could integrate this. For production game development, the hardware requirements (24GB VRAM minimum, 80GB recommended) and generation speed make this impractical today. Indie developers hoping to generate actual game content will need significantly more maturity.

Verdict

This is promising academic research with real novelty in cross-game action conditioning, but it scores only 1.0% on credibility metrics. With 47 stars and a recent arXiv submission, the project is in early experimental stages. The model weights are available and the inference code is functional, so serious researchers can reproduce results. Production use cases should wait for community validation, better documentation, and hardware optimization. If you are exploring game world models or action-conditioned video generation for research, this is worth evaluating against alternatives.

Sign up to read the full AI review Sign Up Free

Similar repos coming soon.