THU-SI / Spatial-TTT
PublicOfficial Implementation of Spatial-TTT: Streaming Visual-based Spatial Intelligence with Test-Time Training
Spatial-TTT is an open-source framework for training AI models to perform advanced spatial reasoning on streaming videos using test-time adaptation techniques.
How It Works
You hear about a clever tool that helps AI make sense of spaces and objects in videos, like counting or remembering positions over time.
You set up a simple space on your computer where everything will happen, ready for videos and learning.
You collect short video clips showing everyday scenes, like rooms or paths, to teach the AI about space.
You let the AI watch and learn from the videos, building its ability to track and understand layouts as they unfold.
You run quick checks on new videos to see how well the AI spots positions, counts items, or recalls details.
Your AI now excels at spatial smarts in long videos, delivering reliable answers for real-world scene analysis.
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.