ydyhello / Awesome-VLM-Streaming-Video
A curated collection of papers and open-source code repositories dedicated to the application of Vision-Language Models (VLMs) for streaming video.
A curated list of research papers, open-source projects, benchmarks, datasets, surveys, and resources focused on Vision-Language Models for streaming video understanding and interaction.
How It Works
You stumble upon this handy list while searching for the latest ideas on AI that understands live videos, like a smart companion watching streams with you.
You scroll through organized categories like projects, reports, memory tricks, and benchmarks, each packed with promising titles and quick summaries.
You spot entries from big names, with links to papers and ready-to-try examples that make real-time video chat feel magical.
You click on a paper or project that catches your fancy, reading about how AI decides when to speak or remembers long videos without forgetting.
You note down benchmarks, datasets, and resources to fuel your own explorations or stay ahead in video AI trends.
Now you're equipped with a treasure trove of cutting-edge knowledge, ready to follow the leaders in making AI watch and react to videos just like a friend.