Hedlen / Awesome-Multimodal-Intelligence
Public专注于 VLM、VLA、世界模型及通用具身智能等方向,收录前沿论文、开源代码与数据集,追踪从感知到决策的下一代智能体技术。 A curated collection for multimodal intelligence research, covering VLMs, VLAs, World Models, and embodied AI — tracking next-generation agent technologies from perception to decision-making, with a focus on papers, code, and datasets.
A curated collection of resources including papers, models, datasets, and tools for multimodal intelligence in AI.
How It Works
You search online for the best resources on AI that combines images, sounds, and words, and stumble upon this handy collection.
You click the GitHub link and land on a clean page full of organized recommendations.
You scroll through handy categories like models, tools, and research papers, picking what sparks your interest.
A cool project or insightful paper catches your eye, and you feel excited to dive in.
You click on promising entries to visit websites, read summaries, or download helpful materials.
You soak up new ideas and start experimenting with what you've found in your own projects.
Now you're equipped with the latest knowledge and ready to create amazing things with smart AI.
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.