MilkClouds / awesome-vla-study
PublicA structured reading list on Vision-Language-Action (VLA) models — from diffusion/flow matching foundations through state-of-the-art robot foundation model architectures to data scaling, RL fine-tuning, and world models. Papers in reading order.
A structured study guide curating papers, courses, and resources for learning Vision-Language-Action models in robotics, organized into weekly phases from basics to advanced topics.
How It Works
You stumble upon this friendly roadmap while searching for ways to learn about smart robot brains online.
You glance at the starting requirements to see if you know enough math and AI fundamentals to jump in.
Your basics are solid, so begin the weekly study phases right away.
Watch free video courses to catch up on deep learning essentials.
Dive into organized phases, reading key stories about robot learning one week at a time.
Share what you've learned by presenting papers and chatting about big ideas with others.
Check out linked videos, courses, and similar guides to expand your horizons.
You now understand the cutting edge of how AI makes robots see, think, and act like pros.
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.