HumanEgo is a research project that enables robots to learn manipulation tasks—like picking up objects, arranging items, or performing household chores—by watching short videos of humans performing those same tasks. The system uses special egocentric cameras (like Project Aria smart glasses) to record what a person sees and does from their own perspective. Advanced AI models then automatically analyze these videos to track hand movements, identify objects, and reconstruct 3D spatial relationships. This data is used to train a robot policy using flow matching—a technique that learns how to generate correct robot movements from observed human demonstrations. In essence, it allows someone to teach a robot a new skill simply by performing that task themselves while wearing the glasses, rather than writing code or manually demonstrating on the robot.
How It Works
Instead of programming movements by hand, you want the robot to learn by watching you work.
You wear Project Aria glasses that record everything you see and do from your own perspective while you complete a task like picking up a cup or serving bread.
Advanced AI models track your hands, identify objects, and build a 3D understanding of the scene—all automatically.
The trained AI model analyzes your movements and teaches the robot what to do, without any manual programming.
The robot tries to replicate what it learned, adjusting its movements based on feedback until it gets it right.
After watching just minutes of your demonstration, the robot has learned to complete the task independently.
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.