BlackOtters / SonicStar
Open-source Unitree G1 Vision-Language-Action stack for teleop data collection, SonicLatent training, simulation, and real-time whole-body policy deployment (real-world deployment TBD).
SonicStar is an open-source project that enables training Vision-Language-Action (VLA) models for the Unitree G1 humanoid robot. It provides tools for collecting demonstration data through teleoperation, training AI models that understand camera images and natural language instructions, and deploying trained models for real-time robot control. The project supports multiple training approaches including flow-matching and diffusion-based methods for predicting robot actions. It integrates with established robotics frameworks like LeRobot and builds on top of vision-language models like Qwen-VL.
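To make the flow-matching approach mentioned above more concrete, here is a minimal sketch of how a flow-matching objective for action prediction is commonly set up. It is not taken from the SonicStar code: the function names (flow_matching_pair, flow_matching_loss, euler_sample), the array shapes, and the straight-line (rectified-flow) interpolation are all assumptions chosen for illustration, and a real VLA model would also condition the velocity prediction on camera images and the language instruction.

```python
import numpy as np


def flow_matching_pair(actions, rng):
    """Build one flow-matching training example (hypothetical helper).

    actions: demonstration action chunks, shape (batch, horizon, action_dim).
    Returns the interpolated sample x_t, the sampled time t, and the
    velocity the model should regress toward.
    """
    noise = rng.standard_normal(actions.shape)            # starting point at t = 0
    t = rng.uniform(0.0, 1.0, size=(actions.shape[0], 1, 1))
    x_t = (1.0 - t) * noise + t * actions                 # straight-line path
    target_velocity = actions - noise                     # constant along that path
    return x_t, t, target_velocity


def flow_matching_loss(predicted_velocity, target_velocity):
    """Mean squared error between predicted and target velocities."""
    return float(np.mean((predicted_velocity - target_velocity) ** 2))


def euler_sample(velocity_fn, shape, steps, rng):
    """Generate actions by integrating a velocity field from noise (t = 0 to 1).

    velocity_fn(x, t) stands in for a trained model (assumption).
    """
    x = rng.standard_normal(shape)
    dt = 1.0 / steps
    for k in range(steps):
        t = k * dt
        x = x + dt * velocity_fn(x, t)                    # one explicit Euler step
    return x
```

During training, the model's velocity prediction at (x_t, t) would be passed to flow_matching_loss together with target_velocity. At deployment, euler_sample would be called with the trained model as velocity_fn to turn noise into an action chunk in a small number of steps.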
How It Works
You discover that researchers have trained a Unitree G1 humanoid robot to understand natural language commands and perform physical tasks.
Before the robot can help you, you need to collect examples of tasks being performed and use them to train an AI model.
Using the project's teleoperation tools, you control the robot and perform actions like picking up objects while cameras record each demonstration.
The trained model learns to connect what it sees in camera images with the actions needed to complete tasks described in plain English.
With everything set up, you give the robot a task like 'pick up the red block' and watch it figure out how to do it.
The robot understands your command and performs the task you asked for, drawing on what the model learned from the training examples you collected.