Toolkit for collecting, merging, auditing, visualizing, and publishing RGB/RGB-D LeRobot VLA datasets.
This is a comprehensive toolkit for creating and preparing robot training datasets with depth information. It enables users to record robot movements using Orbbec RGB-D cameras, merge multiple recording sessions into unified datasets, automatically check data quality for corruption or anomalies, remove problematic episodes, and publish the final cleaned dataset to Hugging Face. The project integrates with the LeRobot ecosystem from Hugging Face and supports the full lifecycle of robot data collection and preparation for training vision-language-action models.
How It Works
You find a project that helps you record robot movements with depth cameras and prepare the data for training robot brains.
You hook up an Orbbec depth camera to your robot arm and start recording yourself controlling it, capturing both video and depth information.
After several recording sessions, you merge all your video clips into one organized dataset with proper episode numbering.
An automated checker reviews every video frame and depth image, flagging any corrupted recordings or suspicious content for your review.
Episodes that passed the quality checks are marked for keeping.
Corrupted or low-quality episodes are marked for removal from your dataset.
The tool creates a fresh copy of your dataset with only the episodes you approved, re-encoding videos to remove unwanted segments.
Your cleaned robot training dataset is uploaded to Hugging Face, ready for anyone to download and use for training robot policies.
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.