Python SDK for Stera: Record, Process, Evaluate, and Export Multimodal Data
Stera SDK is a Python software toolkit developed by FPV Labs for processing first-person video recordings. It loads video files containing RGB video, depth maps, camera positions, and sensor data, then runs AI models to track hands, blur faces for privacy, and estimate body pose. The SDK includes visualization tools using Rerun for interactive 3D exploration and generates quality evaluation reports. Finally, it exports everything into organized datasets (video, meshes, annotations, calibrations) ready for machine learning workflows in robotics and AI research. The project is Apache 2.0 licensed and available on PyPI.
How It Works
You capture video footage using the Stera mobile app on your device, collecting rich data with camera motion, hand movements, and 3D depth information.
You open your video file with the SDK, and it automatically organizes all the data streams—RGB video, depth maps, camera positions, and sensor readings—into one tidy structure.
The system runs a hand-tracking model that identifies all 21 joints of each hand in every frame, giving you precise 3D positions of fingers and wrists in real space.
When privacy matters, an AI face detector finds and blurs any people in the footage with a single command, keeping everything clean and professional.
Watch your recorded journey unfold as a 3D visualization with your camera position, hand movements, and the environment mesh all together.
Get an interactive web page showing statistics on data quality, hand detection rates, camera smoothness, and any issues to watch for.
With one command, all your data exports into a neat folder containing video, 3D mesh, hand tracking annotations, and calibration details—ready for machine learning pipelines.
You have clean, annotated first-person video data structured perfectly for training embodied AI systems, robotics models, or vision-language systems.
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.