AmmarkoV / SAM3DBody-cpp
PublicReal-time 3D full-body reconstruction from a single camera, C++ runtime, ONNX + ggml, 70-joint skeleton with hands.
This is a C++ implementation of a 3D body pose estimation system. It takes ordinary photos or videos as input and outputs detailed 3D information about where each person's body parts are positioned in space. The system uses a combination of AI models (including a vision transformer called DINO and a person detector called YOLO) to first locate people, then estimate their 3D pose including hand positions, facial orientation, and full body shape. Results can be viewed live, exported to spreadsheets, or integrated with robotics systems. The project runs entirely on your computer without needing to send data to external services.
How It Works
You capture a picture or video of yourself or someone else moving around.
The system automatically finds where each person is in the image and identifies their key body parts like head, hands, and feet.
The magic happens as the AI builds a complete 3D skeleton for each person, showing exactly how they're posed in real space.
See the 3D body model rendered directly on the video feed with body landmarks highlighted.
Save all 70 joint positions to a spreadsheet so you can analyze the motion later.
For researchers and robotics developers, the system publishes body pose data directly to ROS topics for real-world applications.
Whether for animation, health monitoring, or robotics, you end up with accurate 3D body pose data that's ready to use.
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.