[ArXiv 2026] Text-Guided 6D Object Pose Rearrangement via Closed-Loop VLM Agents
A research tool that uses AI to interpret text prompts and iteratively adjust the 3D poses of objects in mesh-based scenes for tasks like pouring or chess moves.
How It Works
You stumble upon a fascinating project that uses AI to rearrange objects in 3D scenes based on simple text descriptions, like pouring tea or moving chess pieces.
You get your computer ready by installing a few helper programs and creating a workspace for the magic to happen.
You place 3D model files of objects, like a table with teacups or a chessboard, into a folder to create your starting scene.
You link up a smart AI service that can look at pictures and understand your words to guide the rearrangements.
You type a clear instruction, such as 'Pour the tea into the teacup' or 'Move the knight to f6', telling the AI exactly what to do.
The AI selects the right object, tries different views, and step by step moves and rotates it to match your description perfectly.
You get a folder full of images and updated 3D files showing your scene transformed just as you imagined, ready to explore.
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.