Proximity-based Multi-turn Optimization (ProxMO) - Official Implementation
ProxMO is a lightweight framework that improves the training of language model agents on multi-turn tasks by distributing reward more effectively across the steps of an episode.
How It Works
ProxMO trains AI agents to complete multi-step tasks such as tidying rooms or shopping online.
Set up simulated environments where the agent can practice these tasks safely and as many times as needed.
Choose the tasks to train on, for example cleaning, heating food, or finding items.
Start training; the agent improves with each attempt, including on long action sequences that are hard for other methods.
Review the evaluation scores, which the project reports as large gains over well-known models such as GPT-4 on every key measure.
The result is an agent that completes multi-step tasks more reliably than the baselines it is compared against.
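To make the idea of sharing reward across steps concrete, here is a minimal sketch of one generic way to do it: discounted return-to-go, which passes a late reward back to the earlier steps that led to it. This is not ProxMO's algorithm; the function name share_reward_across_steps and the gamma parameter are assumptions made for illustration only.

```python
from typing import List


def share_reward_across_steps(step_rewards: List[float], gamma: float = 0.9) -> List[float]:
    """Hypothetical helper: compute the discounted return-to-go for each step.

    Illustrates generic credit sharing in a multi-turn episode.
    It is NOT the ProxMO method, only a standard baseline technique.
    """
    credits = [0.0] * len(step_rewards)
    running = 0.0
    # Walk backwards so each step receives its own reward plus a
    # discounted share of everything that happened after it.
    for i in range(len(step_rewards) - 1, -1, -1):
        running = step_rewards[i] + gamma * running
        credits[i] = running
    return credits


# A three-step episode where only the final step is rewarded.
episode = [0.0, 0.0, 1.0]
credits = share_reward_across_steps(episode, gamma=0.5)
print(credits)  # [0.25, 0.5, 1.0]
```

With a sparse reward at the end of the episode, earlier steps still receive a nonzero training signal, and steps closer to the reward receive more of it.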