EIT-EAST-Lab / C3
PublicOfficial implementation of the paper "Contextual Counterfactual Credit Assignment for Multi-Agent Reinforcement Learning in LLM Collaboration". (by Yanjun Chen)
This repository is a research codebase implementing Contextual Counterfactual Credit Assignment (C3) for improving multi-agent reinforcement learning in collaborative LLM systems on math and coding benchmarks.
How It Works
You find this exciting project that helps AI teams work better together on tough math and coding problems, with a clear guide and paper to read.
Follow simple steps to download and set up the tools, data, and smart models needed for your experiments.
Run a quick check to see your AI team chatting and solving problems correctly right away.
Launch the training so your AI agents learn to collaborate smarter on math and code challenges.
Review charts and numbers showing how much better your AI team performs.
Celebrate as your collaborative AI agents solve problems more accurately and efficiently than before!
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.