ZJU-OmniAI / GFT
PublicGFT: From Imitation to Reward Fine-Tuning with Unbiased Group Advantages and Dynamic Coefficient Rectification
GFT is an open-source framework that trains large language models to excel at math reasoning by combining imitation learning with reinforcement techniques in a single efficient stage.
How It Works
You find this project on GitHub while looking for ways to make AI better at solving math problems, and get excited about its simple training method.
You explore the paper and instructions, learning how it smartly mixes copying good answers with exploring new ideas to teach AI math reasoning.
You install the easy training tools on your powerful computer setup, following the quick steps.
You download ready-made collections of math questions and answers to use for training.
You launch the training with a click, watching your AI learn from groups of answers and get smarter at math.
Your AI now solves tougher math puzzles faster and more accurately, ready for real use!
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.