tajwarfahim / maxrl
Official Implementation of "Maximum Likelihood Reinforcement Learning (MaxRL)"
This repository implements research code for training AI models using a new reinforcement learning method that improves performance on math, mazes, and vision tasks without needing human feedback.
How It Works
You stumble upon this research project that teaches AI to solve puzzles and math by learning from its own tries, like practicing to get better.
Follow a few simple setup steps to prepare your computer, like creating a fresh environment and installing the required dependencies.
Download ready-made mazes, math problems, or picture sets to teach your AI what good solutions look like.
Hit start on a script, and watch your AI play games against itself, gradually solving tougher challenges.
Peek at charts showing improvement, adjust speeds if needed, and let it run multiple rounds.
Your model now handles mazes, math, and image tasks better than before, ready for real-world tests!