pyshka501 / rl-textbook
PublicReinforcement Learning: From Bandits to LLM Alignment — Open textbook with 17 chapters, Colab notebooks, and exercises
An open-source textbook bridging classical reinforcement learning theory with modern language model alignment, featuring interactive notebooks, exercises with solutions, and multilingual translation plans.
How It Works
You hear about a free online guide that teaches how smart computers learn by trial and error, from simple games to advanced AI chatbots.
Download the easy-to-read PDF to start exploring chapters at your own pace from your couch.
Click a fun badge to instantly run hands-on demos in your web browser, watching ideas come alive without any setup.
Tackle challenges with ready answers to test your understanding and build confidence.
Check out planned translations if you prefer reading in your own language.
You've gone from newbie to expert, ready to apply these powerful ideas to real-world AI projects!
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.