This repository contains lecture notes, practical materials, and implementations for the course: "Reinforcement Learning: from Bandits to RLHF" The course is designed to provide a deep and systematic understanding of RL, combining: solid mathematical foundations intuitive explanations practical implementations modern research insights
This repository provides materials for an academic course on reinforcement learning, progressing from basic bandit problems to advanced techniques like RLHF for large language models.
How It Works
You stumble upon this collection of lessons about teaching computers to make smart choices, like in games or recommendations.
You grab the downloadable notes and video recordings to start learning at your own pace.
You hop into the friendly chat groups to connect with others and get help from the teacher.
You dive into the step-by-step videos that explain ideas from simple choices to advanced AI training, feeling the concepts click.
You open the interactive practice files to build and test your own smart decision-makers right on your computer.
You now understand how to create AI that learns from trial and error, ready to tackle real-world problems or read cutting-edge research.
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.