poetrywanderer / RL-Projects
This includes experimental RL projects on LLM, VLM, and generative tasks.
A collection of academic research projects exploring how AI models can improve themselves through reinforcement learning and knowledge distillation, covering text reasoning, visual understanding, and image generation tasks.
How It Works
A researcher learns about this open-source project exploring how AI models can teach themselves new skills through practice and feedback.
The project offers three ways to explore AI learning: teaching language models math skills, helping vision models read diagrams, and teaching image generators to write readable text.
Teach a language model to solve arithmetic puzzles by practicing and improving step-by-step
Help a vision model learn to read and reason about geometric diagrams
Train an image creator to render clear, readable text in pictures
You prepare your computer with the necessary tools and download the pre-trained models to get started.
The AI model practices on thousands of examples, receiving feedback on its performance and gradually improving its abilities.
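The practice-and-feedback loop described above can be illustrated with a toy sketch. This is not the repository's training code: it assumes a made-up task with four candidate answers, one of which is correct, and uses a basic REINFORCE-style update with a running-average baseline. The names `train`, `correct_index`, and all hyperparameters are hypothetical and chosen only for illustration.

```python
import math
import random


def softmax(logits):
    """Convert raw scores into a probability distribution."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def train(correct_index=2, num_actions=4, steps=2000, lr=0.1, seed=0):
    """Toy policy-gradient loop (hypothetical, not the project's method).

    The "model" is a table of logits over candidate answers. On each step it
    samples an answer, receives a reward of 1.0 if the answer is correct and
    0.0 otherwise, and nudges its logits toward better-than-average answers.
    """
    rng = random.Random(seed)
    logits = [0.0] * num_actions
    baseline = 0.0  # running average of past rewards

    for _ in range(steps):
        probs = softmax(logits)
        action = rng.choices(range(num_actions), weights=probs)[0]

        # Feedback: did the sampled answer match the correct one?
        reward = 1.0 if action == correct_index else 0.0
        advantage = reward - baseline
        baseline += 0.05 * (reward - baseline)

        # REINFORCE update: gradient of log pi(action) w.r.t. each logit.
        for a in range(num_actions):
            grad = (1.0 if a == action else 0.0) - probs[a]
            logits[a] += lr * advantage * grad

    return softmax(logits)


final_probs = train()
```

After training, `final_probs` should place most of its probability mass on the correct answer, which mirrors the gradual improvement the step describes. Real LLM, VLM, or image-generation training replaces the logit table with a neural network and the 0/1 reward with a task-specific score.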
You observe training curves showing how the model improves over time, from barely solving problems to achieving high accuracy.
The trained model demonstrates its new capabilities, whether that's solving math problems, reading diagrams accurately, or generating images with clear text.