arrowonstr / LLM-Handwritten-Template
Public包含了LLM的一些手撕代码,如强化学习。可以帮助从代码层面深入理解原理,以及有助于准备大模型面试可能出现的手撕。后续会更新Transformer等更多手撕
This repository offers easy and hard templates with mock setups and TODO exercises to hand-implement core RLHF algorithms like PPO, DPO, and GRPO for educational purposes.
How It Works
You find a helpful GitHub collection of practice kits to learn how to train AI assistants to be more helpful and safe by building key techniques yourself.
Download the project, install the simple building block it needs, and run a quick check to see everything lights up green and ready to go.
Follow clear step-by-step nudges to fill in the blanks and test often.
Dive into formulas and data flows to piece it together on your own.
Tackle the guided exercises one at a time, watching your additions make the pieces click together perfectly with each test run.
Launch complete sessions for different training styles and watch your AI practice responding better over time.
Check how your custom-trained AI now prefers great answers over poor ones, proving your work pays off.
Celebrate understanding the secrets of making AI assistants smarter and kinder through hands-on building.
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.