A research toolkit for training small language models from scratch, running standardized benchmarks, and comparing performance across experiments.
How It Works
You find this handy toolkit on GitHub while looking for ways to experiment with AI language models.
You read the clear guide and get everything ready on your computer in just a few minutes.
You download ready-to-use text data so your AI has plenty to learn from.
You start the training with one command and watch as your custom language model grows smarter step by step.
You run exciting quizzes like science questions and common sense puzzles to see how well it performs.
You see charts comparing your results to others and spot improvements right away.
Your trained model shines on benchmarks, complete with plots and reports for your experiments.
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.