A Chinese-focused PyTorch framework for exploring Attention Residuals in Qwen3-style causal LMs, with baseline, Block AttnRes, Full AttnRes, training, evaluation, and visualization support.
This project experiments with a technique called Attention Residuals to train improved Chinese large language models from scratch, including tools for training, testing performance on Chinese benchmarks, and visualizing internal decisions.
How It Works
You hear about a smart way to help AI understand Chinese better by reusing its own thoughts during learning.
You grab the free tools and data to start experimenting with Chinese text.
Stick to the usual way to learn Chinese patterns.
Reuse thoughts in chunks for steady improvement.
Mix every past thought for the most thorough learning.
You feed it Chinese stories and watch it get smarter step by step.
You quiz it on Chinese questions to see its understanding and scores.
You view colorful maps showing which past thoughts it reuses most.
You now have a sharper AI for Chinese language tasks, ready to use or share.
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.