Tokenless is a local plugin for Claude Code that reduces AI costs by compressing the information sent to the AI while keeping all original data saved on your computer. When you read large files, run tests, or execute commands, Tokenless intercepts the output, saves the full version locally, and sends a compact summary to the AI instead. You can expand these summaries anytime you need the details. It also offers different response styles—chat mode for conversational replies or coding mode for dense technical output—so you control how verbose the AI responses are. Everything runs locally on your machine with no external services, and risky or failed outputs always pass through uncompressed.
How It Works
As you use Claude Code for bigger projects, you realize every file read, test result, and command output keeps getting sent along, making each request more expensive.
You discover a tool that promises to cut your AI costs by half while keeping all your work saved locally.
With just a few simple commands, Tokenless is installed and ready to work alongside Claude Code.
You keep coding normally, but now large outputs get compressed into summaries while the full details stay safe on your computer.
Short, readable responses in plain language for everyday questions and discussions.
Dense, technical output with abbreviations for efficient programming sessions.
When you need exact details from a compressed output, just ask to expand it and everything is there.
Your AI costs drop significantly while every piece of evidence from your work stays safely stored locally, ready whenever you need it.
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.