learningCatHD / telos-sdk
TELOS SDK: a cache-aware prompt protocol and gateway for portable agent context.
TELOS is an open-source tool developed by researchers at Tsinghua University that sits as a middle layer between AI coding assistants and the AI services they call. It distinguishes the parts of a conversation that stay the same (such as tool definitions and system instructions) from the parts that change every turn (such as timestamps and environment details). It then keeps the stable parts eligible for the AI service's cache, so that on each turn you pay full price mainly for the new information. The tool installs in seconds, auto-detects popular AI coding assistants, and shows a live dashboard of your savings in dollars. It supports multiple AI providers (Anthropic, OpenAI, DeepSeek) and inference frameworks (vLLM, SGLang). Its published benchmarks report a cost reduction of about 40% without degrading task accuracy.
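The stable-versus-changing split described above can be illustrated with a minimal sketch. This is not the TELOS API: the function names `build_prompt` and `prefix_key` are hypothetical, and the sketch only assumes that a provider-side prefix cache can reuse a prompt prefix when it is identical from one request to the next.

```python
import hashlib
import json


def build_prompt(system, tools, volatile, user_message):
    """Return (stable_prefix, volatile_suffix) for one turn.

    Hypothetical helper: stable content (system instructions, tool
    definitions) goes first, and per-turn content (timestamps,
    environment details, the user message) goes last.
    """
    # sort_keys gives a deterministic serialization, so the same
    # inputs always produce the same prefix text.
    stable_prefix = json.dumps({"system": system, "tools": tools}, sort_keys=True)
    volatile_suffix = json.dumps({"env": volatile, "user": user_message}, sort_keys=True)
    return stable_prefix, volatile_suffix


def prefix_key(stable_prefix):
    """Hash the stable prefix; equal hashes mean an identical prefix."""
    return hashlib.sha256(stable_prefix.encode("utf-8")).hexdigest()


if __name__ == "__main__":
    system = "You are a coding agent."
    tools = [{"name": "read_file", "description": "Read a file from disk"}]

    turn1, _ = build_prompt(system, tools, {"timestamp": "2024-01-01T00:00:00Z"}, "List the files.")
    turn2, _ = build_prompt(system, tools, {"timestamp": "2024-01-01T00:05:00Z"}, "Open main.py.")

    # The timestamp changed between turns, but the prefix did not.
    print(prefix_key(turn1) == prefix_key(turn2))
```

If the timestamp were embedded in the system instructions instead, the prefix would differ on every turn and a prefix cache would have nothing to reuse.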
How It Works
A colleague mentions TELOS at a team meeting: a tool that can cut your AI agent's running costs by up to 90% without changing how your agent works.
You run a simple install command, and TELOS sets itself up automatically on your computer.
TELOS scans your computer, automatically finds the AI coding assistants you already use (such as Claude Code or OpenClaw), and connects them all to its gateway.
The local gateway launches in the background, and your AI assistant is now routing through TELOS without you having to change a single line of code.
You open a dashboard in your browser that shows how much money you are saving in actual dollars, not vague percentages, with every conversation your AI has.
Over the next few weeks, your monthly AI costs shrink because TELOS makes sure your AI assistant's instructions are processed once and then served from the provider's cache, instead of being reprocessed at full price on every request.
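To make the savings in the last step concrete, here is a rough back-of-the-envelope estimate. It is a sketch under stated assumptions, not how the TELOS dashboard computes its figures: the function name `estimate_input_cost`, the token counts, the price per million tokens, and the cached-token price ratio are all hypothetical, and the model ignores output tokens, growing conversation history, and any cache-write surcharge a provider may apply.

```python
def estimate_input_cost(stable_tokens, volatile_tokens, turns,
                        price_per_mtok, cached_fraction=None):
    """Estimate input-token cost in dollars over a number of turns.

    cached_fraction is the assumed price of a cached token relative to
    a normal input token (for example 0.1); None means no caching.
    """
    per_token = price_per_mtok / 1_000_000
    full_turn = (stable_tokens + volatile_tokens) * per_token

    if cached_fraction is None or turns == 0:
        # Without caching, every turn pays full price for everything.
        return turns * full_turn

    # With caching, the first turn pays full price; later turns pay the
    # discounted rate for the stable part and full price for the rest.
    cached_turn = (stable_tokens * cached_fraction + volatile_tokens) * per_token
    return full_turn + (turns - 1) * cached_turn


if __name__ == "__main__":
    # Hypothetical numbers chosen only for illustration.
    baseline = estimate_input_cost(10_000, 500, 20, price_per_mtok=3.0)
    cached = estimate_input_cost(10_000, 500, 20, price_per_mtok=3.0,
                                 cached_fraction=0.1)
    print(f"without caching: ${baseline:.4f}")
    print(f"with caching:    ${cached:.4f}")
    print(f"saved:           {100 * (1 - cached / baseline):.1f}%")
```

The saving depends on how large the stable part is relative to the changing part and on the provider's cached-token rate, so actual results will differ from this illustration.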