ww-w-ai/claude-code-token-saver
21 stars · Last commit 2026-06-02
45% cost reduction measured. The only Claude Code plugin built from CC source analysis — cache expiry prevention, SubTask auto-delegation, zero-cost context restoration, real-time dashboard. Max Plan + API pay-per-use.
README preview
# claude-code-token-saver **The only Claude Code plugin that actually reads CC's source code to find where your tokens go — and fixes it automatically. Spend less, code longer.** > Measured result: **45% cost reduction** on a real $326/day workload → $180/day. Cache expiry prevention, automatic SubTask delegation, zero-cost context restoration, and a full analytics dashboard — in one install, zero config. Works with **Max Plan ($200/mo)** and **API pay-per-use**. Same plugin, same features. Stronger for every user — especially when every token is real money.  ### What it does in 30 seconds | Feature | What happens | Impact | | ------- | ------------ | ------ | | 🛡️ Token Guardian | Detects cache expiry, blocks $9 re-sends before they happen | Prevents the #1 silent cost spike | | 🧠 Session Architect | Auto-delegates to SubTasks (37.5% cheaper) + parallelizes tool calls to cut round-trips | Context stays small, round-trips drop, costs compound down | | 🪶 Concise Mode | Decision-focused responses: essential context + choices, nothing else | Fewer output tokens, faster user decisions | | 🔄 /continue | Replaces /compact — zero LLM calls, zero cost, zero info loss | Free context restoration | | 📊 Status Line | Real-time cost, context size, rate limit — under 50ms | See problems before they cost you | | 📈 /usage-view | Interactive HTML dashboard with AI-powered analysis | Full cost forensics in one click |