ww-w-ai/claude-code-token-saver

21 stars · Last commit 2026-06-02

45% cost reduction measured. The only Claude Code plugin built from CC source analysis — cache expiry prevention, SubTask auto-delegation, zero-cost context restoration, real-time dashboard. Max Plan + API pay-per-use.


# claude-code-token-saver

**The only Claude Code plugin built from an analysis of Claude Code's (CC's) source code: it finds where your tokens go and fixes the waste automatically. Spend less, code longer.**

> Measured result: **45% cost reduction** on a real workload, from $326/day to $180/day. Cache expiry prevention, automatic SubTask delegation, zero-cost context restoration, and a full analytics dashboard, all in one install with zero config.
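
The quoted percentage can be checked directly from the two daily figures above. The snippet below is only an arithmetic check; the function name `cost_reduction` is made up for this illustration and is not part of the plugin.

```python
def cost_reduction(before: float, after: float) -> float:
    """Return the fractional reduction going from `before` to `after`."""
    if before <= 0:
        raise ValueError("before must be positive")
    return (before - after) / before


# Daily costs quoted in the measured result above.
reduction = cost_reduction(326.0, 180.0)
print(f"{reduction:.1%}")  # prints 44.8%, which rounds to the quoted 45%
```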

Works with **Max Plan ($200/mo)** and **API pay-per-use**: same plugin, same features. Every user benefits, and the savings matter most when every token is real money.

![Usage dashboard — see exactly where your tokens go](docs/images/usage-view-overview.png)

### What it does in 30 seconds

| Feature | What happens | Impact |
| ------- | ------------ | ------ |
| 🛡️ Token Guardian | Detects cache expiry, blocks $9 re-sends before they happen | Prevents the #1 silent cost spike |
| 🧠 Session Architect | Auto-delegates to SubTasks (37.5% cheaper) + parallelizes tool calls to cut round-trips | Context stays small, round-trips drop, costs compound down |
| 🪶 Concise Mode | Decision-focused responses: essential context + choices, nothing else | Fewer output tokens, faster user decisions |
| 🔄 /continue | Replaces /compact — zero LLM calls, zero cost, zero info loss | Free context restoration |
| 📊 Status Line | Real-time cost, context size, rate limit — under 50ms | See problems before they cost you |
| 📈 /usage-view | Interactive HTML dashboard with AI-powered analysis | Full cost forensics in one click |
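
To make the Token Guardian row concrete, here is a minimal sketch of what a cache expiry check could look like. It is not the plugin's code: the function names, the 5-minute cache lifetime, and the per-million-token prices passed in are all assumptions chosen for illustration.

```python
from datetime import datetime, timedelta

# Assumption for illustration: the prompt cache expires 5 minutes after
# the last request. This value is not taken from the plugin.
ASSUMED_CACHE_TTL = timedelta(minutes=5)


def cache_likely_expired(
    last_request: datetime,
    now: datetime,
    ttl: timedelta = ASSUMED_CACHE_TTL,
) -> bool:
    """Return True if the idle time since the last request reaches the TTL."""
    return now - last_request >= ttl


def resend_cost_usd(
    context_tokens: int,
    uncached_price_per_mtok: float,
    cached_price_per_mtok: float,
) -> float:
    """Estimate the extra cost of sending the context uncached.

    Both prices are hypothetical inputs in USD per million tokens.
    """
    price_gap = uncached_price_per_mtok - cached_price_per_mtok
    return context_tokens / 1_000_000 * price_gap
```

A guard built this way would warn before the next prompt whenever `cache_likely_expired(...)` is true and the estimated `resend_cost_usd(...)` is above a threshold the user cares about.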
