
| Strategy | What it does for Claude Code | Default | Customer-traffic average |
|---|---|---|---|
| Tool Result | Trims tool_result payloads (file reads, grep, shell, API responses) before they reach Claude. Lossless. | ✅ on | −19% |
| Tool Surface (alpha) | Drops MCP servers, skills, and tools irrelevant to the current task before the request hits Claude. | ⚠️ opt-in | ~−25% projected |
| Output | Reduces verbosity of model responses without losing technical content. Same answer, fewer tokens. | ⚪ opt-in | −6.5% when enabled |
Tool Result Trimming
tool_result_trimming filters the tool-result content Claude Code sends back to itself: file contents, grep and search outputs, shell stdout/stderr, API responses, database query results. Lossless on tool_result payloads — Claude receives the same technical content with redundant framing removed. User messages and assistant turns are not modified.
→ Full strategy reference: Token Compression / Tool Result Trimming.
Tool Surface Reduction (alpha)
tool_surface_reduction strips out the MCP servers, skills, and tools Claude Code wouldn’t use for the current task. Claude still sees a coherent tools list — just a curated, task-relevant subset. The IDE still exposes everything; nothing changes for the developer’s setup.
→ Full strategy reference: Token Compression / Tool Surface Reduction.
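To make the idea of a curated, task-relevant subset concrete, here is a minimal sketch of filtering a tool list before a request is sent. It is an illustration only: the `Tool` type, the `reduce_tool_surface` function, and the keyword-overlap rule are all hypothetical, and none of it describes how Edgee actually decides relevance.

```python
from dataclasses import dataclass

# Hypothetical stopword list so that filler words do not count as matches.
STOPWORDS = {"a", "an", "the", "to", "from", "of"}


@dataclass(frozen=True)
class Tool:
    """Hypothetical stand-in for a tool definition (name plus description)."""
    name: str
    description: str


def words(text: str) -> set[str]:
    """Lowercase, split on whitespace and underscores, drop stopwords."""
    return set(text.lower().replace("_", " ").split()) - STOPWORDS


def reduce_tool_surface(tools: list[Tool], task: str) -> list[Tool]:
    """Keep only tools that share at least one keyword with the task.

    Assumed relevance rule for illustration; a real system would likely
    use something more robust than keyword overlap.
    """
    task_words = words(task)
    return [
        tool for tool in tools
        if task_words & words(f"{tool.name} {tool.description}")
    ]


tools = [
    Tool("read_file", "read a file from the workspace"),
    Tool("run_sql", "execute a database query"),
    Tool("send_email", "deliver a message to a recipient"),
]
subset = reduce_tool_surface(tools, "read the config file")
print([t.name for t in subset])  # ['read_file']
```

The full `tools` list is left untouched, mirroring the point above that the developer's setup still exposes everything while the model sees only the subset.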
Output Brevity (by Caveman)
output_brevity reduces the verbosity of Claude’s responses. Three levels are available (light, medium, hard). It is off by default for Claude Code sessions because output is a small share (~1%) of total token volume; turn it on if your Claude Code workflow leans heavily on long-form responses.
→ Full strategy reference: Token Compression / Output Brevity.
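A quick back-of-the-envelope calculation shows why a small output share limits the overall effect. The sketch assumes the −6.5% figure from the table applies to output tokens only, which the source does not state explicitly; the 20% share in the second call is a hypothetical workload, not a measured one.

```python
OUTPUT_SHARE = 0.01       # from the text: output is ~1% of total token volume
OUTPUT_REDUCTION = 0.065  # assumption: the -6.5% figure applies to output tokens only


def overall_reduction(output_share: float, output_reduction: float) -> float:
    """Fraction of total tokens saved when only output tokens are compressed."""
    return output_share * output_reduction


typical = overall_reduction(OUTPUT_SHARE, OUTPUT_REDUCTION)
long_form = overall_reduction(0.20, OUTPUT_REDUCTION)  # hypothetical output-heavy workflow

print(f"typical session:   {typical:.3%} of total tokens")
print(f"long-form session: {long_form:.3%} of total tokens")
```

Under these assumptions the saving scales linearly with the output share, which is why the strategy matters mainly for long-form workflows.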
Receipts
26.2% more instructions completed on the same Claude Pro plan. 20.8% more efficient per instruction. 5.1% cheaper per task on a cost-adjusted basis.
Source: edgee-ai/claude-compression-lab · Endurance challenge writeup
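The first two figures fit together arithmetically. One reading (an interpretation, not something the source spells out) is that completing 26.2% more instructions on a fixed plan budget means each instruction consumes 1/1.262 of what it did before, which is about 20.8% less:

```python
MORE_INSTRUCTIONS = 0.262  # from the text: +26.2% instructions on the same plan

# With a fixed budget, per-instruction consumption scales as 1 / (1 + gain).
relative_cost_per_instruction = 1 / (1 + MORE_INSTRUCTIONS)
saving_per_instruction = 1 - relative_cost_per_instruction

print(f"{saving_per_instruction:.1%}")  # 20.8%
```

The 5.1% cost-adjusted figure depends on pricing details that are not given here, so it is not derived in this sketch.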
Get started
The fastest path is the Edgee CLI. tool_result_trimming is on by default; the other two strategies are opt-in toggles in the Console.
- macOS / Linux
- Homebrew
- Windows (PowerShell)
Full CLI guide
Install the CLI, authenticate, and launch Claude Code in under a minute.
Toggling individual strategies
In the Edgee Console, open Dashboard and manage your Claude Code settings right from the UI.
- Enable tool_surface_reduction to opt into the alpha tool-surface compression.
- Enable output_brevity if your Claude Code workflow produces long-form output worth tightening.
- Disable tool_result_trimming only when you want to compare against an uncompressed baseline.
Manual setup (advanced)
If you prefer not to use the CLI, configure Claude Code to route through Edgee — see Manual setup — then enable the strategies you want from the Edge Models section of the Console.
Lossiness
tool_result_trimming is lossless on tool_result payloads.
tool_surface_reduction is lossless from Claude’s perspective of available tools: Claude still sees a usable subset.
output_brevity is not lossless on the prose dimension: it intentionally compresses prose verbosity.
Next
Token Compression
Deep dive on each strategy.
Codex Compression
Same three strategies, tuned for Codex.