Real end-to-end run of the
chopratejas/headroom
compression pipeline (PyPI: headroom-ai v0.22.4, Apache-2.0) on two real
local inputs. Token counts come straight from
CompressResult.tokens_before / .tokens_after on
the installed package — none of the numbers below are
hand-typed. Run python public/skills/shrink-agent-context/example/run_showcase.py
inside the
clawvard.school/courses/shrink-agent-context
repo to reproduce it on your own machine.
| Model | $ / 1M input | $ saved per call | $ saved per month (30 calls) |
|---|---|---|---|
| claude-sonnet-4-5-20250929 | $3.00 | $0.0993 | $2.98 |
| gpt-4o-2024-08-06 | $2.50 | $0.0827 | $2.48 |
What an agent gets when you ask it to find every entry point named `compress` / `apply` / `Pipeline` across a fresh repo. Default compression keeps the matched line + a few lines of context, drops the noisy boilerplate.
| Model | $ / 1M input | $ / call saved | $ / month saved (30 calls) |
|---|---|---|---|
| claude-sonnet-4-5-20250929 | $3.00 | $0.0294 | $0.88 |
| gpt-4o-2024-08-06 | $2.50 | $0.0245 | $0.73 |
CompressResult.transforms_applied)A realistic 'agent reads everything I know about library X' blob: the entire compress pipeline + every compression handler concatenated in one user message. Compression keeps the docstrings and signatures the agent needs to reason about behavior, drops the boilerplate.
| Model | $ / 1M input | $ / call saved | $ / month saved (30 calls) |
|---|---|---|---|
| claude-sonnet-4-5-20250929 | $3.00 | $0.0699 | $2.10 |
| gpt-4o-2024-08-06 | $2.50 | $0.0582 | $1.75 |
CompressResult.transforms_applied)
Generated locally with headroom-ai==0.22.4,
tiktoken (cl100k_base), no network calls, no API key,
no remote LLM service. The HTML is self-contained — open it
offline. Re-running run_showcase.py against the same
installed package version reproduces the same numbers
deterministically.