DCP Token Savings Report

66.9M

tokens saved via compound DCP effect

Without DCP you would have used 261.9M tokens instead of 195.0M.

34.3% saved · $200.68 at $3/M

33.1x compound multiplier iThe multiplier shows how much compound savings exceeds one-time compression. A 31.9x multiplier means each compressed token saves ~32 tokens total across all subsequent requests in that session.

73/100 sessions had DCP

Understanding This Report

One-Time Savings measures tokens removed during each compression event — counted once. If DCP compresses 40K tokens into a 1.5K summary, that's 38.5K tokens saved in that single event.

Compound Savings recognizes those 38.5K tokens are never re-sent on subsequent requests. In a session that continues for 149 more turns, the real savings is 38.5K × 149 = 5.7M tokens.

Example: Session A has 164 requests. DCP compresses 173K tokens into summaries across 6 blocks, starting around turn 15. Those 173K tokens are never re-sent on the remaining ~149 requests. One-time savings: 173K. Compound savings: 14.9M (86x multiplier).

The Multiplier (33.1x) means that on average, each compressed token saves ~33 tokens across all future requests in that session.

One-Time vs Compound Savings

One-Time Compression iSum of (compressedTokens - summaryTokens) across ALL compression events in ALL sessions. This is the traditional metric that only counts tokens removed once.

2.0M

Tokens removed during compression events — counted once.

Compound Savings (Real) iOne-time savings multiplied by the number of subsequent requests that benefit from each compression. This reflects the actual reduction in API token consumption.

66.9M

Tokens saved across ALL subsequent requests in each session.

Key Metrics

Tokens UsediTotal tokens (input + output + reasoning) consumed across all 100 sessions. This is what was actually sent/received after DCP compression.

195.0M

Tokens SavediCompound savings: one-time compressed tokens multiplied by subsequent requests that benefit. This is the real reduction in API consumption.

66.9M

Savings RateiCompound tokens saved divided by tokens used. Without DCP, you would have used 261.9M tokens total.

34.3%

USD Saved ($3/M)iEstimated cost savings at a blended rate of $3.00 per million input tokens. See sensitivity table below for other rates.

$200.68

Total RequestsiNumber of model inference calls (step-finish events) across all sessions. Each request benefits from prior DCP compressions.

4111

DCP BlocksiTotal compression events across all sessions. Each block compresses a range of conversation messages into a summary.

MultiplieriCompound savings divided by one-time savings. A 33.1x multiplier means each compressed token saves ~33 tokens on average across all subsequent requests.

33.1x

Sessions w/ DCPi73 of 100 sessions have DCP state files. Of the remaining 27: 14 were too short for DCP, 13 had enough activity (≥10 requests + ≥10.0K peak context) that DCP should have triggered but didn’t. DCP activates via a hard limit at 100.0K tokens or a soft nudge every 5 turns after turn 15.

73/100

Savings Rate

Top Sessions by Compound Savings

Per-Model Breakdown

USD Savings Sensitivity

Blended input-token pricing from fungies.io, awesomeagents.ai, pecollective.com, stochasticsandbox.com, tldl.io (2026).

Missed DCP Opportunities

Rate	Estimated USD Saved (Compound)
$2/M	$133.79
$3/M	$200.68
$5/M	$334.46

13 sessions had enough activity for DCP (≥10 requests + ≥10.0K peak context) but DCP didn’t trigger. These sessions used 7.3M tokens collectively. DCP was installed (files date to March 2026) but its soft nudge system (every 5 turns after turn 15) didn’t prompt compression — likely because context stayed under the 100.0K hard limit.

Insights

Session	Model	Requests	Peak Context	Tokens Used
ses_13ccfadedffe1qWO8Mwq...	glm-5.1	22	25.5K	971.9K
ses_14dcb2ab7ffepT2FuK4f...	glm-5.1	36	15.3K	919.8K
ses_14db006e1ffeclQALCSy...	glm-5.1	32	12.8K	773.1K
ses_146ac49acffepgQO1CA9...	glm-5.1	21	15.9K	723.2K
ses_13cd10d76ffe8vX2oexK...	glm-5.1	18	12.6K	678.5K
ses_138b4a6e6ffems9VY22j...	glm-5.2	15	12.5K	619.3K
ses_138a8cec6ffeadERV1g1...	gpt-5.4-mini	21	12.1K	600.1K
ses_14db10caeffeqkdOpW71...	minimax-m3	15	29.8K	516.7K
ses_143fe55a8ffeCicrkfWM...	gpt-5.4-mini	18	17.3K	475.9K
ses_13cd0336affeEhNmlmXN...	glm-5.1	17	13.3K	408.2K
ses_13cb3a911ffeexIyNSG5...	gpt-5.5	13	15.3K	226.2K
ses_139093da9ffezJ7JidV2...	glm-5.1	11	13.3K	194.7K
ses_13cd2771cffeZWtv15e5...	glm-5.1	10	13.4K	171.4K

13 session(s) missed DCP (≥10 requests and ≥10.0K peak context, but no compression triggered): 7.3M tokens consumed across these sessions. DCP was installed but its nudge system didn’t fire — likely because context stayed under the 100.0K hard limit and the soft nudge (every 5 turns after turn 15) didn’t prompt the model to compress.
5 model(s) never triggered DCP: gpt-5.5, deepseek-v4-flash, deepseek-v4-pro, north-mini-code-free, nex-n2-pro:free. These were used only on short sessions with small context windows.
Best compound multiplier: Session with 164 requests achieved 85.8x — 173.3K one-time compression amplified to 14.9M compound savings.
DCP activation: 73 of 100 sessions (73%) have DCP data. 41 sessions were too short to need it.
Compound effect is the story: One-time compression removed 2.0M tokens. Real API savings: 66.9M (33.1x more).
Average DCP blocks per active session: 0.8 blocks across 73 sessions.

Post-Ready Summary

DCP saved 66.9M tokens across 100 sessions — 34.3% of total usage, or $200.68 in API costs at $3/M.

Without DCP, total cost would be 261.9M tokens instead of 195.0M. Compound effect amplifies one-time compression by 33.1x.

73% session DCP activation (73/100 sessions); 14 too short, 13 missed (7.3M tokens at risk)
11 models tracked, 5 never triggered DCP
Top session: 14.9M compound on 7.6M used (85.8x)