Benchmark results

TokenWarden sends far less context to the model: a median of 116,570 tokens in the latest run, against 346,261 with no plugin.

Public benchmark snapshots compare opencode context usage with TokenWarden, with no plugin, and with other optimization plugins.

The latest published run includes 60 tests across Qwen 3.7 Max and Qwen 3.5 9B local models.

Median tokens sent: 116,570

TokenWarden across the latest benchmark run.

Median tokens kept out: 250,047

Compared with no optimization plugin.

Average tokens kept out: 358,326

Measured across nine latest-run samples.

Median reduction on core task: 93.54%

Best TokenWarden task result in the latest run.

Latest run

Overall token results.

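| Adapter | Runs | Median tokens | Average tokens | Median tokens kept out | Average tokens kept out |
| --- | --- | --- | --- | --- | --- |
| No plugins | 9 | 346,261 | 547,728 | 0 | 0 |
| TokenWarden | 9 | 116,570 | 189,402 | 250,047 | 358,326 |
| OpenSlimEdit | 9 | 257,426 | 303,820 | -6,288 | 243,908 |
| DCP | 9 | 241,324 | 281,644 | 150,174 | 266,084 |
| OpenRTK | 9 | 452,249 | 530,245 | -200,501 | 17,483 |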
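The "Average tokens kept out" column can be reproduced from the "Average tokens" column. The sketch below assumes the column is simply the no-plugin average minus the adapter's average; the page does not state the formula, and the function name is made up for illustration.

```python
# Average tokens per adapter, copied from the latest-run table.
average_tokens = {
    "No plugins": 547_728,
    "TokenWarden": 189_402,
    "OpenSlimEdit": 303_820,
    "DCP": 281_644,
    "OpenRTK": 530_245,
}


def average_kept_out(adapter: str, baseline: str = "No plugins") -> int:
    """Baseline average minus the adapter's average (assumed definition)."""
    return average_tokens[baseline] - average_tokens[adapter]


for name in average_tokens:
    print(f"{name}: {average_kept_out(name):,}")
```

Under that assumption every value in the column matches the table. The median column does not follow the same rule (346,261 minus 116,570 is 229,691, not 250,047), so it is presumably computed from per-sample differences rather than from the two medians; that reading is an inference, not something the page confirms.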

TokenWarden detail

Task-level reductions.

| Task | Median tokens | Average tokens | Median reduction |
| --- | --- | --- | --- |
| Routing ledger | 120,669 | 132,446 | 67.45% |
| Helper ledger | 67,724 | 186,143 | 71.67% |
| Core API ledger | 116,570 | 249,617 | 93.54% |
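As a consistency check, the task-level averages can be rolled up into the overall TokenWarden average. This sketch assumes each task contributed the same number of runs (three each out of the nine), which the page does not state; the variable names are illustrative.

```python
# Average tokens per task, copied from the TokenWarden detail table.
task_average_tokens = {
    "Routing ledger": 132_446,
    "Helper ledger": 186_143,
    "Core API ledger": 249_617,
}

# With equal run counts per task (an assumption), the overall average
# is the plain mean of the task averages.
overall_average = sum(task_average_tokens.values()) / len(task_average_tokens)

print(f"{overall_average:,.0f}")
```

The result, 189,402, agrees with the TokenWarden "Average tokens" figure in the overall results.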

Earlier run

Same pattern, smaller sample.

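| Adapter | Median tokens | Average tokens | Median tokens kept out |
| --- | --- | --- | --- |
| No plugins | 439,108 | 465,366 | 0 |
| TokenWarden | 83,104 | 77,064 | 376,028 |
| OpenSlimEdit | 262,551 | 239,407 | 176,557 |
| DCP | 212,342 | 209,273 | 226,766 |
| OpenRTK | 404,820 | 352,022 | 34,288 |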