Flash Compact: 33,000 tok/sec Context Compaction

Flash Compact drops 50-70% of an agent's context at 33,000+ tokens/second while keeping every surviving line verbatim. It has two modes: objective compaction strips filler with no guidance, while query-based compaction weights keep/drop decisions against what the agent needs next.

Tejas Bhakta
March 7, 2026 · 10 min read