Quotient Labs
The Margin
Notes on context compression, benchmarking, and building Fermat.
Benchmarking Context Optimization
How Fermat's idle compression works — and how we verify it preserves the working state an agent actually needs.
Jun 16, 2026 · 8 min read
Compressing Prose
Fermat's prose compression model trims low-value tokens from text-heavy turns at write time. Here, we detail how we benchmark it.
Jun 14, 2026 · 5 min read