headroom
Active·★ 2.1k·Apache-2.0·Updated 2026-05-30
Compress tool outputs, logs, files, and RAG chunks before they reach the LLM. 60-95% fewer tokens, same answers. Library, proxy, MCP server.
Headroom is a context compression layer designed for AI agents and LLMs, significantly reducing token usage (60-95% fewer tokens) by compressing tool outputs, logs, RAG chunks, files, and conversation history. It operates locally and reversibly, ensuring data privacy and the ability to retrieve original content on demand.
#AI Agents#LLM Optimization#Context Compression#Token Efficiency#Reversible Compression