lm
Offload Tasks from Claude to your Local LLM With Houtini-LM - uses OpenAPI for LM Studio and Ollama Compatibility. Save tokens by offloading some grunt work for your API - our tool description helps claude decide what work to assign and why.
Houtini LM connects Claude Code to a local LLM, offloading bounded tasks like boilerplate generation, code review, and commit messages to a free, private local model, while Claude handles complex reasoning. It tracks token savings and supports various local LLM backends.