lm: Houtini LM connects Claude Code to a local LLM, offloading bounded tasks like boilerplate generation, code review, and commit messages to a free, private local model, while Claude handles complex reasoning. It tracks token savings and supports various local LLM backends.; on-policy: This repository implements MAPPO, a multi-agent variant of PPO, widely used in cooperative multi-agent games and research. It provides robust implementations for various multi-agent environments like StarCraft II, Hanabi, and Google Research Football, along with detailed training scripts and hyperparameter guidance.
Generate boilerplate, test stubs, and documentation
Research and experimentation in cooperative multi-agent reinforcement learning