houtini-lm: Houtini LM connects Claude Code to a local LLM server or any OpenAI-compatible API and offloads bounded tasks to it to reduce token costs. It provides delegation tools, performance tracking, and model routing. Claude remains the orchestrator for complex reasoning, while cheap local models handle grunt work; on-policy: This repository implements MAPPO, a multi-agent variant of PPO that is widely used in cooperative multi-agent games and research. It provides implementations for multi-agent environments such as StarCraft II, Hanabi, and Google Research Football, along with detailed training scripts and hyperparameter guidance.
Generate test stubs
Research and experimentation in cooperative multi-agent reinforcement learning