on-policy: This repository implements MAPPO, a multi-agent variant of PPO that is widely used in cooperative multi-agent games and research. It provides robust implementations for multi-agent environments such as StarCraft II, Hanabi, and Google Research Football, along with detailed training scripts and hyperparameter guidance; mini-swe-agent: Mini-SWE-agent is a lightweight, 100-line AI agent designed to solve GitHub issues and more, offering a simplified yet performant alternative to larger coding agents. It focuses on minimalism, high performance on benchmarks like SWE-bench, and easy deployment across various environments.
Research and experimentation in cooperative multi-agent reinforcement learning
Researchers who need an agent without bloat for benchmarking, fine-tuning, or RL experiments