on-policy: This repository implements MAPPO, a multi-agent variant of PPO, widely used in cooperative multi-agent games and research. It provides robust implementations for various multi-agent environments like StarCraft II, Hanabi, and Google Research Football, along with detailed training scripts and hyperparameter guidance.; codex-mcp-tool: This MCP server integrates Claude/Cursor with the Codex CLI, enhancing AI-powered code interactions. It enables advanced features like file analysis, multi-turn conversations, sandboxed code execution, and structured change management.
Research and experimentation in cooperative multi-agent reinforcement learning
Code Understanding: Explain the architecture of project source code.