on-policy: This repository implements MAPPO, a multi-agent variant of PPO, widely used in cooperative multi-agent games and research. It provides robust implementations for various multi-agent environments like StarCraft II, Hanabi, and Google Research Football, along with detailed training scripts and hyperparameter guidance.; youtu-agent: Youtu-Agent is a flexible, high-performance framework for building, running, and evaluating autonomous agents, excelling in tasks like data analysis and deep research using open-source models. It features automated agent generation, continuous experience learning via Training-Free GRPO, and scalable end-to-end reinforcement learning capabilities.
Research and experimentation in cooperative multi-agent reinforcement learning
Data analysis and report generation.