AReaL: AReaL is an open-source, fully asynchronous reinforcement learning training system designed for large reasoning and agentic models. It offers exceptional flexibility, industry-leading speed, and scalability from a single node to over 1,000 GPUs, achieving state-of-the-art performance.; on-policy: This repository implements MAPPO, a multi-agent variant of PPO, widely used in cooperative multi-agent games and research. It provides robust implementations for various multi-agent environments like StarCraft II, Hanabi, and Google Research Football, along with detailed training scripts and hyperparameter guidance.
Training Reasoning Agents: Developing AI agents capable of complex mathematical, coding, and general reasoning tasks.
Research and experimentation in cooperative multi-agent reinforcement learning