on-policy: This repository implements MAPPO, a multi-agent variant of PPO, widely used in cooperative multi-agent games and research. It provides robust implementations for various multi-agent environments like StarCraft II, Hanabi, and Google Research Football, along with detailed training scripts and hyperparameter guidance.; maro: MARO is a Reinforcement Learning as a Service (RaaS) platform designed for real-world resource optimization across various industrial domains. It offers simulation, RL, and distributed toolkits to facilitate the development and deployment of complex optimization solutions.
Research and experimentation in cooperative multi-agent reinforcement learning
Container Inventory Management in logistics