on-policy: This repository implements MAPPO, a multi-agent variant of PPO that is widely used in cooperative multi-agent games and research. It provides implementations for multi-agent environments such as StarCraft II, Hanabi, and Google Research Football, along with training scripts and hyperparameter guidance.
gym-pybullet-drones: A minimalist refactoring of the original gym-pybullet-drones repository that provides a Gym environment for simulating multi-agent quadcopter control. It is designed for compatibility with Gymnasium, Stable Baselines3 2.0, and software-in-the-loop (SITL) simulation with flight firmware such as Betaflight and crazyflie-firmware.
Research and experimentation in cooperative multi-agent reinforcement learning
Developing and evaluating PID controllers for quadcopter flight
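
To make the PID use case concrete, here is a minimal sketch of a textbook PID loop holding the altitude of a point mass under gravity. It deliberately does not use the gym-pybullet-drones API: the `PID` class, the `simulate_altitude` helper, the gains, and the one-dimensional dynamics are all assumptions chosen for illustration, not the repository's controllers or physics.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class PID:
    """Textbook PID controller with a fixed time step (hypothetical helper)."""
    kp: float
    ki: float
    kd: float
    dt: float
    integral: float = 0.0
    prev_error: Optional[float] = None

    def update(self, setpoint: float, measurement: float) -> float:
        error = setpoint - measurement
        self.integral += error * self.dt
        # Skip the derivative on the first call to avoid a spurious kick.
        if self.prev_error is None:
            derivative = 0.0
        else:
            derivative = (error - self.prev_error) / self.dt
        self.prev_error = error
        return self.kp * error + self.ki * self.integral + self.kd * derivative


def simulate_altitude(pid: PID, target: float = 1.0, steps: int = 2000,
                      mass: float = 1.0, g: float = 9.81) -> float:
    """Simulate a 1-D point mass (assumed model) and return the final altitude."""
    z, vz = 0.0, 0.0
    for _ in range(steps):
        thrust = pid.update(target, z)   # no feedforward: the integral term absorbs gravity
        az = thrust / mass - g           # vertical acceleration
        vz += az * pid.dt                # Euler integration of velocity
        z += vz * pid.dt                 # then of position
    return z


if __name__ == "__main__":
    # Gains place the three continuous-time closed-loop poles at s = -2 for unit mass.
    controller = PID(kp=12.0, ki=8.0, kd=6.0, dt=0.01)
    print(f"final altitude: {simulate_altitude(controller):.3f} m")
```

The thrust is left unclamped to keep the sketch short; a real quadcopter controller would saturate motor commands and usually add anti-windup on the integral term.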