on-policy
Active·★ 2.0k·MIT·Updated 2024-07-18
★ Trending · ★ Reinforcement Learning · ★ Multi-Agent AI
This is the official implementation of Multi-Agent PPO (MAPPO).
MAPPO is a multi-agent variant of PPO that is widely used in cooperative multi-agent games and research. The repository covers several multi-agent environments, including StarCraft II, Hanabi, and Google Research Football, and ships training scripts and hyperparameter guidance for them.
#Multi-Agent Reinforcement Learning · #PPO · #MAPPO · #Reinforcement Learning · #PyTorch
01
Features
01 Implementation of MAPPO (Multi-Agent PPO)
02 Support for diverse multi-agent environments (e.g., StarCraft II, Hanabi)
03 Ready-to-use training scripts for various scenarios
04 Detailed hyperparameter guidance and updated results
05 Default support for a shared policy among agents
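With a shared policy, samples from every agent can be pooled into one batch and scored with a single PPO clipped surrogate. The following is a minimal plain-Python sketch of that idea, not the repository's code: the names `ppo_clip_loss` and `flatten_agents` and the `clip_eps=0.2` default are assumptions made for illustration, and the value-function and entropy terms of a full PPO loss are omitted.

```python
import math


def ppo_clip_loss(new_logps, old_logps, advantages, clip_eps=0.2):
    """Negative mean of the PPO clipped surrogate over a flat batch.

    Each element is one (agent, timestep) sample: the log-probability of the
    taken action under the current and the old policy, plus its advantage.
    """
    total = 0.0
    for new_lp, old_lp, adv in zip(new_logps, old_logps, advantages):
        ratio = math.exp(new_lp - old_lp)  # pi_new(a|o) / pi_old(a|o)
        clipped = max(1.0 - clip_eps, min(1.0 + clip_eps, ratio))
        # Take the more pessimistic of the unclipped and clipped objectives.
        total += min(ratio * adv, clipped * adv)
    return -total / len(new_logps)


def flatten_agents(per_agent):
    """Pool per-agent sample lists into one batch (hypothetical helper)."""
    return [x for agent_samples in per_agent for x in agent_samples]


# Two agents with two samples each; the policy is unchanged, so every ratio is 1.
old_logps = [[-1.0, -0.5], [-0.7, -1.2]]
new_logps = [[-1.0, -0.5], [-0.7, -1.2]]
advantages = [[1.0, -1.0], [0.5, 0.5]]

loss = ppo_clip_loss(
    flatten_agents(new_logps),
    flatten_agents(old_logps),
    flatten_agents(advantages),
)
print(loss)
```

Because one set of parameters serves all agents in this sketch, the loss is averaged over the pooled batch rather than computed per agent; a separate-policy setup would instead call `ppo_clip_loss` once per agent on that agent's samples only.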
02
Compatibility
StarCraftII (SMAC) · Native · Verified via docs
Hanabi · Native · Verified via docs
Multiagent Particle-World Environments (MPEs) · Native · Verified via docs
Google Research Football (GRF) · Native · Verified via docs
StarCraftII (SMAC) v2 · Native · Verified via docs
03
Quick start
$ pip install -e .
04
Use cases
↳ Research and experimentation in cooperative multi-agent reinforcement learning
↳ Benchmarking and evaluating PPO's effectiveness in MARL scenarios
↳ Training AI agents for popular multi-agent games like StarCraft II and Hanabi
05
Alternatives
MetaGPT ★ 68.4k
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
cua ★ 17.3k
Open-source infrastructure for Computer-Use Agents. Sandboxes, SDKs, and benchmarks to train and evaluate AI agents that can control full desktops (macOS, Linux, Windows).
ir-sim ★ 1.1k
A Python-based lightweight robot simulator designed for navigation, control, and reinforcement learning
verl-agent ★ 1.9k
verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for the paper "Group-in-Group Policy Optimization for LLM Agent Training"
maro ★ 919
The Multi-Agent Resource Optimization (MARO) platform is an instance of Reinforcement Learning as a Service (RaaS) for real-world resource optimization problems.