AgentIndex icon
AgentIndex
ToolsCategoriesTrendingNewCompare
Submit Tool
Home/
LLM Infra/
on-policy
on-policy logo

on-policy

Active·★ 2.0k·MIT·Updated 2024-07-18
★ Trending★ Reinforcement Learning★ Multi-Agent AI

This is the official implementation of Multi-Agent PPO (MAPPO).

This repository implements MAPPO, a multi-agent variant of PPO, widely used in cooperative multi-agent games and research. It provides robust implementations for various multi-agent environments like StarCraft II, Hanabi, and Google Research Football, along with detailed training scripts and hyperparameter guidance.

#Multi-Agent Reinforcement Learning#PPO#MAPPO#Reinforcement Learning#PyTorch
$ Install
$ pip install -e .
↗ Visit site★ GitHub
01

Features

01Implementation of MAPPO (Multi-Agent PPO)
02Support for diverse multi-agent environments (e.g., StarCraft II, Hanabi)
03Ready-to-use training scripts for various scenarios
04Detailed hyperparameter guidance and updated results
05Default support for shared policy among agents
02

Compatibility

StarCraftII (SMAC)
Native
Verified via docs
Hanabi
Native
Verified via docs
Multiagent Particle-World Environments (MPEs)
Native
Verified via docs
Google Research Football (GRF)
Native
Verified via docs
StarCraftII (SMAC) v2
Native
Verified via docs
03

Quick start

1
$ pip install -e .
04

Use cases

↳Research and experimentation in cooperative multi-agent reinforcement learning
↳Benchmarking and evaluating PPO's effectiveness in MARL scenarios
↳Training AI agents for popular multi-agent games like StarCraft II and Hanabi
05

Alternatives

MetaGPT logo
MetaGPT★ 68.4k
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
vs →
cua logo
cua★ 17.3k
Open-source infrastructure for Computer-Use Agents. Sandboxes, SDKs, and benchmarks to train and evaluate AI agents that can control full desktops (macOS, Linux, Windows).
vs →
rllm logo
rllm★ 5.6k
Democratizing Reinforcement Learning for LLMs
vs →
ir-sim logo
ir-sim★ 1.1k
A Python-based lightweight robot simulator designed for navigation, control, and reinforcement learning
vs →
verl-agent logo
verl-agent★ 1.9k
verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"
vs →
maro logo
maro★ 919
Multi-Agent Resource Optimization (MARO) platform is an instance of Reinforcement Learning as a Service (RaaS) for real-world resource optimization problems.
vs →
skrl logo
skrl★ 1.1k
Modular Reinforcement Learning (RL) library (implemented in PyTorch, JAX, and NVIDIA Warp) with support for Gymnasium/Gym, NVIDIA Isaac Lab, MuJoCo Playground and other environments
vs →
virtualhome logo
virtualhome★ 617
API to run VirtualHome, a Multi-Agent Household Simulator
vs →
See all alternatives →

Related searches

on-policy AlternativesBest LLM Infra Tools 2026Open Source LLM Infraon-policy Tutorialon-policy Vs CompetitorsMulti-Agent Reinforcement LearningPPOMAPPO

Comments

Log in to leave a comment

No comments yet. Be the first!

On this page
01Features02Compatibility03Quick start04Use cases05Alternatives
Stats
GitHub Stars★ 2.0k
Last commit1y ago
StatusActive
LicenseMIT
CategoryLLM Infra
Trend (30d)
+0k↑ 4.4%
Links
Documentation↗Discussion↗Issues↗Releases↗

Deploy on DigitalOcean — Get $200 Free Credit

Ad
© 2026 AgentIndex.app|Built by a 10-year iOS Developer.
QYSGitHubBuy me a coffee ☕

Browse by Category

Code AssistantWorkflow AutomationRAG / Knowledge BaseMulti-AgentBrowser AutomationLLM InfraDev ToolingObservability

Not affiliated with Anthropic, OpenAI or Microsoft.