Best Alternatives to on-policy

LLM Infra · ★ 2.0k · MIT

This is the official implementation of Multi-Agent PPO (MAPPO).

#Multi-Agent Reinforcement Learning #PPO #MAPPO #Reinforcement Learning #PyTorch
01

Best Alternatives to on-policy

# | Tool | Description | Stars | License | Updated
01 | MetaGPT | 🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming | ★ 68.4k | MIT | 4mo ago
02 | cua | Open-source infrastructure for Computer-Use Agents. Sandboxes, SDKs, and benchmarks to train and evaluate AI agents that can control full desktops (macOS, Linux, Windows). | ★ 17.3k | MIT | 1d ago
03 | xLAM | xLAM: A Family of Large Action Models to Empower AI Agent Systems | ★ 621 | APACHE | 9mo ago
04 | mini-swe-agent | The 100 line AI agent that solves GitHub issues or helps you in your command line. Radically simple, no huge configs, no giant monorepo, but scores >74% on SWE-bench verified! | ★ 4.7k | not listed | 5d ago
05 | youtu-agent | A simple yet powerful agent framework that delivers with open-source models | ★ 4.6k | NOASSERTION | 2mo ago
06 | FedML | FEDML - The unified and scalable ML library for large-scale distributed training, model serving, and federated learning. FEDML Launch, a cross-cloud scheduler, further enables running any AI jobs on any GPU cloud or on-premise cluster. Built on this library, TensorOpera AI (https://TensorOpera.ai) is your generative AI platform at scale. | ★ 4.0k | Apache-2.0 | 7mo ago
07 | chatarena | ChatArena (or Chat Arena) is a Multi-Agent Language Game Environments for LLMs. The goal is to develop communication and collaboration capabilities of AIs. | ★ 1.5k | APACHE | 9mo ago
08 | AgileRL | Streamlining reinforcement learning with RLOps. State-of-the-art RL algorithms and tools, with 10x faster training through evolutionary hyperparameter optimization. | ★ 921 | not listed | 1d ago
02

Side-by-Side Comparison

Field | on-policy | MetaGPT | cua | xLAM
Category | LLM Infra | LLM Infra | LLM Infra | LLM Infra
Stars | ★ 2.0k | ★ 68.4k | ★ 17.3k | ★ 621
License | MIT | MIT | MIT | APACHE
Updated | 1y ago | 4mo ago | 1d ago | 9mo ago
Open Source | Yes | Yes | Yes | Yes
03

Compare on-policy With Others

on-policy vs MetaGPT
on-policy vs cua
on-policy vs xLAM
on-policy vs mini-swe-agent
on-policy vs youtu-agent
on-policy vs FedML
04

FAQ

What is on-policy?
This repository implements MAPPO, a multi-agent variant of PPO, widely used in cooperative multi-agent games and research. It provides robust implementations for various multi-agent environments like StarCraft II, Hanabi, and Google Research Football, along with detailed training scripts and hyperparameter guidance.
What is the best alternative to on-policy?
MetaGPT is the top-rated alternative to on-policy in the LLM Infra category.
Is there a free alternative to on-policy?
MetaGPT is a free, open-source alternative.
Is on-policy open source?
Yes, on-policy is open source on GitHub, licensed under MIT.
© 2026 AgentIndex.app | Built by a 10-year iOS Developer.
Not affiliated with Anthropic, OpenAI or Microsoft.