AgentIndex icon
AgentIndex
ToolsCategoriesTrendingNewCompare
Submit Tool
Home/
Compare/
on-policy vs llmqore
on-policy logo
on-policy
★ 2.0k
vs
llmqore logo
llmqore
★ 24

on-policy vs llmqore

on-policy: This repository implements MAPPO, a multi-agent variant of PPO, widely used in cooperative multi-agent games and research. It provides robust implementations for various multi-agent environments like StarCraft II, Hanabi, and Google Research Football, along with detailed training scripts and hyperparameter guidance.; llmqore: A Qt/C++ library that provides a unified streaming API across six LLM providers, supports creating MCP servers and clients, and includes a standalone CLI tool called MCP Bridge for aggregating MCP servers.

01

TL;DR

on-policy logoChoose on-policy if…

Research and experimentation in cooperative multi-agent reinforcement learning

llmqore logoChoose llmqore if…

Build LLM-powered applications with a unified API

02

Side-by-Side Comparison

Field
on-policy logoon-policy
llmqore logollmqore
Category
LLM Infra
LLM Infra
Stars
★ 2.0k
★ 24
License
MIT
MIT
Updated
1y ago
1d ago
Open Source
Yes
Yes
Website
↗ Visit
↗ Visit
GitHub
↗ GitHub
↗ GitHub
Tags
Multi-Agent Reinforcement Learning, PPO, MAPPO
claude, google, mcp
03

Features

on-policy logoon-policy
01Implementation of MAPPO (Multi-Agent PPO)
02Support for diverse multi-agent environments (e.g., StarCraft II, Hanabi)
03Ready-to-use training scripts for various scenarios
04Detailed hyperparameter guidance and updated results
05Default support for shared policy among agents
llmqore logollmqore
01Unified streaming API across six LLM providers
02MCP server with stdio and HTTP transports
03MCP client that binds server tools into LLM clients
04MCP Bridge CLI tool to aggregate multiple MCP servers
05Support for tools, resources, and prompts via MCP protocol
04

Use Cases

on-policy logoon-policy
↳Research and experimentation in cooperative multi-agent reinforcement learning
↳Benchmarking and evaluating PPO's effectiveness in MARL scenarios
↳Training AI agents for popular multi-agent games like StarCraft II and Hanabi
llmqore logollmqore
↳Build LLM-powered applications with a unified API
↳Create MCP servers to expose tools and resources
↳Aggregate multiple MCP servers into one endpoint using MCP Bridge
05

Best For

on-policy logoon-policy
TrendingReinforcement LearningMulti-Agent AI
llmqore logollmqore
TrendingDev Tooling
FAQ

FAQ

What is the difference between on-policy and llmqore?
Both on-policy and llmqore are in the LLM Infra category. on-policy has 2.0k stars, while llmqore has 24 stars.
Which is better, on-policy or llmqore?
The best choice depends on your use case. Choose on-policy if Research and experimentation in cooperative multi-agent reinforcement learning, and llmqore if Build LLM-powered applications with a unified API.
Is on-policy free or open source?
Yes, on-policy is open source on GitHub (MIT).
Is llmqore free or open source?
Yes, llmqore is open source on GitHub (MIT).
→

Related

Alternatives to on-policy →Alternatives to llmqore →on-policy details →llmqore details →
© 2026 AgentIndex.app|Built by a 10-year iOS Developer.
QYSGitHubBuy me a coffee ☕

Browse by Category

Code AssistantWorkflow AutomationRAG / Knowledge BaseMulti-AgentBrowser AutomationLLM InfraDev ToolingObservability

Not affiliated with Anthropic, OpenAI or Microsoft.