LLM-VM: The Anarchy LLM-VM is an optimized backend designed to run open-source LLMs with modern features like tool usage and persistent memory. It acts as a virtual machine for human language, coordinating models, data, prompts, and tools to optimize batch calls and support various architectures; on-policy: This repository implements MAPPO, a multi-agent variant of PPO, widely used in cooperative multi-agent games and research. It provides robust implementations for various multi-agent environments like StarCraft II, Hanabi, and Google Research Football, along with detailed training scripts and hyperparameter guidance.
Accelerate AGI development and prototyping
Research and experimentation in cooperative multi-agent reinforcement learning