on-policy: This repository implements MAPPO, a multi-agent variant of PPO widely used in cooperative multi-agent games and research. It provides implementations for multi-agent environments such as StarCraft II, Hanabi, and Google Research Football, along with training scripts and hyperparameter guidance; xLAM: xLAM is a research repository for Large Action Models (LAMs) that aggregates agent trajectories from diverse environments and unifies them into a consistent format. The unified format streamlines the creation of a generic data loader optimized for agent training, supporting model development across varied scenarios.
Research and experimentation in cooperative multi-agent reinforcement learning
Function calling in LLMs