verl-agent: `verl-agent` extends veRL to train LLM agents using reinforcement learning, featuring a novel step-independent multi-turn rollout mechanism. This design ensures high scalability for long-horizon tasks by allowing customizable per-step input structures and memory management.; gym-pybullet-drones: gym-pybullet-drones is a minimalist refactoring of its original repository, providing a Gym environment for simulating multi-agent quadcopter control. It is designed for compatibility with Gymnasium, Stable Baselines3 2.0, and various flight firmwares for hardware-in-the-loop simulation.
Training large language model agents for complex multi-turn, long-horizon tasks.
Developing and evaluating PID controllers for quadcopter flight