verl-agent: `verl-agent` extends veRL to train LLM agents using reinforcement learning, featuring a novel step-independent multi-turn rollout mechanism. This design ensures high scalability for long-horizon tasks by allowing customizable per-step input structures and memory management.; env-doctor: Env-Doctor is a crucial tool that diagnoses and resolves common compatibility issues between your GPU, NVIDIA CUDA versions, and Python AI libraries like PyTorch and TensorFlow. It helps users quickly identify and fix mismatches, ensuring a smooth deep learning development experience.
Training large language model agents for complex multi-turn, long-horizon tasks.
Diagnosing GPU, CUDA, and Python AI library version conflicts