AReaL: AReaL is an open-source, fully asynchronous reinforcement learning training system designed for large reasoning and agentic models. It offers exceptional flexibility, industry-leading speed, and scalability from a single node to over 1,000 GPUs, achieving state-of-the-art performance.; mini-swe-agent: Mini-SWE-agent is a lightweight, 100-line AI agent designed to solve GitHub issues and more, offering a simplified yet performant alternative to larger coding agents. It focuses on minimalism, high performance on benchmarks like SWE-bench, and easy deployment across various environments.
Training Reasoning Agents: Developing AI agents capable of complex mathematical, coding, and general reasoning tasks.
Researchers for benchmarking, fine-tuning, or RL experiments without bloat