Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, Kimi, and more!
2025-03-10
否
2025-08-02T21:14:33Z
该仓库已收录但尚未编辑。项目介绍及使用教程请前往 GitHub 阅读 README