This repository has been indexed but not yet edited. For the project introduction and usage tutorial, see the README on GitHub.
RLinf is a flexible and scalable open-source infrastructure designed for post-training foundation models (LLMs, VLMs, VLAs) via reinforcement learning.
2025-08-14
No
2025-09-19T07:57:56Z
0 discussions