# self-rewarding

https://static.github-zh.com/github_avatars/lucidrains?size=40

#Computer Science# Implementation of the training framework proposed in Self-Rewarding Language Models, from Meta AI

Python 1.4k
1 year ago
https://static.github-zh.com/github_avatars/ShuaiLyu0110?size=40

SQL-o1: A Self-Reward Heuristic Dynamic Search Method for Text-to-SQL

Python 191
4 months ago
https://static.github-zh.com/github_avatars/zli12321?size=40

Reinforcement Learning of Vision Language Models with Self Visual Perception Reward

Python 120
3 days ago
https://static.github-zh.com/github_avatars/sastpg?size=40

Consistent Paths Lead to Truth: Self-Rewarding Reinforcement Learning for LLM Reasoning

Python 15
3 months ago
https://static.github-zh.com/github_avatars/wantbook-book?size=40

#Large Language Models# SeRL: Self-Play Reinforcement Learning for Large Language Models with Limited Data

Python 13
14 days ago