#

reinforcement-learning-from-human-feedback

https://static.github-zh.com/github_avatars/OpenRLHF?size=40

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 7.93 k
9 小时前
https://static.github-zh.com/github_avatars/tatsu-lab?size=40

#自然语言处理#A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.

Python 825
1 年前
https://static.github-zh.com/github_avatars/martin-wey?size=40

CodeUltraFeedback: aligning large language models to coding preferences (TOSEM 2025)

Python 72
1 年前
https://static.github-zh.com/github_avatars/tlc4418?size=40

#计算机科学#A repo for RLHF training and BoN over LLMs, with support for reward model ensembles.

Python 45
8 个月前
https://static.github-zh.com/github_avatars/CJReinforce?size=40

#计算机科学#Official code for ICML 2024 paper, "RIME: Robust Preference-based Reinforcement Learning with Noisy Preferences" (ICML 2024 Spotlight)

Python 34
1 年前
https://static.github-zh.com/github_avatars/clam004?size=40

#自然语言处理#annotated tutorial of the huggingface TRL repo for reinforcement learning from human feedback connecting equations from PPO and GAE to the lines of code in the pytorch implementation

Jupyter Notebook 20
5 个月前
https://static.github-zh.com/github_avatars/ymetz?size=40

RLHF-Blender: A Configurable Interactive Interface for Learning from Diverse Human Feedback

Python 13
12 天前
https://static.github-zh.com/github_avatars/flint-xf-fan?size=40

[AAMAS 2025] Privacy-preserving and Personalized RLHF, with convergence guarantees. The Code contains experiments for training multiple instances of GPT-2 for personalized sentiment aligned text gener...

Python 10
5 个月前
https://static.github-zh.com/github_avatars/liushunyu?size=40

[TSMC] Ask-AC: An Initiative Advisor-in-the-Loop Actor-Critic Framework

Python 8
1 年前
https://static.github-zh.com/github_avatars/SJ9VRF?size=40

This repository contains the implementation of a Reinforcement Learning with Human Feedback (RLHF) system using custom datasets. The project utilizes the trlX library for training a preference model t...

Python 5
1 年前
https://static.github-zh.com/github_avatars/Almost-Intelligence?size=40

LMRax is a framework built on JAX to train transformers language models by reinforcement learning, along with the reward model training.

Python 2
3 年前
https://static.github-zh.com/github_avatars/umenzi?size=40

Code for Bachelor thesis, The Human Factor: Addressing Diversity in Reinforcement Learning from Human Feedback.

Python 0
1 年前
loading...
Website
Wikipedia