GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

rlhf

Website
Wikipedia
hiyouga/LLaMA-Factory
https://static.github-zh.com/github_avatars/hiyouga?size=40
hiyouga / LLaMA-Factory

#大语言模型#Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

fine-tuninglanguage-modelllama大语言模型pefttransformersrlhfqloraquantizationchatglmqweninstruction-tuningmistralgptloralarge-language-modelsagent人工智能moellama3
Python 52.29 k
4 天前
https://static.github-zh.com/github_avatars/LAION-AI?size=40
LAION-AI / Open-Assistant

#大语言模型#面向所有人的对话式 AI,我们相信我们即将创造一场革命,正如 Stable Diffusion 改变了现代艺术的创作过程, 我们将透过对话式 AI 来改变世界.

ChatGPTlanguage-modelrlhf人工智能assistantdiscord-bot机器学习NextPython
Python 37.38 k
10 个月前
https://static.github-zh.com/github_avatars/RUCAIBox?size=40
RUCAIBox / LLMSurvey

#自然语言处理#大语言模型综述

chain-of-thoughtChatGPTin-context-learninginstruction-tuninglarge-language-models大语言模型自然语言处理pre-trained-language-modelspre-trainingrlhf
Python 11.58 k
3 个月前
ymcui/Chinese-LLaMA-Alpaca-2
https://static.github-zh.com/github_avatars/ymcui?size=40
ymcui / Chinese-LLaMA-Alpaca-2

#自然语言处理#中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)

alpacallama大语言模型llama-2large-language-models自然语言处理alpaca-2flash-attentionllama2alpaca2Yarnrlhf
Python 7.16 k
9 个月前
https://static.github-zh.com/github_avatars/InternLM?size=40
InternLM / InternLM

#大语言模型#Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).

聊天机器人gpt大语言模型long-contextrlhffine-tuning-llm中文flash-attentionpretrained-models
Python 6.94 k
4 个月前
huggingface/alignment-handbook
https://static.github-zh.com/github_avatars/huggingface?size=40
huggingface / alignment-handbook

#大语言模型#Robust recipes to align language models with human and AI preferences

大语言模型rlhftransformers
Python 5.22 k
2 个月前
argilla-io/argilla
https://static.github-zh.com/github_avatars/argilla-io?size=40
argilla-io / argilla

#自然语言处理#Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets

human-in-the-loop自然语言处理mlopsdeveloper-toolstext-labelingannotation-tool机器学习active-learningweak-supervisiontext-annotation大语言模型人工智能gpt-4rlhflangchain
Python 4.53 k
6 天前
https://static.github-zh.com/github_avatars/opendilab?size=40
opendilab / awesome-RLHF

#计算机科学#A curated list of reinforcement learning with human feedback resources (continually updated)

深度学习deep-reinforcement-learninghuman-feedbackreinforcement-learningrlhflarge-language-models
3.98 k
2 个月前
https://static.github-zh.com/github_avatars/PKU-Alignment?size=40
PKU-Alignment / align-anything

Align Anything: Training All-modality Model with Feedback

large-language-modelsmultimodalrlhfchameleondpovision-language-model
Jupyter Notebook 3.93 k
19 天前
Kiln-AI/Kiln
https://static.github-zh.com/github_avatars/Kiln-AI?size=40
Kiln-AI / Kiln

#计算机科学#The easiest tool for fine-tuning LLM models, synthetic data generation, and collaborating on datasets.

人工智能chain-of-thoughtcollaborationfine-tuning机器学习macOSollamaopenaipromptprompt-engineeringPythonrlhfsynthetic-dataWindowsevalsevaluation
Python 3.74 k
5 小时前
https://static.github-zh.com/github_avatars/hiyouga?size=40
hiyouga / ChatGLM-Efficient-Tuning

#大语言模型#Fine-tuning ChatGLM-6B with PEFT | 基于 PEFT 的高效 ChatGLM 微调

chatglmChatGPTfine-tuningloraalpacapefthuggingfacelanguage-modeltransformersPyTorchrlhfchatglm2qlora
Python 3.7 k
2 年前
https://static.github-zh.com/github_avatars/transformerlab?size=40
transformerlab / transformerlab-app

Open Source Application for Advanced LLM Engineering: interact, train, fine-tune, and evaluate large language models on your own computer.

Electronllama大语言模型lorarlhftransformersMLX
TypeScript 3.41 k
2 天前
Docta-ai/docta
https://static.github-zh.com/github_avatars/Docta-ai?size=40
Docta-ai / docta

A Doctor for your data

datadata-centric-aidata-centric-machine-learningdata-curationdata-diagnosislanguage-modelrlhf
Python 3.31 k
5 个月前
argilla-io/distilabel
https://static.github-zh.com/github_avatars/argilla-io?size=40
argilla-io / distilabel

Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

人工智能huggingface大语言模型openaiPythonrlhfsynthetic-datasynthetic-dataset-generation
Python 2.75 k
6 天前
https://static.github-zh.com/github_avatars/tatsu-lab?size=40
tatsu-lab / alpaca_eval

#自然语言处理#An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

深度学习evaluationfoundation-modelsinstruction-followinglarge-language-modelsleaderboard自然语言处理rlhf
Jupyter Notebook 1.77 k
6 个月前
https://static.github-zh.com/github_avatars/THUDM?size=40
THUDM / WebGLM

#大语言模型#WebGLM: An Efficient Web-enhanced Question Answering System (KDD 2023)

ChatGPT大语言模型rlhfwebglm
Python 1.6 k
3 个月前
https://static.github-zh.com/github_avatars/PKU-Alignment?size=40
PKU-Alignment / safe-rlhf

#数据仓库#Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

ai-safetyalpaca数据集deepspeedlarge-language-modelsllama大语言模型reinforcement-learningreinforcement-learning-from-human-feedbackrlhftransformersvicunasafetygpttransformerbeaver
Python 1.49 k
1 年前
https://static.github-zh.com/github_avatars/THUDM?size=40
THUDM / ImageReward

[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation

diffusion-modelsgenerative-modelrlhf
Python 1.42 k
5 个月前
https://static.github-zh.com/github_avatars/RLHFlow?size=40
RLHFlow / RLHF-Reward-Modeling

#大语言模型#Recipes to train reward model for RLHF.

大语言模型rlhfllama3
Python 1.37 k
2 个月前
https://static.github-zh.com/github_avatars/OpenLMLab?size=40
OpenLMLab / MOSS-RLHF

Secrets of RLHF in Large Language Models Part I: PPO

rlhfalignmentai-safety
Python 1.37 k
1 年前
loading...