GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

rft

Website
Wikipedia
https://static.github-zh.com/github_avatars/modelscope?size=40
modelscope / ms-swift

#大语言模型#Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, InternVL3, Llava, GLM4v, ...

大语言模型lorallamasftdeploymultimodalpeftinternvlligerqwen2-vlrftdeepseek-r1embeddinggrpoopen-r1megatronomnillama4qwen3qwen3-moe
Python 8.09 k
2 天前
https://static.github-zh.com/github_avatars/LifeCoachRay?size=40
LifeCoachRay / My-Pocket-Token-Foundation

#区块链#The My Pocket Token Foundation will make the blockchain better, by bridging the blockchain with the worldwide web. Some of the best Developers in the world.

blockchain-technologydeveloper-tools加密货币tokensblogging安全rftnftoolsnftsnftnft-gallery
Solidity 15
3 年前
https://static.github-zh.com/github_avatars/dafyddg?size=40
dafyddg / RFA

Implementation of the Rhythm Formant Analysis methodology for identifying speech rhythms and rhythm variation in the low frequency spectrum and spectrogram.

rft
Python 15
2 年前
https://static.github-zh.com/github_avatars/flint-xf-fan?size=40
flint-xf-fan / Federated-RLHF

[AAMAS 2025] Privacy-preserving and Personalized RLHF, with convergence guarantees. The Code contains experiments for training multiple instances of GPT-2 for personalized sentiment aligned text gener...

大语言模型reinforcement-learning-from-human-feedbackrftrlhf
Python 9
2 个月前
https://static.github-zh.com/github_avatars/anasshad?size=40
anasshad / Refungible-Tokens-Fractional-NFT

Smart contract and unit tests for Refungible Token / Fractional NFT

Soliditysmart-contractserc721erc20nftrft
JavaScript 4
4 年前
https://static.github-zh.com/github_avatars/XxFChen?size=40
XxFChen / awesome-reinforcement-fine-tuning

Awesome Reinforcement Fine Tuning

rftfine-tuningfinetuning
3
6 个月前
https://static.github-zh.com/github_avatars/JohnTheCoolingFan?size=40
JohnTheCoolingFan / RandomFactorioThings

Random Factorio Things mod for Factorio

Factoriomodrft
Lua 2
5 个月前
https://static.github-zh.com/github_avatars/Masoudjafaripour?size=40
Masoudjafaripour / llm-hf-planning

A small Hugging Face LLM for planning and reasoning

fine-tuning大语言模型planningrftsft
Python 2
4 个月前
https://static.github-zh.com/github_avatars/Azaijah?size=40
Azaijah / Syllogimind

Syllogimind is a application developed in Go, designed to engage users in enhancing their logical reasoning capabilities through the generation and solving of syllogisms.

rft
Go 0
1 年前
https://static.github-zh.com/github_avatars/aman-maurya?size=40
aman-maurya / OfficeExporter

Generate MsWord file using php

PHPphp-librarymswordrftoffice-tools
PHP 0
5 年前
https://static.github-zh.com/github_avatars/rft-kolcsonzo?size=40
rft-kolcsonzo / kolcsonzo-api

rftuniversityPHPREST APIAPIDocker
PHP 0
6 年前
https://static.github-zh.com/github_avatars/HINNOTN?size=40
HINNOTN / syllogisms

#大语言模型#Algorithmic Truth Table Method for Proving Validity of Argument Forms

大语言模型philosophyPythonreasoningrftstatementvalidation
TeX 0
4 天前
https://static.github-zh.com/github_avatars/ksm26?size=40
ksm26 / Reinforcement-Fine-Tuning-LLMs-with-GRPO

The course teaches how to fine-tune LLMs using Group Relative Policy Optimization (GRPO)—a reinforcement learning method that improves model reasoning with minimal data. Learn RFT concepts, reward des...

grporeinforcement-learningrftrlhflanguage-model机器学习
0
10 天前