GitHub Chinese Community
#sft

dataelement / bisheng

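#LLM# An LLM application development platform that empowers and accelerates the development and deployment of LLM applications, helping users enter the next generation of application development with the best possible experience.

agent, artificial intelligence, chatbot, rag, workflow, enterprise, genai, gpt, langchian, llama, LLM, llmdevops, llmops, OCR, openai, orchestration, Python, React, finetune, sft
TypeScript 8.84 k
2 days ago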
oumi-ai / oumi

Easily fine-tune, evaluate and deploy Qwen3, DeepSeek-R1, Llama 4 or any open source LLM / VLM!

dpo, evaluation, fine-tuning, inference, llama, LLM, sft, vlms
Python 8.18 k
2 days ago
modelscope / ms-swift

#LLM# Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, InternVL3, Llava, GLM4v, ...

LLM, lora, llama, sft, deploy, multimodal, peft, internvl, liger, qwen2-vl, rft, deepseek-r1, embedding, grpo, open-r1, megatron, omni, llama4, qwen3, qwen3-moe
Python 8.09 k
2 days ago
AI-Hypercomputer / maxtext

#LLM# A simple, performant and scalable Jax LLM!

large-language-models, LLM, gpt, deepseek, fine-tuning, gemma2, gemma3, jax, llama2, llama3, llama4, mistral, mixtral, sft
Python 1.77 k
2 days ago
ssbuild / chatglm_finetuning

#Computer science# ChatGLM 6B fine-tuning and Alpaca fine-tuning

chatglm, deep learning, lora, PyTorch, sft, freeze, qlora
Python 1.54 k
3 months ago
ScienceOne-AI / DeepSeek-671B-SFT-Guide

#LLM# An open-source solution for full parameter fine-tuning of DeepSeek-V3/R1 671B, including complete code and scripts from training to inference, as well as some practical experiences and conclusions. (D...

deepseek-r1, LLM, moe, sft, Python
Python 699
3 months ago
jerry1993-tech / Cornucopia-LLaMA-Fin-Chinese

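#Natural language processing# Cornucopia (聚宝盆): a series of open-source, commercially usable Chinese financial LLMs, together with an efficient, lightweight training framework for vertical-domain LLMs (pretraining, SFT, RLHF, quantization, etc.)

llama, natural language processing, Chinese, finance, rlhf, sft, qa, text-generation, large-language-models, transformers
Python 636
2 years ago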
choosewhatulike / trainable-agents

#Natural language processing# Code and datasets for "Character-LLM: A Trainable Agent for Role-Playing"

agent, language-model, LLM, roleplay, sft, large-language-models, natural language processing, character
Python 550
8 months ago
ukairia777 / tensorflow-nlp-tutorial

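#Natural language processing# A deep learning NLP repository that uses TensorFlow to cover everything from text preprocessing to the downstream tasks of recent models such as topic models, BERT, GPT, and LLMs.

Tensorflow, natural language processing, question-answering, named-entity-recognition, bert-ner, bert, LLM, dpo, llama, sft, huggingface, transformers, lora, trainer
Jupyter Notebook 544
1 month ago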
awesome-rag / awesome-rag

#Awesome# Awesome-RAG: a collection of typical RAG papers and systems.

agent, artificial intelligence, graphrag, LLM, Open Source, Bukkit, rag, sft, Awesome Lists
386
5 months ago
0xsequence / erc-1155

Ethereum Semi Fungible Standard (ERC-1155)

Ethereum, erc1155, sft, nft
TypeScript 322
21 days ago
open-sciencelab / GraphGen

#LLM# GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data Generation

ai4science, data-generation, llm-training, pretrain, qa, qwen, sft, knowledge-graph, LLM
Python 195
11 days ago
solv-finance / erc-3525

ERC-3525 Reference Implementation

sft
Solidity 112
2 years ago
NiuTrans / Vision-LLM-Alignment

#LLM# This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vision models.

vision, dpo, LLM, rlhf, sft, ppo, alignment, mllm, multi-model, llava
Python 109
8 months ago
liangyuwang / zo2

ZO2 (Zeroth-Order Offloading): Full Parameter Fine-Tuning 175B LLMs with 18GB GPU Memory

LLM, offloading, sft
Python 95
1 month ago
OpenSparseLLMs / LLaMA-MoE-v2

🚀 LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training

llama, mixture-of-experts, sft, moe, fine-tuning, instruction-tuning, llama3, sparsity, attention
Python 86
6 months ago
Goekdeniz-Guelmez / mlx-lm-lora

#Computer science# Train Large Language Models on MLX.

Apple, deep learning, dpo, grpo, machine learning, MLX, sft, training
Python 84
7 days ago
ecnu-sea / SEA

#Natural language processing# SEA is an automated paper review framework capable of generating comprehensive and high-quality review feedback with high consistency for papers, thereby assisting researchers in improving the quality...

dataset, LLM, natural language processing, sft
Python 71
7 months ago
Zeyi-Lin / Qwen3-Medical-SFT

Qwen3 Fine-tuning: Medical R1 Style Chat

fine-tuning, qwen3, sft
Python 68
16 days ago
TuGraph-family / Awesome-Text2GQL

#Awesome# Fine-Tuning Dataset Auto-Generation for Graph Query Languages.

Hacktoberfest, Awesome Lists, fine-tuning, graphdb, LLM, text2sql, peft, sft
Python 60
11 days ago