sft · GitHub Topics

#大语言模型#一个大模型应用开发平台，赋能和加速大模型应用开发落地，帮助用户以最佳体验进入下一代应用开发模式。

agent 人工智能聊天机器人 rag workflow enterprise genai gpt langchian llama 大语言模型 llmdevops llmops OCR openai orchestration Python React finetune sft

TypeScript 9.24 k

8 小时前

modelscope / ms-swift

#大语言模型#Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, InternVL3, Llava,...

大语言模型 lora llama sft deploy multimodal peft internvl liger qwen2-vl rft deepseek-r1 embedding grpo open-r1 megatron omni llama4 qwen3 qwen3-moe

Python 9 k

2 小时前

oumi-ai / oumi

Easily fine-tune, evaluate and deploy Qwen3, DeepSeek-R1, Llama 4 or any open source LLM / VLM!

dpo evaluation fine-tuning inference llama 大语言模型 sft vlms

Python 8.34 k

11 小时前

AI-Hypercomputer / maxtext

#大语言模型#A simple, performant and scalable Jax LLM!

large-language-models 大语言模型 gpt deepseek fine-tuning gemma2 gemma3 jax llama2 llama3 llama4 mistral mixtral sft

Python 1.85 k

2 天前

ssbuild / chatglm_finetuning

#计算机科学#chatglm 6b finetuning and alpaca finetuning

chatglm 深度学习 lora PyTorch sft freeze qlora

Python 1.55 k

5 个月前

ScienceOne-AI / DeepSeek-671B-SFT-Guide

#大语言模型#An open-source solution for full parameter fine-tuning of DeepSeek-V3/R1 671B, including complete code and scripts from training to inference, as well as some practical experiences and conclusions. (D...

deepseek-r1 大语言模型 moe sft Python

Python 736

5 个月前

jerry1993-tech / Cornucopia-LLaMA-Fin-Chinese

#自然语言处理#聚宝盆(Cornucopia): 中文金融系列开源可商用大模型，并提供一套高效轻量化的垂直领域LLM训练框架(Pretraining、SFT、RLHF、Quantize等)

llama 自然语言处理中文 finance rlhf sft qa text-generation large-language-models transformers

Python 639

2 年前

choosewhatulike / trainable-agents

#自然语言处理#Code and datasets for "Character-LLM: A Trainable Agent for Role-Playing"

agent language-model 大语言模型 roleplay sft large-language-models 自然语言处理 character

Python 566

9 个月前

ukairia777 / tensorflow-nlp-tutorial

#自然语言处理#tensorflow를 사용하여 텍스트 전처리부터, Topic Models, BERT, GPT, LLM과 같은 최신 모델의 다운스트림 태스크들을 정리한 Deep Learning NLP 저장소입니다.

Tensorflow 自然语言处理 question-answering named-entity-recognition bert-ner bert 大语言模型 dpo llama sft huggingface transformers lora trainer

Jupyter Notebook 551

1 个月前

awesome-rag / awesome-rag

#Awesome#Awesome-RAG: Collect typical RAG papers and systems.

agent 人工智能 graphrag 大语言模型 Open Source Bukkit rag sft Awesome Lists

403

6 个月前

0xsequence / erc-1155

Ethereum Semi Fungible Standard (ERC-1155)

以太坊 erc1155 sft nft

TypeScript 321

2 个月前

open-sciencelab / GraphGen

#大语言模型#GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data Generation

ai4science data-generation llm-training pretrain qa qwen sft knowledge-graph 大语言模型

Python 266

3 天前

liangyuwang / zo2

ZO2 (Zeroth-Order Offloading): Full Parameter Fine-Tuning 175B LLMs with 18GB GPU Memory

大语言模型 offloading sft deepseek llama qwen

Python 164

15 天前

Zeyi-Lin / Qwen3-Medical-SFT

Qwen3 Fine-tuning: Medical R1 Style Chat

fine-tuning qwen3 sft

Python 124

2 个月前

solv-finance / erc-3525

ERC-3525 Reference Implementation

sft

Solidity 112

2 年前

NiuTrans / Vision-LLM-Alignment

#大语言模型#This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vision models.

vision dpo 大语言模型 rlhf sft ppo alignment mllm multi-model llava

Python 110

1 个月前

OpenSparseLLMs / LLaMA-MoE-v2

🚀 LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training

llama mixture-of-experts sft moe fine-tuning instruction-tuning llama3 sparsity attention

Python 85

8 个月前

ecnu-sea / SEA

#自然语言处理#SEA is an automated paper review framework capable of generating comprehensive and high-quality review feedback with high consistency for papers, thereby assisting researchers in improving the quality...

dataset 大语言模型自然语言处理 sft

Python 80

8 个月前