GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

deepspeed

Website
Wikipedia
https://static.github-zh.com/github_avatars/InternLM?size=40
InternLM / lmdeploy

#大语言模型#LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

cuda-kernelsdeepspeedfastertransformerllm-inferenceturbomindinternlmllama大语言模型codellamallama2llama3
Python 6.52 k
2 天前
https://static.github-zh.com/github_avatars/PKU-Alignment?size=40
PKU-Alignment / safe-rlhf

#数据仓库#Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

ai-safetyalpaca数据集deepspeedlarge-language-modelsllama大语言模型reinforcement-learningreinforcement-learning-from-human-feedbackrlhftransformersvicunasafetygpttransformerbeaver
Python 1.49 k
1 年前
https://static.github-zh.com/github_avatars/zjunlp?size=40
zjunlp / KnowLM

#计算机科学#An Open-sourced Knowledgable Large Language Model Framework.

llamalarge-language-modelspre-trained-language-modelslanguage-modelinstruction-following深度学习中文englishinstructionsmodelsreasoninggpt-3deepspeedinstruction-tuninglorapre-trainingpre-trained-model
Python 1.32 k
5 个月前
https://static.github-zh.com/github_avatars/antgroup?size=40
antgroup / glake

#大语言模型#GLake: optimizing GPU memory management and IO transmission.

deepspeedgpu大语言模型memoryonnxPyTorch
Python 466
3 个月前
https://static.github-zh.com/github_avatars/Coobiw?size=40
Coobiw / MPP-LLaVA

Personal Project: MPP-Qwen14B & MPP-Qwen-Next(Multimodal Pipeline Parallel based on Qwen-LM). Support [video/image/multi-image] {sft/conversations}. Don't let the poverty limit your imagination! Train...

multimodal-large-language-modelsdeepspeedpipeline-parallelismmllmqwenfine-tuningpretraining
Jupyter Notebook 452
3 个月前
https://static.github-zh.com/github_avatars/shm007g?size=40
shm007g / LLaMA-Cult-and-More

#大语言模型#Large Language Models for All, 🦙 Cult and More, Stay in touch !

alpacaChatGPTgptllamaggmlgpt4gptqvicunaPyTorchTensorflowtransformersdeepspeed大语言模型
HTML 446
2 年前
https://static.github-zh.com/github_avatars/Xirider?size=40
Xirider / finetune-gpt2xl

Guide: Finetune GPT2-XL (1.5 Billion Parameters) and finetune GPT-NEO (2.7 B) on a single GPU with Huggingface Transformers using DeepSpeed

huggingfacehuggingface-transformersdeepspeedgpt2gpt3finetuninggpt-neo
Python 437
2 年前
https://static.github-zh.com/github_avatars/LambdaLabsML?size=40
LambdaLabsML / distributed-training-guide

Best practices & guides on how to write distributed pytorch training code

CUDAdeepspeeddistributed-traininggpugpu-clusterkuberentesncclPyTorchslurmclustermpisharding
Python 435
4 个月前
https://static.github-zh.com/github_avatars/OpenMOSS?size=40
OpenMOSS / CoLLiE

#自然语言处理#Collaborative Training of Large Language Models in an Efficient Way

深度学习deepspeed自然语言处理PyTorch
Python 415
10 个月前
https://static.github-zh.com/github_avatars/openpsi-project?size=40
openpsi-project / ReaLHF

#大语言模型#Super-Efficient RLHF Training of LLMs with Parameter Reallocation

大语言模型llm-trainingreinforcement-learning-from-human-feedbackreinforcement-learningdistributed-systemsdistributed-computinglarge-language-modelsllm-frameworkdeepspeedtransformers
Python 300
2 个月前
https://static.github-zh.com/github_avatars/sunzeyeah?size=40
sunzeyeah / RLHF

#自然语言处理#Implementation of Chinese ChatGPT

ChatGPT深度学习deepspeedglm自然语言处理PyTorch
Python 286
2 年前
https://static.github-zh.com/github_avatars/stanleylsx?size=40
stanleylsx / llms_tool

一个基于HuggingFace开发的大语言模型训练、测试工具。支持各模型的webui、终端预测,低参数量及全参数模型训练(预训练、SFT、RM、PPO、DPO)和融合、量化。

baichuanbloomchatglmfalconinternlmllamallama2mossqwenchatglm2PyTorchdeepspeedbaichuan2mistralchatglm3
Python 217
2 年前
https://static.github-zh.com/github_avatars/git-cloner?size=40
git-cloner / llama2-lora-fine-tuning

llama2 finetuning with deepspeed and lora

deepspeedfinetuningllama2lora
Python 174
2 年前
https://static.github-zh.com/github_avatars/bobo0810?size=40
bobo0810 / LearnDeepSpeed

DeepSpeed教程 & 示例注释 & 学习笔记 (大模型高效训练)

deepspeedExamplelarge-language-models
Python 164
2 年前
https://static.github-zh.com/github_avatars/jackaduma?size=40
jackaduma / ChatGLM-LoRA-RLHF-PyTorch

#大语言模型#A full pipeline to finetune ChatGLM LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the ChatGLM architecture. Basically ChatG...

lorachatglmchatglm-6bChatGPTfinetunegpt大语言模型PyTorchrlhfllamadeepspeedpeftppo
Python 136
2 年前
https://static.github-zh.com/github_avatars/HomebrewML?size=40
HomebrewML / revlib

#计算机科学#Simple and efficient RevNet-Library for PyTorch with XLA and DeepSpeed support and parameter offload

PyTorch深度学习deepspeedxlatpu
Python 127
3 年前
https://static.github-zh.com/github_avatars/CoinCheung?size=40
CoinCheung / gdGPT

#自然语言处理#Train llm (bloom, llama, baichuan2-7b, chatglm3-6b) with deepspeed pipeline mode. Faster than zero/zero++/fsdp.

deepspeed大语言模型pipeline自然语言处理PyTorchbloomflash-attentionbaichuan2-7bmixtral-8x7bllama2
Python 96
1 年前
https://static.github-zh.com/github_avatars/OpenCSGs?size=40
OpenCSGs / llm-inference

llm-inference is a platform for publishing and managing llm inference, providing a wide range of out-of-the-box features for model deployment, such as UI, RESTful API, auto-scaling, computing resource...

deepspeedllama-cppllm-inferenceraytransformervllm
Python 81
1 年前
https://static.github-zh.com/github_avatars/xyjigsaw?size=40
xyjigsaw / LLM-Pretrain-SFT

Scripts of LLM pre-training and fine-tuning (w/wo LoRA, DeepSpeed)

large-language-modelsllamaloramistralbaichuan2deepspeed
Python 80
1 年前
https://static.github-zh.com/github_avatars/billvsme?size=40
billvsme / train_law_llm

#大语言模型#✏️0成本LLM微调上手项目,⚡️一步一步使用colab训练法律LLM,基于microsoft/phi-1_5、chatglm3,包含lora微调,全参微调

人工智能deepspeedlawllama2大语言模型loraPython
Jupyter Notebook 75
1 年前
loading...