GitHub Chinese Community

model-parallelism

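hpcaitech / ColossalAI

#Computer Science# A training system for large AI models that integrates efficient parallelization techniques.

deep-learning, hpc, large-scale, data-parallelism, pipeline-parallelism, model-parallelism, artificial-intelligence, big-model, distributed-computing, inference, heterogeneous-training, foundation-models
Python 40.96k
2 days ago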

deepspeedai / DeepSpeed

#Computer Science# DeepSpeed Chat: one-click RLHF training that makes your ChatGPT-like hundred-billion-parameter models 15x faster and cheaper

deep-learning, PyTorch, gpu, machine-learning, billion-parameters, data-parallelism, model-parallelism, inference, pipeline-parallelism, compression, mixture-of-experts, trillion-parameters, zero
Python 38.87k
1 day ago

kakaobrain / torchgpipe

#Computer Science# A GPipe implementation in PyTorch

deep-learning, PyTorch, gpipe, model-parallelism, pipeline-parallelism, parallelism, checkpointing
Python 843
1 year ago

PaddlePaddle / PaddleFleetX

PaddlePaddle large-model development suite, providing an end-to-end development toolchain for large language models, cross-modal large models, biocomputing large models, and other domains.

paddlepaddle, benchmark, large-scale, model-parallelism, data-parallelism, pipeline-parallelism, cloud, elastic, lightning, pretraining, self-supervised-learning, unsupervised-learning
Python 470
1 year ago

Oneflow-Inc / libai

#Natural Language Processing# LiBai (李白): A Toolbox for Large-Scale Distributed Parallel Training

oneflow, natural-language-processing, deep-learning, large-scale, data-parallelism, model-parallelism, distributed-training, pipeline-parallelism, transformer, self-supervised-learning, vision-transformer
Python 406
1 month ago

kaiyuyue / torchshard

Slicing a PyTorch Tensor Into Parallel Shards

PyTorch, model-parallelism
Python 299
8 days ago

alibaba / EasyParallelLibrary

#Computer Science# Easy Parallel Library (EPL) is a general and efficient deep learning framework for distributed model training.

deep-learning, data-parallelism, model-parallelism, pipeline-parallelism, memory-efficient, distributed-training, gpu
Python 267
2 years ago

Shenggan / awesome-distributed-ml

#Computer Science# A curated list of awesome projects and papers for distributed training or inference

deep-learning, distributed-systems, high-performance-computing, machine-learning, model-parallelism, pipeline-parallelism
238
8 months ago

xrsrke / pipegoose

Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*

megatron, transformers, data-parallelism, pipeline-parallelism, model-parallelism, huggingface-transformers, mixture-of-experts, moe
Python 82
2 years ago

hkproj / pytorch-transformer-distributed

#Computer Science# Distributed training (multi-node) of a Transformer model

data-parallelism, deep-learning, distributed-training, machine-learning, model-parallelism, PyTorch, tutorial
Python 69
1 year ago

tanyuqian / redco

NAACL '24 (Best Demo Paper Runner-Up) / MLSys @ NeurIPS '23 - RedCoast: A Lightweight Tool to Automate Distributed Training and Inference

jax, model-parallelism, distributed-training, large-language-models, llama, diffusion-models, federated-learning, image-captioning, maml, meta-learning, ppo, reinforcement-learning, seq2seq, stable-diffusion, mlsys, gemma, differential-privacy
Python 66
6 months ago

NERSC / sc23-dl-tutorial

#Computer Science# SC23 Deep Learning at Scale Tutorial Material

data-parallelism, deep-learning, model-parallelism, vision-transformers
Python 45
9 months ago

vdutts7 / dnn-distributed

Distributed training of DNNs • C++/MPI Proxies (GPT-2, GPT-3, CosmoFlow, DLRM)

distributed-deep-learning, dnn, model-parallelism, deep-neural-networks, mpi
C++ 42
1 year ago

NERSC / dl-at-scale-training

#Computer Science# Deep Learning at Scale Training Event at NERSC

data-parallelism, deep-learning, hpc, model-parallelism, performance-optimization
Python 19
10 days ago

ryantd / veloce

#Computer Science# WIP. Veloce is a low-code Ray-based parallelization library that makes machine learning computation novel, efficient, and heterogeneous.

ray, distributed, PyTorch, distributed-computing, data-parallelism, parameter-server, deep-learning, model-parallelism, sparsity
Python 18
3 years ago

AlibabaPAI / FlashModels

#Large Language Models# Fast and easy distributed model training examples.

distributed-training, xla, data-parallelism, deep-learning, model-parallelism, PyTorch, zero, large-language-models
Python 12
7 months ago

atakehiro / 3D-U-Net-pytorch-model-parallel

PyTorch implementation of 3D U-Net with model parallelism across 2 GPUs for large models

PyTorch, model-parallelism
Python 9
5 years ago

Shenggan / atp

Adaptive Tensor Parallelism for Foundation Models

attention, distributed-training, model-parallelism, PyTorch, transformer, gpt
Python 9
3 years ago

ShashankSubramanian / transformer-perf-estimates

Performance Estimates for Transformer AI Models in Science

model-parallelism, transformer
Jupyter Notebook 7
8 months ago

fanpu / DynPartition

#Computer Science# Official implementation of DynPartition: Automatic Optimal Pipeline Parallelism of Dynamic Neural Networks over Heterogeneous GPU Systems for Inference Tasks

machine-learning, model-parallelism, neural-networks, pipeline-parallelism, PyTorch, reinforcement-learning, scheduling
Python 6
2 years ago