EleutherAI

EleutherAI / gpt-neox

Stars: 7.22 k
Forks: 1.06 k



About

An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries

eleuther.ai
deepspeed-library, gpt-3, transformers, language-model
Created: 2020-12-22

Made in China: No

Last updated: 2025-06-09T22:43:40Z


Languages

  • Python 85.4%
  • C++ 11.2%
  • Cuda 2.5%
  • Shell 0.5%
  • Dockerfile 0.3%
  • C 0.1%
  • Other 0.01%


This repository has been indexed but not yet edited. For the project introduction and usage guide, read the README on GitHub.



Other open-source projects by EleutherAI

lm-evaluation-harness
@EleutherAI

A framework for few-shot evaluation of language models.

evaluation-framework, language-model, transformer
Python · 9.27 k
2 days ago
gpt-neo (archived)
@EleutherAI

An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.

language-model, transformers, gpt, gpt-2, gpt-3
Python · 8.29 k
3 years ago
the-pile
@EleutherAI

Python · 1.57 k
2 years ago

You may also be interested in

grok-1
@xai-org

Open-source release of the Grok-1 large model

Python · 50.29 k
9 months ago
lm-evaluation-harness
@EleutherAI

A framework for few-shot evaluation of language models.

evaluation-framework, language-model, transformer
Python · 9.27 k
2 days ago
Open-Sora
@hpcaitech

Open-Sora: a fully open-source, efficient reproduction of Sora-like video generation

Python · 26.66 k
1 month ago
01
@OpenInterpreter

The #1 open-source voice interface for desktop, mobile, and ESP32 chips.

Python · 5.07 k
7 months ago
llama.cpp
@ggml-org

A port of Facebook's LLaMA model in C/C++

llama, ggml
C++ · 81.75 k
38 minutes ago
OpenHands
@All-Hands-AI

#Large language models# 🙌 OpenHands: Code Less, Make More

agent, artificial intelligence, large language models, ChatGPT, claude-ai
Python · 58.04 k
1 hour ago
LLMs-from-scratch
Sebastian Raschka (@rasbt)

#Large language models# "Build a Large Language Model (From Scratch)": implementing a ChatGPT-like large language model from scratch in PyTorch

ChatGPT, gpt, large-language-models, large language models, Python
Jupyter Notebook · 51.08 k
1 day ago
llama
@meta-llama

Inference code for LLaMA models

Python · 58.34 k
5 months ago
ollama
@ollama

#Large language models# Set up and run Llama2 and other large models locally

llama, large language models, llama2, Go
Go · 143.65 k
15 hours ago
Open-Sora-Plan
@PKU-YuanGroup

This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.

Python · 11.98 k
9 days ago
DeepSpeed
@deepspeedai

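#Computer science# DeepSpeed Chat: one-click RLHF training that makes your ChatGPT-like hundred-billion-parameter models 15x faster and cheaper

deep learning, PyTorch, gpu, machine learning, billion-parameters
Python · 38.85 k
11 hours ago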
devika
@stitionai

Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. D...

Python · 18.77 k
9 months ago
pythia
@EleutherAI

The hub for EleutherAI's work on interpretability and learning dynamics

Jupyter Notebook · 2.39 k
6 months ago
gpt-neo (archived)
@EleutherAI

An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.

language-model, transformers, gpt, gpt-2, gpt-3
Python · 8.29 k
3 years ago
chronos-forecasting
@amazon-science

#Large language models# Chronos: Pretrained Models for Probabilistic Time Series Forecasting

forecasting, large-language-models, large language models, machine learning, time-series
Python · 3.36 k
17 days ago
DeepSeek-VL
@deepseek-ai

DeepSeek-VL: Towards Real-World Vision-Language Understanding

vision-language-model, vision-language-pretraining, foundation-models
Python · 3.88 k
1 year ago
OLMo
AI2 (@allenai)

Modeling, training, eval, and inference code for OLMo

Python · 5.67 k
2 days ago
gpt-prompt-engineer
@mshumer

Jupyter Notebook · 9.54 k
2 months ago
codellama
@meta-llama

Inference code for CodeLlama models

Python · 16.33 k
10 months ago
LaVague
@lavague-ai

#Large language models# Large Action Model framework to develop AI Web Agents

artificial intelligence, browser, large-action-model, large language models, Open Source
Python · 6.07 k
5 months ago