NVIDIA-NeMo / Curator

Stars: 1.13k
Forks: 170


This repository has been indexed but not yet edited. For the project introduction and usage tutorial, read the README on GitHub.


About

Scalable data preprocessing and curation toolkit for LLMs

data-curation, large language models, data, data-prep, data-preparation, data-processing, data-quality, datacuration, datarecipes, Entity resolution, fine-tuning, large-language-models, large-scale-data-processing, llmapps, Python
Created: 2024-03-14

Domestic (China-based) project: No

Last modified: 2025-09-06T03:59:12Z


Languages

  • Python 99.7%
  • Shell 0.1%
  • Makefile 0.1%
  • Dockerfile 0.1%

Other open-source projects from NVIDIA-NeMo

NeMo
@NVIDIA-NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

machine-translation, speaker-recognition, asr, tts, generative-ai
Python · 15.63k
6 hours ago
RL
@NVIDIA-NeMo

Scalable toolkit for efficient model reinforcement

Python · 844
16 hours ago

You may also be interested in

grok-1
@xai-org

Open-source release of the Grok-1 large model

Python · 50.5k
1 year ago
transformer-debugger
OpenAI @openai

Python · 4.09k
1 year ago
OpenHands
@All-Hands-AI

[Large Language Models] 🙌 OpenHands: Code Less, Make More

agent, artificial intelligence, large language models, ChatGPT, claude-ai
Python · 63.26k
5 hours ago
Open-Sora
@hpcaitech

Open-Sora: a fully open-source, efficient reproduction of Sora-like video generation

Python · 27.15k
4 months ago
SWE-agent
@SWE-agent

[Large Language Models] SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]

agent, artificial intelligence, developer-tools, large language models, agent-based-model
Python · 17.26k
5 days ago
GaLore
@jiaweizzhao

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Python · 1.59k
10 months ago
LaVague
@lavague-ai

[Large Language Models] Large Action Model framework to develop AI Web Agents

artificial intelligence, browser, large-action-model, large language models, Open Source
Python · 6.16k
8 months ago
torchtune
@pytorch • Meta

PyTorch native post-training library

Python · 5.39k
1 month ago
devika
@stitionai

Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. D...

Python · 19.4k
1 year ago
VoiceCraft
@jasonppy

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Jupyter Notebook · 8.37k
6 months ago
Open-Sora-Plan
@PKU-YuanGroup

This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.

Python · 12.02k
2 months ago
lightning-thunder
⚡️ Lightning AI @Lightning-AI

PyTorch compiler that accelerates training and inference. Get built-in optimizations for performance, memory, parallelism, and easily write your own.

Python · 1.39k
23 days ago
01
@openinterpreter

The #1 open-source voice interface for desktop, mobile, and ESP32 chips.

Python · 5.09k
10 months ago
DeepSeek-VL
@deepseek-ai

DeepSeek-VL: Towards Real-World Vision-Language Understanding

vision-language-model, vision-language-pretraining, foundation-models
Python · 3.96k
1 year ago
grok
OpenAI @openai

Python · 4.18k
1 year ago
fsdp_qlora
@AnswerDotAI

Training LLMs with QLoRA + FSDP

Jupyter Notebook · 1.53k
10 months ago
gpt-prompt-engineer
@mshumer

Jupyter Notebook · 9.59k
4 months ago
dbrx
Databricks @databricks

[Large Language Models] Code examples and resources for DBRX, a large language model developed by Databricks

databricks, gen-ai, generative-ai, large language models, llm-inference
Python · 2.57k
1 year ago
lumentis
@hrishioa

AI powered one-click comprehensive docs from transcripts and text.

TypeScript · 1.65k
7 months ago
maestro
@Doriandarko

A framework for Claude Opus to intelligently orchestrate subagents.

Python · 4.27k
1 year ago