A framework for few-shot evaluation of language models.
An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.
A framework for few-shot evaluation of language models.
Open-Sora: 完全开源的高效复现类Sora视频生成方案
The #1 open-source voice interface for desktop, mobile, and ESP32 chips.
#大语言模型#《Build a Large Language Model (From Scratch)》,从零开始使用PyTorch实现一个类似ChatGPT的大型语言模型
LLaMA模型的推理代码
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
#计算机科学#DeepSpeed Chat: 一键式RLHF训练,让你的类ChatGPT千亿大模型提速省钱15倍
Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. D...
The hub for EleutherAI's work on interpretability and learning dynamics
An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.
#大语言模型#Chronos: Pretrained Models for Probabilistic Time Series Forecasting
DeepSeek-VL: Towards Real-World Vision-Language Understanding
Modeling, training, eval, and inference code for OLMo
CodeLlama 模型推理代码
#大语言模型#Large Action Model framework to develop AI Web Agents