Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
2020-10-03
No
2025-08-18T01:07:44Z
#Computer Science# Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch
#Computer Science# Implementation of Denoising Diffusion Probabilistic Model in Pytorch
#Computer Science# Implementation of Imagen, Google's text-to-image neural network, in Pytorch
#Computer Science# Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT)...
Open-Sora: a fully open-source, efficient reproduction of Sora-like video generation
#Natural Language Processing# State-of-the-art natural language processing for Jax, PyTorch, and TensorFlow
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.
#Computer Science# A concise but complete full-attention transformer with a set of promising experimental features from various papers
Mamba SSM architecture
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
#Computer Science# Implementations of 59 deep learning papers with detailed annotations, including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, ...), gans (cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. D...
#Large Language Models# "Build a Large Language Model (From Scratch)": implementing a ChatGPT-like large language model from scratch in PyTorch
Zero-Shot Speech Editing and Text-to-Speech in the Wild