# attention-is-all-you-need

https://static.github-zh.com/github_avatars/jadore801120?size=40
Python 9.37 k
1 year ago
https://static.github-zh.com/github_avatars/Kyubyong?size=40

A TensorFlow Implementation of the Transformer: Attention Is All You Need

Python 4.41 k
2 years ago
gordicaleksa/pytorch-original-transformer
https://static.github-zh.com/github_avatars/gordicaleksa?size=40

#Computer Science# My implementation of the original transformer model (Vaswani et al.). I've additionally included the playground.py file for visualizing otherwise seemingly hard concepts. Currently included IWSLT pret...

Jupyter Notebook 1.04 k
5 years ago
https://static.github-zh.com/github_avatars/hkproj?size=40
Jupyter Notebook 1.03 k
1 year ago
https://static.github-zh.com/github_avatars/kaituoxu?size=40

A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.

Python 803
2 years ago
https://static.github-zh.com/github_avatars/lsdefine?size=40

#Computer Science# A Keras+TensorFlow Implementation of the Transformer: Attention Is All You Need

Python 713
4 years ago
https://static.github-zh.com/github_avatars/kyegomez?size=40

#Large Language Models# Implementation of plug in and play Attention from "LongNet: Scaling Transformers to 1,000,000,000 Tokens"

Python 711
2 years ago
https://static.github-zh.com/github_avatars/sooftware?size=40

Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.

Python 632
2 years ago
https://static.github-zh.com/github_avatars/feifeibear?size=40

USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference

Python 563
7 days ago
https://static.github-zh.com/github_avatars/jayparks?size=40

A PyTorch Implementation of "Attention is All You Need" and "Weighted Transformer Network for Machine Translation"

Python 562
5 years ago
https://static.github-zh.com/github_avatars/kyegomez?size=40

#Computer Science# Implementation of the ScreenAI model from the paper: "A Vision-Language Model for UI and Infographics Understanding"

Python 364
8 days ago
https://static.github-zh.com/github_avatars/kyegomez?size=40

An open source implementation of "Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning", an all-new multi modal AI that uses just a decoder to generate both text and images

Python 363
2 years ago
https://static.github-zh.com/github_avatars/sled-group?size=40

[CVPR 2024] Official implementation of CVPR 2024 paper: "Inversion-Free Image Editing with Natural Language"

Python 343
1 year ago