#

self-attention

https://static.github-zh.com/github_avatars/datawhalechina?size=40

#大语言模型#《李宏毅深度学习教程》(李宏毅老师推荐👍,苹果书🍎),PDF下载地址:https://github.com/datawhalechina/leedl-tutorial/releases

Jupyter Notebook 15.72 k
3 个月前
https://static.github-zh.com/github_avatars/zhouhaoyi?size=40
Python 6.18 k
3 个月前
https://static.github-zh.com/github_avatars/PetarV-?size=40
Python 3.42 k
3 年前
https://static.github-zh.com/github_avatars/Diego999?size=40

Pytorch implementation of the Graph Attention Network model by Veličković et. al (2017, https://arxiv.org/abs/1710.10903)

Python 3.06 k
2 年前
gordicaleksa/pytorch-GAT
https://static.github-zh.com/github_avatars/gordicaleksa?size=40

#计算机科学#My implementation of the original GAT paper (Veličković et al.). I've additionally included the playground.py file for visualizing the Cora dataset, GAT embeddings, an attention mechanism, and entropy...

Jupyter Notebook 2.59 k
3 年前
https://static.github-zh.com/github_avatars/speedinghzl?size=40

CCNet: Criss-Cross Attention for Semantic Segmentation (TPAMI 2020 & ICCV 2019).

Python 1.47 k
4 年前
The-AI-Summer/self-attention-cv
https://static.github-zh.com/github_avatars/The-AI-Summer?size=40
Python 1.21 k
4 年前
https://static.github-zh.com/github_avatars/xxxnell?size=40

(ICLR 2022 Spotlight) Official PyTorch implementation of "How Do Vision Transformers Work?"

Python 820
3 年前
https://static.github-zh.com/github_avatars/kaituoxu?size=40

A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.

Python 803
2 年前
https://static.github-zh.com/github_avatars/jayparks?size=40

A Pytorch Implementation of "Attention is All You Need" and "Weighted Transformer Network for Machine Translation"

Python 562
5 年前
loading...
Website
Wikipedia