GitHub Chinese Community

#vit

lukas-blecher / LaTeX-OCR

#Computer Science# pix2tex: Using a ViT to convert images of equations into LaTeX code.

machine learning, transformer, im2latex, deep learning, image2text, LaTeX, dataset, PyTorch, im2markup, OCR, latex-ocr, vit, math-ocr, vision-transformer, image processing, Python, im2text
Python 14.55 k
5 months ago
cmhungsteve / Awesome-Transformer-Attention

#Awesome# A comprehensive paper list of Vision Transformer/Attention, including papers, code, and related websites

transformer, attention-mechanism, vision-transformer, deep learning, Awesome Lists, transformer-cv, transformer-architecture, transformer-awesome, transformer-with-cv, transformer-models, visual-transformer, computer vision, papers, attention-mechanisms, self-attention, vit, detr, transformers
4.88 k
1 year ago
towhee-io / towhee

#Large Language Models# Towhee is a framework dedicated to making neural data processing pipelines simple and fast.

machine learning, convolutional-networks, embedding-vectors, embeddings, computer vision, image processing, video-processing, feature-extraction, image-retrieval, unstructured-data, feature-vector, transformer, milvus, vision-transformer, vit, pipeline, large language models
Python 3.38 k
8 months ago
open-compass / VLMEvalKit

#Large Language Models# Open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks

gpt-4v, large-language-models, llava, multi-modal, openai, vqa, large language models, openai-api, qwen, gpt, computer vision, PyTorch, gpt4, ChatGPT, clip, vit, evaluation, claude, gemini
Python 2.52 k
3 days ago
hila-chefer / Transformer-Explainability

#Computer Science# [CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize classifications by Transformer-based networks.

deep learning, vision-transformer, bert-model, bert, explainability, vit, cvpr2021
Jupyter Notebook 1.9 k
1 year ago
roboflow / inference

#Computer Science# Turn any computer or edge device into a command center for your computer vision projects.

computer vision, inference-api, inference-server, vit, yolov5, yolov8, jetson, tensorrt, classification, instance-segmentation, object-detection, onnx, deployment, Docker, inference, machine learning, Python, yolo11, agents
Python 1.74 k
2 days ago
thu-ml / SageAttention

#Large Language Models# Quantized Attention achieves speedups of 2-5x and 3-11x compared to FlashAttention and xformers, without losing end-to-end metrics across language, image, and video models.

attention, large language models, quantization, CUDA, triton, video-generation, mlsys, vit
Cuda 1.72 k
4 days ago
Yangzhangcst / Transformer-in-Computer-Vision

#Computer Science# A paper list of some recent Transformer-based CV works.

transformer, transformer-cv, transformer-awesome, detr, vit, Awesome Lists, computer vision, deep learning, papers
1.29 k
6 days ago
BR-IDL / PaddleViT

#Computer Science# PaddleViT: State-of-the-art Visual Transformer and MLP Models for PaddlePaddle 2.0+

cv, computer vision, paddlepaddle, vit, mlp, transformer, encoder-decoder, classification, detection, segmentation, Generative Adversarial Network, deep learning, semantic-segmentation, object-detection
Python 1.24 k
3 years ago
yitu-opensource / T2T-ViT

ICCV2021, Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet

vision-transformer, vit
Jupyter Notebook 1.19 k
2 years ago
sail-sg / Adan

#Computer Science# Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models

bert-model, convnext, deep learning, fairseq, optimizer, resnet, timm, vit, transformer-xl, artificial intelligence, diffusion, dreamfusion, gpt2, PyTorch, cuda-programming, llm-training, large language models, moe
Python 792
7 days ago
v-iashin / video_features

Extract video features from raw videos using multiple GPUs. We support RAFT flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, and TIMM models.

PyTorch, feature-extraction, parallel, audio-features, i3d, resnet, raft, optical-flow, clip, timm, vit
Python 599
5 months ago
thu-ml / SpargeAttn

#Large Language Models# SpargeAttention: A training-free sparse attention that can accelerate any model inference.

ai-infra, attention, large language models, mlsys, quantization, vision-transformer, video-generation, vit
Cuda 593
5 days ago
chinhsuanwu / mobilevit-pytorch

A PyTorch implementation of "MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer"

vit, mobilenetv2, vision-transformer
Python 533
3 years ago
zgcr / SimpleAICV_pytorch_training_examples

SimpleAICV: PyTorch training and testing examples.

PyTorch, resnet, vit, van, detr, fcos, retinanet, deeplabv3plus, solov2, yolact, dbnet, sam, segment-anything
Jupyter Notebook 434
1 month ago
eeyhsong / EEG-Transformer

#Computer Science# i. A practical application of Transformer (ViT) to 2-D physiological signal (EEG) classification tasks. It could also be tried with EMG, EOG, ECG, etc. ii. Including the attention of spatial dimension (c...

deep learning, attention-mechanism, vit, transformer, attention, EEG, eeg-classification
Python 301
2 years ago
gupta-abhay / pytorch-vit

An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale

image-recognition, transformers, image-classification, vit, vision-transformer
Python 295
4 years ago
vatz88 / FFCSonTheGo

FFCS course registration made hassle-free for VITians. Search courses and visualize the timetable on the go!

vit, vellore, ffcs, timetable, Hacktoberfest, JavaScript
JavaScript 292
7 months ago
PaddlePaddle / PASSL

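#Computer Science# PASSL includes image self-supervised learning algorithms such as SimCLR, MoCo v1/v2, BYOL, CLIP, PixPro, simsiam, SwAV, BEiT, and MAE, as well as foundational vision algorithms such as Vision Transformer, DEiT, Swin Transformer, CvT, T2T-ViT, MLP-Mixer, XCiT, ConvNeXt, and PVTv2.

deep learning, moco, simclr, clip, self-supervised-learning, paddle, swin-transformer, vision-transformer, beit, convnext, vit, deit, pvt, swav
Python 283
2 years ago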
megvii-research / RevCol

Official code for the papers "Reversible Column Networks" and "RevColv2"

cnn, computer vision, PyTorch, transformer, vit
Python 263
2 years ago