GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

vision-transformer

Website
Wikipedia
https://static.github-zh.com/github_avatars/open-mmlab?size=40
open-mmlab / mmdetection

OpenMMLab Detection Toolbox and Benchmark

object-detectioninstance-segmentationfast-rcnnfaster-rcnnmask-rcnncascade-rcnnssdretinanetPyTorchpanoptic-segmentationrtmdetswin-transformertransformervision-transformeryoloconvnextdetrgrounding-dino
Python 31.17 k
10 个月前
lukas-blecher/LaTeX-OCR
https://static.github-zh.com/github_avatars/lukas-blecher?size=40
lukas-blecher / LaTeX-OCR

#计算机科学#pix2tex: Using a ViT to convert images of equations into LaTeX code.

机器学习transformerim2latex深度学习image2textLaTeXdatasetPyTorchim2markupOCRlatex-ocrvitmath-ocrvision-transformer图像处理Pythonim2text
Python 14.55 k
5 个月前
https://static.github-zh.com/github_avatars/NielsRogge?size=40
NielsRogge / Transformers-Tutorials

This repository contains demos I made with the Transformers library by HuggingFace.

transformersPyTorchbertvision-transformerlayoutlmgpt-2
Jupyter Notebook 10.99 k
20 天前
https://static.github-zh.com/github_avatars/FoundationVision?size=40
FoundationVision / VAR

[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultr...

auto-regressive-modeldiffusion-modelsimage-generationtransformersautoregressive-modelsgenerative-aigenerative-modelgptgpt-2large-language-modelsvision-transformerneurips
Jupyter Notebook 8.22 k
1 个月前
https://static.github-zh.com/github_avatars/adithya-s-k?size=40
adithya-s-k / omniparse

Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks

OCRomniparserparse-serverparser-libraryvision-transformerweb-crawler
Python 6.58 k
4 天前
https://static.github-zh.com/github_avatars/JingyunLiang?size=40
JingyunLiang / SwinIR

SwinIR: Image Restoration Using Swin Transformer (official repository)

image-super-resolutionimage-denoisingcompression-artifact-reductionimage-deblockingtransformerreal-world-image-super-resolutionlightweight-image-super-resolutionimage-restorationlow-level-visionvision-transformerrestorationsuper-resolutiondenoisingdecompression
Python 4.92 k
1 年前
https://static.github-zh.com/github_avatars/cmhungsteve?size=40
cmhungsteve / Awesome-Transformer-Attention

#Awesome#An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites

transformerattention-mechanismvision-transformer深度学习Awesome Liststransformer-cvtransformer-architecturetransformer-awesometransformer-with-cvtransformer-modelsvisual-transformer机器视觉papersattention-mechanismsself-attentionvitdetrtransformers
4.88 k
1 年前
https://static.github-zh.com/github_avatars/huawei-noah?size=40
huawei-noah / Efficient-AI-Backbones

Efficient AI Backbones including GhostNet, TNT and MLP, developed by Huawei Noah's Ark Lab.

convolutional-neural-networksefficient-inferenceimagenetmodel-compressionTensorflowPyTorchghostnettransformerpretrained-modelsvision-transformer
Python 4.24 k
3 个月前
https://static.github-zh.com/github_avatars/open-mmlab?size=40
open-mmlab / mmpretrain

#计算机科学#OpenMMLab Pre-training Toolbox and Benchmark

image-classificationresnetmobilenetPyTorch深度学习swin-transformerbeitclipconstrastive-learningconvnextmasked-image-modelingmocopretrained-modelsself-supervised-learningvision-transformermultimodal
Python 3.68 k
7 个月前
https://static.github-zh.com/github_avatars/google-research?size=40
google-research / scenic

#计算机科学#Scenic: A Jax Library for Computer Vision Research and Beyond

jax机器视觉深度学习researchattentiontransformersvision-transformer
Python 3.56 k
2 天前
https://static.github-zh.com/github_avatars/towhee-io?size=40
towhee-io / towhee

#大语言模型#Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.

机器学习convolutional-networksembedding-vectorsembeddings机器视觉图像处理video-processingfeature-extractionimage-retrievalunstructured-datafeature-vectortransformermilvusvision-transformervitpipeline大语言模型
Python 3.38 k
8 个月前
https://static.github-zh.com/github_avatars/mit-han-lab?size=40
mit-han-lab / efficientvit

Efficient vision foundation models for high-resolution generation and perception.

high-resolutionimagenetefficientvitsegment-anythingsegmentationvision-transformerdeep-compression-autoencoderefficient-diffusion-model
Python 2.91 k
2 个月前
https://static.github-zh.com/github_avatars/InternLM?size=40
InternLM / InternLM-XComposer

#大语言模型#InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

ChatGPTvisual-language-learningmulti-modalityfoundationgpt-4instruction-tuningmllmmultimodalvision-language-modellanguage-model大语言模型large-vision-language-modelvision-transformergpt
Python 2.84 k
20 天前
https://static.github-zh.com/github_avatars/baaivision?size=40
baaivision / EVA

EVA Series: Visual Representation Fantasies from BAAI

foundation-modelsrepresentation-learningvision-transformer
Python 2.5 k
10 个月前
https://static.github-zh.com/github_avatars/OpenGVLab?size=40
OpenGVLab / InternVideo

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

foundation-modelsvideo-understandingvision-transformeraction-recognitionmultimodaltemporal-action-localizationvideo-question-answeringzero-shot-classificationbenchmarkcontrastive-learningself-supervisedinstruction-tuningvideo-clip
Python 1.91 k
22 天前
https://static.github-zh.com/github_avatars/hila-chefer?size=40
hila-chefer / Transformer-Explainability

#计算机科学#[CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize classifications by Transformer based networks.

深度学习vision-transformerbert-modelbertexplainabilityvitcvpr2021
Jupyter Notebook 1.9 k
1 年前
https://static.github-zh.com/github_avatars/alibaba?size=40
alibaba / EasyCV

An all-in-one toolkit for computer vision

self-supervised-learningtransformersclassification机器视觉object-detectionPyTorchvision-transformer
Python 1.87 k
1 个月前
https://static.github-zh.com/github_avatars/microsoft?size=40
microsoft / Cream

This is a collection of our NAS and Vision Transformer work.

nasautomlvision-transformerrpevit-compressionefficiencyknowledge-distillation
Python 1.77 k
1 年前
https://static.github-zh.com/github_avatars/ViTAE-Transformer?size=40
ViTAE-Transformer / ViTPose

#计算机科学#The official repo for [NeurIPS'22] "ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation" and [TPAMI'23] "ViTPose++: Vision Transformer for Generic Body Pose Estimation"

深度学习distillationpose-estimationPyTorchself-supervised-learningvision-transformer
Python 1.63 k
1 年前
https://static.github-zh.com/github_avatars/MCG-NJU?size=40
MCG-NJU / VideoMAE

[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training

self-supervised-learningaction-recognitionvideo-understandingtransformervision-transformerPyTorchvideo-analysisneurips-2022
Python 1.51 k
2 年前
loading...