GitHub Chinese Community
#textvqa

facebookresearch / mmf

#Computer Science# A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)

PyTorch, vqa, pretrained-models, multimodal, deep-learning, captioning, dialog, textvqa, hateful-memes, multi-tasking
Python 5.57k
2 months ago
yashkant / sam-textvqa

Official code for paper "Spatially Aware Multimodal Transformers for TextVQA" published at ECCV, 2020.

eccv, textvqa, vision, language
Python 64
4 years ago
phiyodr / vqaloader

PyTorch DataLoader for many VQA datasets

dataloader, PyTorch, textvqa, vqa
Python 12
2 years ago
soonchangAI / LFPR

[PRL 2024] This is the code repo for our label-free pruning and retraining technique for autoregressive Text-VQA Transformers (TAP, TAP†).

textvqa, transformer
Python 2
10 months ago