# vision-language-transformer

IDEA-Research

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python 8.88 k
1 year ago

salesforce

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Jupyter Notebook 5.48 k
1 year ago

shenyunhang

[CVPR 2024] Aligning and Prompting Everything All at Once for Universal Visual Perception

Python 588
1 year ago

henghuiding

[ICCV2021 & TPAMI2023] Vision-Language Transformer and Query Generation for Referring Segmentation

Python 358
4 years ago

yiren-jian

[NeurIPS 2023] Bootstrapping Vision-Language Learning with Decoupled Language Pre-training

Python 25
2 years ago

ThomasVonWu

#Large Language Models# A collection of VLM papers, blogs, and projects, with a focus on VLMs in autonomous driving and related reasoning techniques.

10
10 months ago

aurooj

Mini-batch selective sampling for knowledge adaption of VLMs for mammography.

Jupyter Notebook 1
1 year ago