GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

referring-expression-comprehension

Website
Wikipedia
https://static.github-zh.com/github_avatars/OFA-Sys?size=40
OFA-Sys / OFA

Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework

multimodalpretrainingimage-captioningtext-to-image-synthesisvisual-question-answeringreferring-expression-comprehensionvision-languagepretrained-modelspromptprompt-tuning中文
Python 2.5 k
1 年前
https://static.github-zh.com/github_avatars/MasterBin-IIAU?size=40
MasterBin-IIAU / UNINEXT

[CVPR'23] Universal Instance Perception as Object Discovery and Retrieval

instance-segmentationobject-detectionobject-trackingperceptionreferring-expression-comprehensionreferring-expression-segmentationunified-modelmulti-object-tracking-segmentationmultiple-object-trackingreferring-video-object-segmentationvideo-instance-segmentationsingle-object-trackingvideo-object-segmentation
Python 1.27 k
2 年前
https://static.github-zh.com/github_avatars/FoundationVision?size=40
FoundationVision / GLEE

[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale

foundation-modelobject-detectionopen-worldtrackingopen-vocabulary-detectionopen-vocabulary-segmentationopen-vocabulary-video-segmentationreferring-expression-comprehensionreferring-expression-segmentationvideo-instance-segmentationvideo-object-segmentationzero-shot-object-detectionreferring-video-object-segmentationinteractive-segmentationsegment-anything
Python 1.13 k
8 个月前
https://static.github-zh.com/github_avatars/henghuiding?size=40
henghuiding / ReLA

[CVPR2023 Highlight] GRES: Generalized Referring Expression Segmentation

multimodal-learningreferring-expression-comprehensionreferring-expression-segmentationvision-language-transformercvpr2023
Python 702
2 年前
https://static.github-zh.com/github_avatars/shenyunhang?size=40
shenyunhang / APE

[CVPR 2024] Aligning and Prompting Everything All at Once for Universal Visual Perception

image-segmentationobject-detectionopen-worldreferring-expression-comprehensionvision-language-transformer
Python 569
1 年前
https://static.github-zh.com/github_avatars/henghuiding?size=40
henghuiding / MeViS

[ICCV 2023] MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions

multimodal-learningreferring-expression-comprehensionreferring-expression-segmentationreferring-video-object-segmentationvideo-understanding
Python 527
1 年前
https://static.github-zh.com/github_avatars/Charles-Xie?size=40
Charles-Xie / awesome-described-object-detection

#Awesome#A curated list of papers and resources related to Described Object Detection, Open-Vocabulary/Open-World Object Detection and Referring Expression Comprehension. Updated frequently and pull requests w...

Awesome Listsopen-vocabulary-detectionreferring-expression-comprehension
280
10 天前
https://static.github-zh.com/github_avatars/henghuiding?size=40
henghuiding / gRefCOCO

A benchmark dataset for GRES and GREC [CVPR2023 Highlight]

datasetreferring-expression-comprehensionreferring-expression-segmentation
Python 236
2 年前
https://static.github-zh.com/github_avatars/luogen1996?size=40
luogen1996 / MCN

[CVPR2020] Multi-task Collaborative Network for Joint Referring Expression Comprehension and Segmentation, CVPR2020 (oral)

cvpr2020referring-expression-comprehensionreferring-expression-segmentationmulti-task-learning
Python 138
3 年前
https://static.github-zh.com/github_avatars/shikras?size=40
shikras / d-cube

A detection/segmentation dataset with labels characterized by intricate and flexible expressions. "Described Object Detection: Liberating Object Detection with Flexible Expressions" (NeurIPS 2023).

multi-modal-learningobject-detectionreferring-expression-comprehensionvision-languagedatasetopen-vocabulary-detection
Python 125
1 年前
https://static.github-zh.com/github_avatars/MILVLG?size=40
MILVLG / rosita

ROSITA: Enhancing Vision-and-Language Semantic Alignments via Cross- and Intra-modal Knowledge Integration

vision-and-languagevqapre-trainingimage-text-retrievalreferring-expression-comprehension
Python 56
2 年前
https://static.github-zh.com/github_avatars/luogen1996?size=40
luogen1996 / SimREC

A lightweight codebase for referring expression comprehension and segmentation

referring-expression-comprehensionreferring-expression-segmentation
Python 55
3 年前
https://static.github-zh.com/github_avatars/IDEA-Research?size=40
IDEA-Research / Rex-Thinker

Rex-Thinker: Grounded Object Refering via Chain-of-Thought Reasoning

mllmobject-detectionreferring-expression-comprehensiongrpo
Python 27
10 天前
https://static.github-zh.com/github_avatars/xuyang-liu16?size=40
xuyang-liu16 / VGDiffZero

[ICASSP 2024] VGDiffZero: Text-to-image Diffusion Models Can Be Zero-shot Visual Grounders

机器视觉vision-language-modelzero-shot-learningstable-diffusiontext-to-image-generationreferring-expression-comprehension
Python 16
4 个月前
https://static.github-zh.com/github_avatars/Disguiser15?size=40
Disguiser15 / RefTeacher

RefTeacher is a strong baseline method for Semi-Supervised Referring Expression Comprehension.

referring-expression-comprehensionsemi-supervised-learning
Python 12
2 年前
https://static.github-zh.com/github_avatars/willemsenbram?size=40
willemsenbram / a-game-of-sorts

Repository for the paper "Collecting Visually-Grounded Dialogue with A Game Of Sorts"

datasetdialoguereferring-expression-comprehensionvision-and-language
Shell 4
2 年前
https://static.github-zh.com/github_avatars/lparolari?size=40
lparolari / harlequin

Code and DataLoader for the Harlequin dataset 🎨 described in the paper "Harlequin: Color-driven Generation of Synthetic Data for Referring Expression Comprehension", presented at ICPR'24

referring-expression-comprehensionsynthetic-data-generation
Python 3
7 个月前
https://static.github-zh.com/github_avatars/rd20karim?size=40
rd20karim / MB-ORES

MB-ORES: A Multi-Branch Object Reasoner for Visual Grounding in Remote Sensing

人工智能机器视觉mahcine-learningobject-detectionreferring-expression-comprehensionremote-sensingrepresentation-learningtransformervision-languagemultimodalreasoningstate-of-the-art
1
2 个月前
https://static.github-zh.com/github_avatars/antonio-f?size=40
antonio-f / Florence-2-test

Florence-2 quick test

florence-2multimodal-large-language-modelshuggingface-transformersimage-captioningimage-to-textJupyter NotebookPythonreferring-expression-comprehension教程colab-notebook
Jupyter Notebook 0
10 个月前