# visual-commonsense-reasoning

https://static.github-zh.com/github_avatars/rowanz?size=40

Recognition to Cognition Networks (code for the model in "From Recognition to Cognition: Visual Commonsense Reasoning", CVPR 2019)

Python 468
4 years ago
https://static.github-zh.com/github_avatars/guyyariv?size=40

#Computer Science# This repo contains the official PyTorch implementation of vLMIG: Improving Visual Commonsense in Language Models via Multiple Image Generation

Python 16
1 year ago
https://static.github-zh.com/github_avatars/baohuyvanba?size=40

Vision-Zephyr: a multimodal LLM for Visual Commonsense Reasoning—CLIP-ViT + Zephyr-7B with visual prompting; code, training scripts, and VCR evaluation.

Python 1
21 days ago