#

multimodal-learning

https://static.github-zh.com/github_avatars/KaiyangZhou?size=40

Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)

Python 2.07 k
1 年前
https://static.github-zh.com/github_avatars/ArrowLuo?size=40

An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"

Python 983
1 年前
https://static.github-zh.com/github_avatars/DmitryRyumin?size=40

#人脸识别#ICCV 2023 Papers: Discover cutting-edge research from ICCV 2023, the leading computer vision conference. Stay updated on the latest in computer vision and deep learning, with code included. ⭐ support ...

Python 955
1 年前
https://static.github-zh.com/github_avatars/declare-lab?size=40

This repository contains various models targetting multimodal representation learning, multimodal fusion for downstream tasks such as multimodal sentiment analysis.

OpenEdge ABL 865
3 年前
https://static.github-zh.com/github_avatars/georgian-io?size=40

#自然语言处理#Multimodal model for text and tabular data with HuggingFace transformers as building block for text data

Python 605
1 年前
https://static.github-zh.com/github_avatars/sangminwoo?size=40

#Awesome#A curated list of awesome vision and language resources (still under construction... stay tuned!)

549
10 个月前
https://static.github-zh.com/github_avatars/henghuiding?size=40
Python 518
1 个月前
https://static.github-zh.com/github_avatars/subho406?size=40

#自然语言处理#Official Pytorch implementation of "OmniNet: A unified architecture for multi-modal multi-task learning" | Authors: Subhojeet Pramanik, Priyanka Agrawal, Aman Hussain

Python 513
5 年前
https://static.github-zh.com/github_avatars/DmitryRyumin?size=40

#人脸识别#ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processi...

Python 506
4 个月前
loading...
Website
Wikipedia