GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

multimodal-learning

Website
Wikipedia
https://static.github-zh.com/github_avatars/pliang279?size=40
pliang279 / awesome-multimodal-ml

#自然语言处理#Reading list for research topics in multimodal machine learning

multimodal-learning机器学习representation-learning自然语言处理机器视觉speech-processingRoboticshealthcarereading-list深度学习reinforcement-learning
6.5 k
10 个月前
https://static.github-zh.com/github_avatars/mlfoundations?size=40
mlfoundations / open_flamingo

#计算机科学#An open-source framework for training large multimodal models.

机器视觉深度学习in-context-learninglanguage-modelmultimodal-learningPyTorchflamingo
Python 3.95 k
10 个月前
https://static.github-zh.com/github_avatars/KaiyangZhou?size=40
KaiyangZhou / CoOp

Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)

foundation-modelsmultimodal-learningprompt-learning
Python 1.98 k
1 年前
Eurus-Holmes/Awesome-Multimodal-Research
https://static.github-zh.com/github_avatars/Eurus-Holmes?size=40
Eurus-Holmes / Awesome-Multimodal-Research

A curated list of Multimodal Related Research.

Awesome Listsmultimodal-researchmultimodal-learningmultimodal
Python 1.35 k
2 年前
https://static.github-zh.com/github_avatars/AILab-CVC?size=40
AILab-CVC / UniRepLKNet

#计算机科学#[CVPR'24] UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition

architecture人工智能convolutional-neural-networks深度学习multimodal-learning
Python 997
8 个月前
https://static.github-zh.com/github_avatars/PreferredAI?size=40
PreferredAI / cornac

A Comparative Framework for Multimodal Recommender Systems

recommender-systemrecommendation-algorithmsrecommendation-enginematrix-factorizationcollaborative-filteringmultimodal-learningrecommendation-systemmultimodality
Python 960
2 个月前
https://static.github-zh.com/github_avatars/ArrowLuo?size=40
ArrowLuo / CLIP4Clip

An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"

multimodal-learningmultimodalitymultimodalsearchrankingretrieval-modelretrievalactivitynetclip
Python 953
1 年前
https://static.github-zh.com/github_avatars/DmitryRyumin?size=40
DmitryRyumin / ICCV-2023-Papers

#人脸识别#ICCV 2023 Papers: Discover cutting-edge research from ICCV 2023, the leading computer vision conference. Stay updated on the latest in computer vision and deep learning, with code included. ⭐ support ...

iccviccv20233D3d-reconstructionbiometrics机器视觉数据集深度学习explainable-aiface-recognitiongesture-recognition图像处理image-synthesispattern-recognitionRoboticsvideo-synthesismultimodal-learningPhotogrammetrypose-estimationtransfer-learning
Python 953
9 个月前
https://static.github-zh.com/github_avatars/declare-lab?size=40
declare-lab / multimodal-deep-learning

This repository contains various models targetting multimodal representation learning, multimodal fusion for downstream tasks such as multimodal sentiment analysis.

multimodal-deep-learningmultimodal-learningmultimodal-interactions
OpenEdge ABL 846
2 年前
https://static.github-zh.com/github_avatars/HuaizhengZhang?size=40
HuaizhengZhang / Awsome-Deep-Learning-for-Video-Analysis

#计算机科学#Papers, code and datasets about deep learning and multi-modal learning for video analysis

深度学习video-analysisBukkitmultimodal-learning机器学习video-classification
799
4 年前
https://static.github-zh.com/github_avatars/richard-peng-xia?size=40
richard-peng-xia / awesome-multimodal-in-medical-imaging

A collection of resources on applications of multi-modal learning in medical imaging.

Medical imagingmultimodal-deep-learningmultimodal-learningvisual-question-answeringlarge-language-modelslarge-multimodal-modelsmultimodal-large-language-models
760
11 天前
https://static.github-zh.com/github_avatars/henghuiding?size=40
henghuiding / ReLA

[CVPR2023 Highlight] GRES: Generalized Referring Expression Segmentation

multimodal-learningreferring-expression-comprehensionreferring-expression-segmentationvision-language-transformercvpr2023
Python 702
2 年前
https://static.github-zh.com/github_avatars/georgian-io?size=40
georgian-io / Multimodal-Toolkit

#自然语言处理#Multimodal model for text and tabular data with HuggingFace transformers as building block for text data

huggingface-transformerstransformer自然语言处理tabular-datamultimodal-learning
Python 604
8 个月前
https://static.github-zh.com/github_avatars/pliang279?size=40
pliang279 / MultiBench

#自然语言处理#[NeurIPS 2021] Multiscale Benchmarks for Multimodal Representation Learning

机器学习multimodal-learningRobotics自然语言处理机器视觉深度学习healthcarerepresentation-learningspeech-processing
HTML 556
1 年前
https://static.github-zh.com/github_avatars/sangminwoo?size=40
sangminwoo / awesome-vision-and-language

#Awesome#A curated list of awesome vision and language resources (still under construction... stay tuned!)

Awesome Listsvision-and-languagemultimodal-learning
534
7 个月前
https://static.github-zh.com/github_avatars/henghuiding?size=40
henghuiding / MeViS

[ICCV 2023] MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions

multimodal-learningreferring-expression-comprehensionreferring-expression-segmentationreferring-video-object-segmentationvideo-understanding
Python 527
1 年前
https://static.github-zh.com/github_avatars/subho406?size=40
subho406 / OmniNet

#自然语言处理#Official Pytorch implementation of "OmniNet: A unified architecture for multi-modal multi-task learning" | Authors: Subhojeet Pramanik, Priyanka Agrawal, Aman Hussain

机器学习深度学习神经网络人工智能transformer自然语言处理image-captioningvideo-recognitionmultitask-learningmultimodal-learning
Python 512
5 年前
https://static.github-zh.com/github_avatars/microsoft?size=40
microsoft / XPretrain

#自然语言处理#Multi-modality pre-training

multimodal-learningpre-trainingmultimedia机器视觉自然语言处理
Python 495
1 年前
https://static.github-zh.com/github_avatars/njustkmg?size=40
njustkmg / OMML

Multi-Modal learning toolkit based on PaddlePaddle and PyTorch, supporting multiple applications such as multi-modal classification, cross-modal retrieval and image caption.

multimodalmultimodal-learningPythonpaddlepaddlePyTorchcrossmodal-retrievalimagecaptioningclassification
Python 475
2 年前
https://static.github-zh.com/github_avatars/DmitryRyumin?size=40
DmitryRyumin / ICASSP-2023-24-Papers

#人脸识别#ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processi...

asrdenoisingdomain-adaptationface-recognitionlanguage-modelingself-supervised-learningsemantic-segmentationsignal-processingspeech-recognitionvadgenerative-modelsimage-generationmusic-generationmultimodal-learning
Python 472
1 个月前
loading...