#

cross-modal-learning

https://static.github-zh.com/github_avatars/MohamedAfham?size=40

#计算机科学#Official implementation of "CrossPoint: Self-Supervised Cross-Modal Contrastive Learning for 3D Point Cloud Understanding" (CVPR, 2022)

Python 256
2 年前
https://static.github-zh.com/github_avatars/whwu95?size=40

【CVPR'2023 Highlight & TPAMI】Cap4Video: What Can Auxiliary Captions Do for Text-Video Retrieval?

Python 243
10 个月前
https://static.github-zh.com/github_avatars/whwu95?size=40

【AAAI'2023 & IJCV】Transferring Vision-Language Models for Visual Recognition: A Classifier Perspective

Python 195
1 年前
https://static.github-zh.com/github_avatars/whwu95?size=40

【CVPR'2023】Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models

Python 152
1 年前
https://static.github-zh.com/github_avatars/Toytiny?size=40

#计算机科学#[CVPR 2023 Highlight 💡] Hidden Gems: 4D Radar Scene Flow Learning Using Cross-Modal Supervision

Python 131
2 年前
https://static.github-zh.com/github_avatars/RunpeiDong?size=40

[ICLR 2023] Autoencoders as Cross-Modal Teachers: Can Pretrained 2D Image Transformers Help 3D Representation Learning?

Python 102
1 年前
https://static.github-zh.com/github_avatars/WinfredGe?size=40

#计算机科学#[IJCAI 2025] Official implementation of "T2S: High-resolution Time Series Generation with Text-to-Series Diffusion Models"

Python 54
25 天前
https://static.github-zh.com/github_avatars/knightyxp?size=40

[AAAI 2024] DGL: Dynamic Global-Local Prompt Tuning for Text-Video Retrieval.

Python 42
1 年前
https://static.github-zh.com/github_avatars/GaochangWu?size=40
Python 21
1 个月前
https://static.github-zh.com/github_avatars/frank-chris?size=40

In this work, we implement different cross-modal learning schemes such as Siamese Network, Correlational Network and Deep Cross-Modal Projection Learning model and study their performance. We also pro...

Jupyter Notebook 11
4 年前
https://static.github-zh.com/github_avatars/StarMoonWang?size=40

Official Pytorch Implementation of SeisMoLLM: Advancing Seismic Monitoring via Cross-modal Transfer with Pre-trained Large Language Model

Python 9
1 个月前
https://static.github-zh.com/github_avatars/ospanbatyr?size=40

Code for the "Sample-efficient Integration of New Modalities into Large Language Models" paper

Python 8
8 天前
https://static.github-zh.com/github_avatars/Markin-Wang?size=40

[IJBHI 2024] This is the official implementation of CAMANet: Class Activation Map Guided Attention Network for Radiology Report Generation accepted to IEEE Journal of Biomedical and Health Informatic...

Python 8
4 个月前
https://static.github-zh.com/github_avatars/verlab?size=40

Original PyTorch implementation of the code for the paper "Straight to the Point: Fast-forwarding Videos via Reinforcement Learning Using Textual Data" at the IEEE/CVF Conference on Computer Vision an...

Python 8
3 年前
https://static.github-zh.com/github_avatars/IGITUGraz?size=40

Code for Limbacher, T., Özdenizci, O., & Legenstein, R. (2022). Memory-enriched computation and learning in spiking neural networks through Hebbian plasticity. arXiv preprint arXiv:2205.11276.

Python 7
2 年前
https://static.github-zh.com/github_avatars/codiceSpaghetti?size=40

#自然语言处理#This project creates the T4SA 2.0 dataset, i.e. a big set of data to train visual models for Sentiment Analysis in the Twitter domain using a cross-modal student-teacher approach.

Jupyter Notebook 4
4 个月前
https://static.github-zh.com/github_avatars/PrithivirajDamodaran?size=40

An intentionally simple Image to Food cross-modal search. Created by Prithiviraj Damodaran.

4
4 年前
loading...
Website
Wikipedia