GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

cross-modal

Website
Wikipedia
jina-ai/discoart
https://static.github-zh.com/github_avatars/jina-ai?size=40
jina-ai / discoart

🪩 Create Disco Diffusion artworks in one line

creative-aidisco-diffusioncross-modaldallegenerative-artmultimodaldiffusionpromptsmidjourneyimgenclip-guided-diffusionlatent-diffusionstable-diffusion
Python 3.84 k
2 年前
docarray/docarray
https://static.github-zh.com/github_avatars/docarray?size=40
docarray / docarray

#计算机科学#Represent, send, store and search multimodal data

docarray数据结构multimodalcross-modalneural-search深度学习nested-dataqdrantweaviatenearest-neighbor-searchprotobufelasticsearchmulti-modalsemantic-search机器学习PyTorchFastAPIpydantic
Python 3.07 k
9 天前
https://static.github-zh.com/github_avatars/shaoxiongji?size=40
shaoxiongji / knowledge-graphs

#自然语言处理#A collection of research on knowledge graphs

knowledge-graphrepresentation-learningrelation-extractionreasoning自然语言处理nercommonsensecross-modalquestion-answeringdialogue-systemsinformation-retrievalsurveyBukkit
JavaScript 1.75 k
3 年前
https://static.github-zh.com/github_avatars/krantiparida?size=40
krantiparida / awesome-audio-visual

#Awesome#A curated list of different papers and datasets in various areas of audio-visual processing

Awesome Listscross-modalLocalization (l10n)source-separation
733
1 年前
https://static.github-zh.com/github_avatars/kuanghuei?size=40
kuanghuei / SCAN

#计算机科学#PyTorch source code for "Stacked Cross Attention for Image-Text Matching" (ECCV 2018)

cross-modalimage-captioning神经网络深度学习PyTorch机器视觉
Python 565
2 年前
https://static.github-zh.com/github_avatars/towhee-io?size=40
towhee-io / examples

#自然语言处理#Analyze the unstructured data with Towhee, such as reverse image search, reverse video search, audio classification, question and answer systems, molecular search, etc.

audio-classificationcross-modalembeddingsimage-classification机器学习自然语言处理
Jupyter Notebook 498
1 年前
https://static.github-zh.com/github_avatars/yisun98?size=40
yisun98 / SOLC

Remote Sensing Sar-Optical Land-use Classfication Pytorch Pytorch高分辨率遥感语义分割/地物分割/地物分类

PyTorchremote-sensingsegmentationdeeplabv3cross-modalmulti-modal
Python 220
1 年前
https://static.github-zh.com/github_avatars/JizhiziLi?size=40
JizhiziLi / RIM

[CVPR 2023] Referring Image Matting

cross-modalimage-mattingimage-segmentationmultimodalmatting
208
2 年前
https://static.github-zh.com/github_avatars/DRSY?size=40
DRSY / MoTIS

#向量搜索引擎#[NAACL 2022]Mobile Text-to-Image search powered by multimodal semantic representation models(e.g., OpenAI's CLIP)

iOS人工智能image-searchclipvector-searchknnlshsemantic-searchknowledge-distillationretrievalcross-modalnaacl
Swift 123
2 年前
https://static.github-zh.com/github_avatars/QizhiPei?size=40
QizhiPei / BioT5

#自然语言处理#BioT5 (EMNLP 2023) and BioT5+ (ACL 2024 Findings)

Bioinformaticscomputational-biologycross-modal机器学习自然语言处理nlp-applications
Python 114
9 个月前
https://static.github-zh.com/github_avatars/Zengyi-Qin?size=40
Zengyi-Qin / Weakly-Supervised-3D-Object-Detection

Weakly Supervised 3D Object Detection from Point Clouds (VS3D), ACM MM 2020

3d-object-detectionkittiPoint cloudcross-modaltransfer-learningTensorflowlidarmonocularstereounsupervised-learning
Jupyter Notebook 107
2 年前
https://static.github-zh.com/github_avatars/qcraftai?size=40
qcraftai / distill-bev

DistillBEV: Boosting Multi-Camera 3D Object Detection with Cross-Modal Knowledge Distillation (ICCV 2023)

3d-object-detectionknowledge-distillationlidarnuscenesPoint cloudself-drivingautonomous-drivingdistillationcross-modalmulti-modal
Python 102
2 年前
https://static.github-zh.com/github_avatars/yangli18?size=40
yangli18 / VLTVG

Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning, CVPR 2022

vision-languagecross-modal
Python 97
3 年前
https://static.github-zh.com/github_avatars/rohitrango?size=40
rohitrango / objects-that-sound

#计算机科学#Unofficial Implementation of Google Deepmind's paper `Objects that Sound`

机器学习深度学习audio-videoembeddings深度神经网络deepmindcross-modal
Python 83
7 年前
https://static.github-zh.com/github_avatars/marslanm?size=40
marslanm / Multimodality-Representation-Learning

This repository provides a comprehensive collection of research papers focused on multimodal representation learning, all of which have been cited and discussed in the survey just accepted https://dl....

cross-modalmultimodal-datasetsmultimodal-deep-learningmultimodal-pre-trained-modeltransformer-modelsvision-language-pretraining
75
2 年前
https://static.github-zh.com/github_avatars/kywen1119?size=40
kywen1119 / DSRAN

Code for journal paper "Learning Dual Semantic Relations with Graph Attention for Image-Text Matching", TCSVT, 2020.

PyTorchimage-text-matchingcross-modal机器视觉
Python 72
3 年前
https://static.github-zh.com/github_avatars/Paranioar?size=40
Paranioar / UniPT

[CVPR2024] The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory"

cross-modalparameter-efficient-learningparameter-efficient-tuningtransfer-learningparameter-efficient-fine-tuning
Python 68
8 个月前
https://static.github-zh.com/github_avatars/Eaphan?size=40
Eaphan / UPIDet

Unleash the Potential of Image Branch for Cross-modal 3D Object Detection [NeurIPS2023]

3d-object-detectioncross-modalmulti-modal
Python 61
1 年前
https://static.github-zh.com/github_avatars/GT-RIPL?size=40
GT-RIPL / Xmodal-Ctx

Official PyTorch implementation of our CVPR 2022 paper: Beyond a Pre-Trained Object Detector: Cross-Modal Textual and Visual Context for Image Captioning

clipcross-modalimage-captioningvision-and-language
Python 59
3 年前
https://static.github-zh.com/github_avatars/zjukg?size=40
zjukg / DUET

[Paper][AAAI 2023] DUET: Cross-modal Semantic Grounding for Contrastive Zero-shot Learning

pretrained-language-modelPyTorchtransformerzero-shot-learningcross-modalgroundingsemantic
Python 52
1 年前
loading...