回车: Github搜索 Shift+回车: Google搜索

该仓库已收录但尚未编辑。项目介绍及使用教程请前往 GitHub 阅读 README

0 条讨论

登录后发表评论

关于

[TMM 2023] VideoXum: Cross-modal Visual and Textural Summarization of Videos

创建时间

2024-01-04

是否国产

否

修改时间

2024-04-09T06:47:09Z

Readme

语言

Python100.0%

您可能感兴趣的

GolangTraining

Todd McLeod@GoesToEleven

Training for Golang (go language)

Go10.28 k

1 年前

streaming-llm

MIT HAN Lab@mit-han-lab

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Python7.05 k

1 年前

ChatCaptioner

@Vision-CAIR

Official Repository of ChatCaptioner

Jupyter Notebook462

2 年前

TransUNet

@Beckschen

This repository includes the official project of TransUNet, presented in our paper: TransUNet: Transformers Make Strong Encoders for Medical Image Segmentation.

Python2.91 k

1 年前

VLTinT

@UARK-AICV

[AAAI 2023 Oral] VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning

aaai2023 transformer-architecture video-captioning vision-language PyTorch

Jupyter Notebook68

2 年前

build-web-application-with-golang

astaxie@astaxie

Go Web 编程电子书

Go43.96 k

1 年前🇨🇳

30dayMakeCppServer

@yuesong-feng

30天自制C++服务器，包含教程和源代码

C++Server cppserver epoll socket

C++6.85 k

6 个月前🇨🇳

HERO_Video_Feature_Extractor

@linjieli222

Video Feature Extraction Code for EMNLP 2020 paper "HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training"

PyTorch slowfast resnet vision-and-language

Python114

4 年前

video-summarization-resources

@robi56

Video Summarization Dataset, Papers, Codes

172

7 年前

OrcaSlicer

@SoftFever

G-code generator for 3D printers (Bambu, Prusa, Voron, VzBot, RatRig, Creality, etc.)

3d-printer 3D makers

C++11.01 k

10 小时前

wdi5

@ui5-community

official UI5 end-to-end test framework for UI5 web-apps. wdi5 = Webdriver.IO + UI5 Test API

ui5 webdriverio OpenUI5 Testing

TypeScript109

13 天前

video_features

@v-iashin

Extract video features from raw videos using multiple GPUs. We support RAFT flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, and TIMM models.

PyTorch feature-extraction parallel

Python617

8 个月前

VLMEvalKit

@open-compass

#大语言模型#Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks

gpt-4v large-language-models llava multi-modal openai

Python3.11 k

3 天前

Awesome-LLMs-for-Video-Understanding

@yunlong10

🔥🔥🔥 [IEEE TCSVT] Latest Papers, Codes and Datasets on Vid-LLMs.

2.77 k

1 个月前

jylins / videoxum

自述文件

0 条讨论

关于

创建时间

是否国产

修改时间

语言

您可能感兴趣的