# video-recognition

https://static.github-zh.com/github_avatars/PaddlePaddle?size=40

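Built on a modular design, it provides a rich set of video algorithm implementations along with industrial-grade video algorithm optimizations and applications. These include action localization and recognition, behavior analysis, smart cover selection, video annotation, and video tagging for industries such as security, sports, internet, and media. It covers action recognition and video classification, action localization, action detection, and multimodal text-video retrieval.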

Python 1.64 k
7 months ago
https://static.github-zh.com/github_avatars/subho406?size=40

#Natural Language Processing# Official PyTorch implementation of "OmniNet: A unified architecture for multi-modal multi-task learning" | Authors: Subhojeet Pramanik, Priyanka Agrawal, Aman Hussain

Python 513
5 years ago
https://static.github-zh.com/github_avatars/datamllab?size=40
Python 339
2 years ago
https://static.github-zh.com/github_avatars/Atze00?size=40

MoViNets PyTorch implementation: Mobile Video Networks for Efficient Video Recognition

Jupyter Notebook 278
3 years ago
https://static.github-zh.com/github_avatars/tea1528?size=40

#Computer Science# PyTorch implementation of Non-Local Neural Networks (https://arxiv.org/pdf/1711.07971.pdf)

Python 251
3 years ago
https://static.github-zh.com/github_avatars/whwu95?size=40

【AAAI'2023 & IJCV】Transferring Vision-Language Models for Visual Recognition: A Classifier Perspective

Python 195
1 year ago
https://static.github-zh.com/github_avatars/whwu95?size=40

GPT4Vis: What Can GPT-4 Do for Zero-shot Visual Recognition?

Python 184
1 year ago
https://static.github-zh.com/github_avatars/whwu95?size=40

【CVPR'2023】Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models

Python 152
1 year ago
https://static.github-zh.com/github_avatars/cooperdk?size=40
Python 150
3 years ago
https://static.github-zh.com/github_avatars/ldkong1205?size=40

[NeurIPS 2023] Unsupervised Video Domain Adaptation for Action Recognition: A Disentanglement Perspective

Jupyter Notebook 119
2 years ago
https://static.github-zh.com/github_avatars/rohitgirdhar?size=40

#Computer Science# CATER: A diagnostic dataset for Compositional Actions and TEmporal Reasoning

Python 107
5 years ago
https://static.github-zh.com/github_avatars/Ha0Tang?size=40

[Neurocomputing 2019] Fast and Robust Dynamic Hand Gesture Recognition via Key Frames Extraction and Feature Fusion

C++ 102
4 years ago
https://static.github-zh.com/github_avatars/DmitryRyumin?size=40

#Face Recognition# WACV 2024 Papers: Discover cutting-edge research from WACV 2024, the leading computer vision conference. Stay updated on the latest in computer vision and deep learning, with code included. ⭐ support ...

Python 96
1 year ago
https://static.github-zh.com/github_avatars/yanbeic?size=40

PyTorch implementation of the paper [CVPR 2021] Distilling Audio-Visual Knowledge by Compositional Contrastive Learning

Python 89
4 years ago
https://static.github-zh.com/github_avatars/Fl1s?size=40

A search system that analyzes short video snippets (2–5 seconds) and finds highly accurate matches using keyframe-based perceptual hashing. A self-hosted "Shazam for video".

Java 59
1 month ago