#计算机科学#3D ResNets for Action Recognition (CVPR 2018)
基于模块化的设计,提供丰富的视频算法实现、产业级的视频算法优化与应用,包括安防、体育、互联网、媒体等行业的动作定位与识别、行为分析、智能封面、视频标注、视频打标签等,涵盖动作识别与视频分类、动作定位、动作检测、多模态文本视频检索等技术。
#自然语言处理#Official Pytorch implementation of "OmniNet: A unified architecture for multi-modal multi-task learning" | Authors: Subhojeet Pramanik, Priyanka Agrawal, Aman Hussain
#自然语言处理#Eden AI: simplify the use and deployment of AI technologies by providing a unique API that connects to the best possible AI engines
#计算机科学#Detects license plate of car and recognizes its characters
#计算机科学#AutoVideo: An Automated Video Action Recognition System
#计算机科学#PyTorch implementation of Non-Local Neural Networks (https://arxiv.org/pdf/1711.07971.pdf)
【AAAI'2023 & IJCV】Transferring Vision-Language Models for Visual Recognition: A Classifier Perspective
GPT4Vis: What Can GPT-4 Do for Zero-shot Visual Recognition?
【CVPR'2023】Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models
YAPO e+ - Yet Another Porn Organizer (extended)
#计算机科学#CATER: A diagnostic dataset for Compositional Actions and TEmporal Reasoning
#人脸识别#WACV 2024 Papers: Discover cutting-edge research from WACV 2024, the leading computer vision conference. Stay updated on the latest in computer vision and deep learning, with code included. ⭐ support ...
PyTorch Implementation on Paper [CVPR2021]Distilling Audio-Visual Knowledge by Compositional Contrastive Learning
Video Recognition using Mixed Convolutional Tube (MiCT) on PyTorch with a ResNet backbone
Frame Flexible Network (CVPR2023)
#计算机科学#My experimentation around action recognition in videos. Contains Keras implementation for C3D network based on original paper "Learning Spatiotemporal Features with 3D Convolutional Networks", Tran et...
Video dataset dedicated to portrait-mode video recognition.
#计算机科学#State of the art object detection in real-time using YOLOV3 algorithm. Augmented with a process that allows easy training of the classifier as a plug & play solution . Provides alert if an item in an...
[WACV'22] Code repository for the paper "Self-supervised Video Representation Learning with Cross-Stream Prototypical Contrasting", https://arxiv.org/abs/2106.10137.