The official repo for [NeurIPS'22] "ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation" and [TPAMI'23] "ViTPose++: Vision Transformer for Generic Body Pose Estimation"
2022-04-27
否
2024-07-24T11:00:20Z
Unofficial implementation for [ECCV'22] "Exploring Plain Vision Transformer Backbones for Object Detection"
#计算机科学#A comprehensive list [SAMRS@NeurIPS'23, RVSA@TGRS'22, RSP@TGRS'22] of our research works related to remote sensing, including papers, codes, and citations. Note: The repo for [TGRS'22] "An Empirical S...
#计算机科学#The official repo for [TGRS'22] "Advancing Plain Vision Transformer Towards Remote Sensing Foundation Model"
Open-Sora: 完全开源的高效复现类Sora视频生成方案
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
TripoSR: Fast 3D Object Reconstruction from a Single Image
OpenMMLab Pose Estimation Toolbox and Benchmark.
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
Official implementation of CVPR2020 paper "VIBE: Video Inference for Human Body Pose and Shape Estimation"
[ECCV2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy
[ECCV 2024] Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
4DHumans: Reconstructing and Tracking Humans with Transformers
[ICCV 2023] PyTorch Implementation of "MotionBERT: A Unified Perspective on Learning Human Motion Representations"
SenseTime Research platform for single object tracking, implementing algorithms like SiamRPN and SiamMask.
whisper 是一个通用语音识别模型