
This repository has been indexed but not yet edited. For the project introduction and usage tutorial, please read the README on GitHub.



About

Auto-AVSR: Lip-Reading Sentences Project

Created
Developed in China

  Last modified

2025-01-08T12:54:54Z


Languages

  • Python 91.9%
  • Jupyter Notebook 7.9%
  • Shell 0.2%

Other open-source projects by mpc001

Visual Speech Recognition for Multiple Languages

Python 425
2 years ago

ICASSP'22 Training Strategies for Improved Lip-Reading; ICASSP'21 Towards Practical Lipreading with Distilled and Efficient Models; ICASSP'20 Lipreading using Temporal Convolutional Networks

Python 419
2 years ago

Pytorch code for End-to-End Audiovisual Speech Recognition

Python 174
3 years ago

You may also be interested in

Visual Speech Recognition for Multiple Languages

Python 425
2 years ago
hashicorp/terraform

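Terraform is a tool for building, changing, and versioning infrastructure safely and efficiently (an orchestration tool for infrastructure automation). Its goal is to "Write, Plan, and create Infrastructure as Code".

Go 45.96 k
3 days ago
Python 9.35 k
6 days ago
TypeScript 92.2 k
4 days ago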

[ECCV 2024] Single Image to 3D Textured Mesh in 10 seconds with Convolutional Reconstruction Model.

Python 664
8 months ago

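Fairseq is a sequence-to-sequence modeling toolkit written in Python for training custom models for translation, summarization, language modeling, and other text generation tasks.

Python 31.68 k
2 months ago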

Automated Lip reading from real-time videos in tensorflow in python

Jupyter Notebook 159
7 years ago
Python 1.88 k
3 years ago

Official Repository of the paper "Trajectory Consistency Distillation"

Python 347
1 year ago

Code for "AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling"

Python 773
1 year ago

A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.

Python 233
1 year ago

WACV 2020 "Word-level Deep Sign Language Recognition from Video: A New Large-scale Dataset and Methods Comparison"

Python 1 k
2 years ago

ICASSP'22 Training Strategies for Improved Lip-Reading; ICASSP'21 Towards Practical Lipreading with Distilled and Efficient Models; ICASSP'20 Lipreading using Temporal Convolutional Networks

Python 419
2 years ago

#Face Recognition# RetinaFace: Deep Face Detection Library for Python

Python 1.63 k
19 days ago

Embed Python in Unreal Engine 4

C++ 2.84 k
3 years ago