This repository has been indexed but not yet edited. For the project introduction and usage tutorial, see the README on GitHub.


About

Auto-AVSR: Lip-Reading Sentences Project

Created

2023-06-16

Developed in China

No

Languages

Python 91.9%
Jupyter Notebook 7.9%
Shell 0.2%

Other open-source projects by mpc001

Visual_Speech_Recognition_for_Multiple_Languages

@mpc001

Visual Speech Recognition for Multiple Languages

Python 425

2 years ago

Lipreading_using_Temporal_Convolutional_Networks

@mpc001

ICASSP'22 Training Strategies for Improved Lip-Reading; ICASSP'21 Towards Practical Lipreading with Distilled and Efficient Models; ICASSP'20 Lipreading using Temporal Convolutional Networks

Python 419

2 years ago

end-to-end-lipreading

@mpc001

PyTorch code for End-to-End Audiovisual Speech Recognition

Python 174

3 years ago
