GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
该项目已被所有者存档,当前处于只读状态,不再更新
facebookresearch

facebookresearch / av_hubert

星标930
复刻147


问题
 
Loading

该仓库已收录但尚未编辑。项目介绍及使用教程请前往 GitHub 阅读 README


0 条讨论

登录后发表评论

关于

A self-supervised learning framework for audio-visual speech

创建时间

2021-12-23

是否国产

否

  修改时间

2023-12-07T00:17:57Z

Readme
相关推荐

语言

  • Python100.0%

facebookresearch 的其他开源项目

Meta Research
segment-anything
@facebookresearch • Meta

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook51.71 k
1 年前
Meta Research
faiss
@facebookresearch • Meta

#向量搜索引擎#向量相似性搜索库,为稠密向量提供高效相似度搜索和聚类

C++36.89 k
5 天前
Meta Research
detectron2
@facebookresearch • Meta

Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.

Python32.67 k
7 天前
Meta Research
fairseq
@facebookresearch • Meta

Fairseq 是一个Python编写的 Seq2seq 建模工具包,可用于翻译、摘要、语言建模和其他文本生成任务训练自定义模型

PythonPyTorch人工智能
Python31.76 k
3 个月前

您可能感兴趣的

Open-Sora
@hpcaitech

Open-Sora: 完全开源的高效复现类Sora视频生成方案

Python27.13 k
4 个月前
grok-1
@xai-org

大模型Grok-1开源

Python50.49 k
1 年前
AniPortrait
@Zejun-Yang

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

Python5 k
1 年前
Open-Sora-Plan
@PKU-YuanGroup

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Python12.02 k
1 个月前
Make-Your-Anchor
@ICTMCG

[CVPR 2024] Make-Your-Anchor: A Diffusion-based 2D Avatar Generation Framework.

cvprcvpr2024
Python351
7 个月前
MeloTTS
@myshell-ai

High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.

text-to-speechtts中文englishfrench
Python6.7 k
8 个月前
diffused-heads
@MStypulkowski

Official repository for Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation

Python485
1 年前
Real3DPortrait
@yerfor

Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis; ICLR 2024 Spotlight; Official code

nerftalking-face-generation
Python1.06 k
1 年前
FunASR
@modelscope

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

conformerPyTorchspeech-recognitionparaformerpunctuation
Python12.4 k
19 天前
surya
@datalab-to

OCR, layout analysis, reading order, table recognition in 90+ languages

Python18.45 k
5 天前
openinterpreter/01
01
@openinterpreter

The #1 open-source voice interface for desktop, mobile, and ESP32 chips.

Python5.09 k
10 个月前
Lissy93/web-check
web-check
@Lissy93

🕵️‍♂️ All-in-one OSINT tool for analysing any website

OSINT隐私安全sysadmin
TypeScript26.35 k
1 个月前
uni2ts
@SalesforceAIResearch

#计算机科学#Unified Training of Universal Time Series Forecasting Transformers

深度学习forecasting机器学习pre-trained-modelspre-training
Jupyter Notebook1.24 k
23 天前
AnimatableGaussians
@lizhe00

Code of [CVPR 2024] "Animatable Gaussians: Learning Pose-dependent Gaussian Maps for High-fidelity Human Avatar Modeling"

3d-human3d-reconstructionanimatable-avatar3d-gaussian-splatting
Python1.03 k
10 个月前
Wav2Lip-HD
@saifhassan

High-Fidelity Lip-Syncing with Wav2Lip and Real-ESRGAN

Python482
1 年前
DAMO-ConvAI
@AlibabaResearch

#自然语言处理#DAMO-ConvAI: The official repository which contains the codebase for Alibaba DAMO Conversational AI.

conversational-ai深度学习自然语言处理dialog
Python1.46 k
1 个月前
DeepSeek-VL
@deepseek-ai

DeepSeek-VL: Towards Real-World Vision-Language Understanding

vision-language-modelvision-language-pretrainingfoundation-models
Python3.95 k
1 年前