GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

speech-representation

Website
Wikipedia
s3prl/s3prl
https://static.github-zh.com/github_avatars/s3prl?size=40
s3prl / s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit

speech-representationmockingjayrepresentation-learningapcteraself-supervised-learningspeech-pretrainingvq-apcwav2vechubertwavlm
Python 2.41 k
3 个月前
https://static.github-zh.com/github_avatars/jishengpeng?size=40
jishengpeng / WavTokenizer

[ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language modeling

acousticcodecgpt4osemanticspeech-representationtext-to-speechdac
Python 1.15 k
3 个月前
https://static.github-zh.com/github_avatars/ddlBoJack?size=40
ddlBoJack / emotion2vec

[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation

pytorch-implementationspeech-representation
Python 865
6 个月前
https://static.github-zh.com/github_avatars/jishengpeng?size=40
jishengpeng / WavChat

A Survey of Spoken Dialogue Models (60 pages)

gpt-4ospeechspeech-representationstreaming
302
7 个月前
https://static.github-zh.com/github_avatars/mechanicalsea?size=40
mechanicalsea / lighthubert

LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT

neural-architecture-searchPyTorchself-supervised-learningspeech-representation
Python 74
3 年前
https://static.github-zh.com/github_avatars/Ereboas?size=40
Ereboas / MagiCodec

#大语言模型#A single-layer, streaming codec model providing SOTA audio quality and discrete tokens designed for superior downstream modelability.

codec大语言模型PyTorchspeech-representationtext-to-speechtts
Python 55
11 天前
https://static.github-zh.com/github_avatars/andi611?size=40
andi611 / Mockingjay-Speech-Representation

Official Implementation of Mockingjay in Pytorch

speechspeech-representationmockingjayrepresentation-learningfeature-extractionsentiment-classificationspeaker-recognitionPyTorchpytorch-implementationapc
Python 54
2 年前
https://static.github-zh.com/github_avatars/vectominist?size=40
vectominist / MiniASR

A mini, simple, and fast end-to-end automatic speech recognition toolkit.

asrctcspeech-recognitionspeech-representationhubertminimalPyTorchfairseq
Jupyter Notebook 52
3 年前
https://static.github-zh.com/github_avatars/bshall?size=40
bshall / dusted

DUSTED: Spoken-Term Discovery using Discrete Speech Units

speech-representation
Jupyter Notebook 17
8 个月前
https://static.github-zh.com/github_avatars/seorim0?size=40
seorim0 / SE-using-SRL-Model

#计算机科学#Causal Speech Enhancement Based on a Two-Branch Nested U-Net Architecture Using Self-Supervised Speech Embeddings

深度学习深度神经网络noise-reductionPyTorchself-supervised-learningspeech-enhancementspeech-representationPython
Python 15
10 天前
https://static.github-zh.com/github_avatars/jefflai108?size=40
jefflai108 / Semi-Supervsied-Spoken-Language-Understanding-PyTorch

Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining

speech-recognitionsemi-supervised-learningspeech-representation
Python 12
4 年前
https://static.github-zh.com/github_avatars/ryota-komatsu?size=40
ryota-komatsu / slp2025

音学シンポジウム2025チュートリアル「マルチモーダル大規模言語モデル入門」資料

multimodal-large-language-modelsspeechspeech-processingspeech-representation
Jupyter Notebook 4
8 天前