GitHub Chinese Community

wavlm

yl4579 / StyleTTS2

#Computer Science# StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Deep Learning, PyTorch, speaker-adaptation, speech-synthesis, text-to-speech, tts, wavlm, diffusion-models, latent-diffusion, latent-diffusion-models, Generative Adversarial Network
Python 5.79 k
10 months ago
s3prl / s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit

speech-representation, mockingjay, representation-learning, apc, tera, self-supervised-learning, speech-pretraining, vq-apc, wav2vec, hubert, wavlm
Python 2.41 k
3 months ago
wenet-e2e / wespeaker

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

production-ready, PyTorch, resnet, speaker-recognition, speaker-verification, speaker-diarization, repvgg, TLS (Transport Layer Security), dino, wavlm
Python 934
1 month ago
lucadellalib / focalcodec

#Computer Science# A low-bitrate single-codebook 16 kHz speech codec based on focal modulation

codec, Deep Learning, PyTorch, speech-synthesis, wavlm
Python 88
4 months ago
mjhydri / Singing-Vocal-Beat-Tracking

This repo contains the source code of the first deep learning-based singing voice beat tracking system. It leverages WavLM and DistilHuBERT pre-trained speech models to create vocal embeddings and trai...

beat-tracking, hubert, music, music-information-retrieval, self-supervised, singing-voice, wavlm
Python 32
3 years ago
lucadellalib / discrete-wavlm-codec

A neural speech codec based on discrete WavLM representations

clustering, codec, hifi-gan, PyTorch, quantization, self-supervised-learning, speech-synthesis, wavlm
Python 24
10 months ago
lucadellalib / audiocodecs

A collection of audio codecs with a standardized API

codec, dac, PyTorch, quantization, self-supervised-learning, speech-synthesis, text-to-speech, wavlm
Python 21
20 days ago
theolepage / wavlm_ssl_sv

SOTA method for self-supervised speaker verification leveraging a large-scale pretrained ASR model.

asr, dino, PyTorch, self-supervised-learning, speaker-recognition, speaker-verification, wavlm
Python 8
4 months ago
alessandropec / data_driven_ai_voice_cloning

#Computer Science# This repository contains the code for the main part of my master's thesis at Politecnico di Torino in Data Science & Engineering

Artificial Intelligence, generative-ai, speaker-verification, text-to-speech, Voice Cloning, zero-shot-learning, Deep Learning, Machine Learning, wavlm, fastspeech2, tacotron2
Python 8
2 years ago
Sarasadeghii / Sharif-WavLM

This repository applies the WavLM model to the speaker verification task on both good-quality and poor-quality data, and uses the PyCM library for evaluation.

confusion-matrix, speaker-verification, wavlm
Jupyter Notebook 8
2 years ago
bunyaminergen / WavLMMSDD

This repository combines `WavLM`, a powerful speech representation model from Microsoft, with `MSDD` (Multi-Scale Diarization Decoder), a state-of-the-art approach for speaker diarization from Nvidia...

diarization, embedding, Microsoft, speaker-diarization, speech, wavlm
Jupyter Notebook 7
3 months ago
sadPororo / UniPool-SV

Universal Pooling Method for Speaker Verification Utilizing Pre-trained Multi-layer Features, 2025 preprint

hubert, pretrained-models, speaker-recognition, speaker-verification, wavlm
Python 6
9 months ago
SmoothKen / knn-svc

kNN-SVC: Robust Zero-Shot Singing Voice Conversion with Additive Synthesis and Concatenation Smoothness Optimization

singing-voice-conversion, voice-conversion, wavlm
Python 5
2 months ago
zhu00121 / Universal-representation-dynamics-of-deepfake-speech

This repo contains code used in the paper "Characterizing the temporal dynamics of universal speech representations for generalizable deepfake detection"

deepfake-detection, self-supervised, wavlm
Python 4
2 years ago
bunyaminergen / WavLMRawNetXSVBase

WavLM Large + RawNetX Speaker Verification Base: End-to-End Speaker Verification Architecture

audio, feature-extraction, speaker-verification, speech, speech-processing, wavlm
Python 2
3 months ago
aitor-alvarez / acoustic-transformer-models

Acoustic Transformer Models for Audio Classification

acoustic, classification, hubert, transformer-models, pytorch-lightning, wavlm
Python 1
4 months ago
sadPororo / LAP

Rethinking Leveraging Pre-Trained Multi-Layer Representations for Speaker Verification, 2025 Interspeech

hubert, pretrained-models, speaker-verification, wavlm
Python 1
16 days ago
lucadellalib / cryceleb2023

CryCeleb2023 experiments

metric-learning, speaker-verification, triplet-loss, wavlm
Jupyter Notebook 0
2 years ago