speaker-diarization

modelscope / FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

conformer, PyTorch, speech-recognition, paraformer, punctuation, speaker-diarization, rnnt, audio-visual-speech-recognition, pretrained-model, voice-activity-detection, Whisper, dfsmn, vad, speechgpt, speechllm
Python 11 k
19 days ago
speechbrain / speechbrain

#Computer Science# A PyTorch-based Speech Toolkit

speech-recognition, speech-toolkit, speaker-recognition, speech-to-text, speech-enhancement, speech-separation, audio, audio-processing, speech-processing, speechrecognition, asr, voice-recognition, speaker-diarization, speaker-verification, PyTorch, huggingface, transformers, language-model, deep learning
Python 9.98 k
5 days ago
espnet / espnet

#Computer Science# End-to-End Speech Processing Toolkit

deep learning, end-to-end, chainer, PyTorch, kaldi, speech-recognition, speech-synthesis, speech-translation, machine-translation, voice-conversion, speech-enhancement, speech-separation, singing-voice-synthesis, speaker-diarization, text-to-speech
Python 9.2 k
1 day ago
pyannote / pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

PyTorch, speech-processing, speaker-diarization, voice-activity-detection, pretrained-models, speaker-recognition, speaker-verification
Jupyter Notebook 7.69 k
2 days ago
MahmoudAshraf97 / whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

asr, speaker-diarization, speech, speech-recognition, speech-to-text, Whisper
Jupyter Notebook 4.63 k
2 months ago
linto-ai / whisper-timestamped

#Computer Science# Multilingual Automatic Speech Recognition with word-level timestamps and confidence

deep learning, speech, speech-recognition, speech-to-text, asr, machine learning, Python, PyTorch, attention-is-all-you-need, attention-mechanism, attention-model, speaker-diarization, speech-processing, transformers, Whisper
Python 2.45 k
3 months ago
Purfview / whisper-standalone-win

Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.

openai, speech-to-text, Whisper, asr, speech-recognition, subtitles, ctranslate2, faster-whisper, whisperx, uvr, diarization, speaker-diarization
2.17 k
2 months ago
modelscope / 3D-Speaker

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

speaker-diarization, speaker-verification, language-identification, modelscope
Python 2.11 k
10 days ago
wq2012 / awesome-diarization

#Awesome# A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

speaker-diarization, Awesome Lists, machine learning, speech-recognition, speech-processing, deep learning
1.76 k
8 months ago
google / uis-rnn

#Computer Science# This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.

speaker-diarization, uis-rnn, speaker-recognition, supervised-learning, clustering, supervised-clustering, machine learning
Python 1.58 k
9 months ago
juanmc2005 / diart

#Computer Science# A Python package to build AI-powered real-time audio applications

speaker-diarization, streaming-audio, real-time, deep learning, transcription, voice-activity-detection
Python 1.32 k
4 months ago
wenet-e2e / wespeaker

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

production-ready, PyTorch, resnet, speaker-recognition, speaker-verification, speaker-diarization, repvgg, TLS (Transport Layer Security), dino, wavlm
Python 934
1 month ago
transcriptionstream / transcriptionstream

#Large Language Models# Turnkey self-hosted offline transcription and diarization service with LLM summary

automation, diarization, large language models, speaker-diarization, speech-recognition, transcription, Whisper, ollama, mistral-7b, whisperx
Python 857
9 months ago
yinruiqing / pyannote-whisper

#Large Language Models#

asr, speaker-diarization, Whisper, ChatGPT
Python 596
1 year ago
wq2012 / SpectralCluster

#Computer Science# Python re-implementation of the (constrained) spectral clustering algorithms used in Google's speaker diarization papers.

machine learning, clustering, spectral-clustering, unsupervised-learning, speaker-diarization, Python
Python 531
9 months ago
taylorlu / Speaker-Diarization

Speaker diarization by UIS-RNN and speaker embedding by vgg-speaker-recognition

uis-rnn, speaker-diarization, speaker-recognition
Python 483
4 years ago
google / speaker-id

This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.

speaker-recognition, source-separation, speaker-diarization, speaker-verification, speaker-identification
Python 421
3 months ago
revdotcom / reverb

Open source inference code for Rev's model

speech-recognition, speech-to-text, asr, canary, Docker, Whisper, Open Source, speechrecognition, diarization, huggingface, speaker-diarization, deep learning, neural networks
Python 404
2 months ago
nuaazs / VAF_2

Aims to create a comprehensive voice toolkit for training, testing, and deploying speaker verification systems.

antifraud, microservices, speaker-diarization, speaker-recognition, speech-recognition
Python 404
1 year ago
hitachi-speech / EEND

#Computer Science# End-to-End Neural Diarization

speaker-diarization, end-to-end, machine learning, chainer, kaldi, deep learning
Python 402
4 years ago