#

speaker-recognition

https://static.github-zh.com/github_avatars/NVIDIA-NeMo?size=40

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 15.7 k
12 小时前
https://static.github-zh.com/github_avatars/pyannote?size=40

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 8.29 k
21 小时前
google/uis-rnn
https://static.github-zh.com/github_avatars/google?size=40

#计算机科学#This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.

Python 1.58 k
1 年前
https://static.github-zh.com/github_avatars/clovaai?size=40
Python 1.13 k
1 年前
https://static.github-zh.com/github_avatars/yeyupiaoling?size=40

This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE, ERes2Net, CAM++, etc. It is not excluded that more models will be supported in the future. At the sam...

Python 1.11 k
3 个月前
https://static.github-zh.com/github_avatars/wenet-e2e?size=40
Python 1.03 k
5 天前
https://static.github-zh.com/github_avatars/athena-team?size=40
C++ 959
3 年前
https://static.github-zh.com/github_avatars/astorfi?size=40
Python 788
6 年前
https://static.github-zh.com/github_avatars/TaoRuijie?size=40

Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)

Python 733
1 年前
https://static.github-zh.com/github_avatars/FluidInference?size=40
Swift 622
1 天前
https://static.github-zh.com/github_avatars/taylorlu?size=40

speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition

Python 490
4 年前
https://static.github-zh.com/github_avatars/google?size=40

This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.

Python 432
1 个月前
https://static.github-zh.com/github_avatars/nuaazs?size=40

Aims to create a comprehensive voice toolkit for training, testing, and deploying speaker verification systems.

Python 395
1 年前
https://static.github-zh.com/github_avatars/speechbrain?size=40

#计算机科学#The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN a...

HTML 371
3 个月前
https://static.github-zh.com/github_avatars/SamirPaulb?size=40

#计算机科学#A desktop application that uses AI to translate voice between languages in real time, while preserving the speaker's tone and emotion.

Tcl 329
2 年前
https://static.github-zh.com/github_avatars/manojpamk?size=40

Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196

Python 320
5 年前
loading...
Website
Wikipedia