GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

speaker-recognition

Website
Wikipedia
https://static.github-zh.com/github_avatars/NVIDIA?size=40
NVIDIA / NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

machine-translationspeaker-recognitionasrttsgenerative-aimultimodal深度学习neural-networksspeaker-diariazationspeech-translationspeech-synthesislarge-language-models
Python 14.8 k
14 小时前
https://static.github-zh.com/github_avatars/speechbrain?size=40
speechbrain / speechbrain

#计算机科学#A PyTorch-based Speech Toolkit

speech-recognitionspeech-toolkitspeaker-recognitionspeech-to-textspeech-enhancementspeech-separationaudioaudio-processingspeech-processingspeechrecognitionasrvoice-recognitionspeaker-diarizationspeaker-verificationPyTorchhuggingfacetransformerslanguage-model深度学习
Python 9.98 k
5 天前
https://static.github-zh.com/github_avatars/pyannote?size=40
pyannote / pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

PyTorchspeech-processingspeaker-diarizationvoice-activity-detectionpretrained-modelsspeaker-recognitionspeaker-verification
Jupyter Notebook 7.69 k
2 天前
google/uis-rnn
https://static.github-zh.com/github_avatars/google?size=40
google / uis-rnn

#计算机科学#This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.

speaker-diarizationuis-rnnspeaker-recognitionsupervised-learningclusteringsupervised-clustering机器学习
Python 1.58 k
9 个月前
https://static.github-zh.com/github_avatars/mravanelli?size=40
mravanelli / SincNet

#计算机科学#SincNet is a neural architecture for efficiently processing raw audio samples.

深度学习audiowaveformfilteringcnnconvolutional-neural-networksspeaker-recognitionspeaker-verificationspeaker-identificationspeech-recognitionasraudio-processingspeech-processingdigital-signal-processingsignal-processingneural-networks人工智能timitPyTorchPython
Python 1.18 k
4 年前
https://static.github-zh.com/github_avatars/clovaai?size=40
clovaai / voxceleb_trainer

In defence of metric learning for speaker recognition

speaker-recognitionmetric-learningspeaker-verification
Python 1.11 k
1 年前
https://static.github-zh.com/github_avatars/yeyupiaoling?size=40
yeyupiaoling / VoiceprintRecognition-Pytorch

This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE, ERes2Net, CAM++, etc. It is not excluded that more models will be supported in the future. At the sam...

PyTorchvoice-recognitionarcfacespeaker-recognition
Python 1.02 k
5 天前
https://static.github-zh.com/github_avatars/athena-team?size=40
athena-team / athena

an open-source implementation of sequence-to-sequence based speech processing engine

speech-recognitionasrtransformerTensorflowctcunsupervised-learningsequence-to-sequence部署speaker-recognitionttsspeech-synthesis
C++ 951
3 年前
https://static.github-zh.com/github_avatars/wenet-e2e?size=40
wenet-e2e / wespeaker

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

production-readyPyTorchresnetspeaker-recognitionspeaker-verificationspeaker-diarizationrepvggTLS (Transport Layer Security)dinowavlm
Python 934
1 个月前
https://static.github-zh.com/github_avatars/astorfi?size=40
astorfi / 3D-convolutional-speaker-recognition

#计算机科学#🔈 Deep Learning & 3D Convolutional Neural Networks for Speaker Verification

convolutional-neural-networks深度学习speaker-recognition3D
Python 786
5 年前
https://static.github-zh.com/github_avatars/TaoRuijie?size=40
TaoRuijie / ECAPA-TDNN

Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)

speaker-recognitionspeaker-verification
Python 696
1 年前
https://static.github-zh.com/github_avatars/cvqluu?size=40
cvqluu / Angular-Penalty-Softmax-Losses-Pytorch

#人脸识别#Angular penalty loss functions in Pytorch (ArcFace, SphereFace, Additive Margin, CosFace)

metric-learningPyTorchloss-functionsembeddingface-verificationfashion-mnistface-recognitionspeaker-recognitionspherefacearcface
Python 489
2 年前
https://static.github-zh.com/github_avatars/taylorlu?size=40
taylorlu / Speaker-Diarization

speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition

uis-rnnspeaker-diarizationspeaker-recognition
Python 483
4 年前
https://static.github-zh.com/github_avatars/google?size=40
google / speaker-id

This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.

speaker-recognitionsource-separationspeaker-diarizationspeaker-verificationspeaker-identification
Python 421
3 个月前
https://static.github-zh.com/github_avatars/nuaazs?size=40
nuaazs / VAF_2

Aims to create a comprehensive voice toolkit for training, testing, and deploying speaker verification systems.

antifraud微服务speaker-diarizationspeaker-recognitionspeech-recognition
Python 404
1 年前
https://static.github-zh.com/github_avatars/speechbrain?size=40
speechbrain / speechbrain.github.io

#计算机科学#The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN a...

深度学习speech-recognitionspeech-to-textspeechspeech-processingspeaker-recognitionspeaker-verificationspeaker-identificationspeech-separationspeechrecognition神经网络neural-networkstimitspeech-analysis
HTML 368
16 天前
https://static.github-zh.com/github_avatars/manojpamk?size=40
manojpamk / pytorch_xvectors

Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196

speaker-verificationspeaker-recognitionspeaker-diarization
Python 316
5 年前
https://static.github-zh.com/github_avatars/yeyupiaoling?size=40
yeyupiaoling / VoiceprintRecognition-Tensorflow

使用Tensorflow实现声纹识别

Tensorflowvoice-recognitionarcfacespeaker-recognition
Python 311
1 年前
https://static.github-zh.com/github_avatars/SamirPaulb?size=40
SamirPaulb / real-time-voice-translator

#计算机科学#A desktop application that uses AI to translate voice between languages in real time, while preserving the speaker's tone and emotion.

final-year-project机器学习speaker-recognitionspeech-to-textspeechrecognitiontext-to-speechtkintertranslationGUIPython
Tcl 287
1 年前
https://static.github-zh.com/github_avatars/yeyupiaoling?size=40
yeyupiaoling / VoiceprintRecognition-PaddlePaddle

本项目使用了EcapaTdnn、ResNetSE、ERes2Net、CAM++等多种先进的声纹识别模型,同时本项目也支持了MelSpectrogram、Spectrogram、MFCC、Fbank等多种数据预处理方法

paddlepaddlevoice-recognitionarcfacespeaker-recognition
Python 267
5 天前
loading...