GitHub Chinese Community

#speaker-identification

alphacep / vosk-api

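#Android# Vosk is an offline speech recognition toolkit. It supports Python, Java, Node.js, C#, and C++, and can recognize more than 20 languages, including Chinese, English, and French.

speech-recognition, asr, voice-recognition, speech-to-text, Android, iOS, Raspberry Pi, deep learning, deep neural networks, speech-to-text-android, speaker-identification, speaker-verification, Python, offline, privacy, kaldi, deepspeech, google-speech-to-text, vosk, stt
Jupyter Notebook 12.08 k
1 month ago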
mravanelli / SincNet

#Computer Science# SincNet is a neural architecture for efficiently processing raw audio samples.

deep learning, audio, waveform, filtering, cnn, convolutional-neural-networks, speaker-recognition, speaker-verification, speaker-identification, speech-recognition, asr, audio-processing, speech-processing, digital-signal-processing, signal-processing, neural-networks, artificial intelligence, timit, PyTorch, Python
Python 1.18 k
4 years ago
HarryVolek / PyTorch_Speaker_Verification

PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.

PyTorch, speaker-identification, speaker-verification
Python 586
3 years ago
google / speaker-id

This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.

speaker-recognition, source-separation, speaker-diarization, speaker-verification, speaker-identification
Python 421
3 months ago
speechbrain / speechbrain.github.io

#Computer Science# The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain, users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN a...

deep learning, speech-recognition, speech-to-text, speech, speech-processing, speaker-recognition, speaker-verification, speaker-identification, speech-separation, speechrecognition, neural networks, neural-networks, timit, speech-analysis
HTML 368
16 days ago
jymsuper / SpeakerRecognition_tutorial

#Computer Science# Simple d-vector based Speaker Recognition (verification and identification) using PyTorch

speaker-recognition, deep learning, speaker-verification, speaker-identification, PyTorch
Python 211
5 years ago
Atul-Anand-Jha / Speaker-Identification-Python

Speaker Identification System (up to 100% accuracy); built using Python 2.7 and the python_speech_features library

Python, speaker-recognition, speaker-identification
Python 208
5 years ago
oscarknagg / voicemap

#Computer Science# Identifying people from small audio fragments

machine learning, speaker-identification, speaker-recognition, convolutional-neural-networks
Python 170
5 years ago
Speaker-Identification / You-Only-Speak-Once

#Computer Science# Deep Learning - one-shot learning for speaker recognition using Filter Banks

triplet-loss, speaker-recognition, neural networks, audio, speech, speaker-identification, deep learning
Jupyter Notebook 168
1 year ago
kaistmm / Audio-Mamba-AuM

#Computer Science# Official implementation of the work "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning"

audio, audio-classification, deep learning, mamba, PyTorch, representation-learning, speaker-identification
Python 144
7 months ago
jefflai108 / pytorch-kaldi-neural-speaker-embeddings

Lightweight neural speaker embedding extraction based on Kaldi and PyTorch.

speaker-verification, speaker-recognition, speech-processing, speaker-identification, PyTorch, kaldi
Perl 136
5 years ago
SiavashShams / ssamba

#Computer Science# [SLT'24] The official implementation of SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model

audio, audio-classification, mamba, representation-learning, self-supervised-learning, speaker-identification, deep learning, emotion-recognition
Python 121
8 months ago
Anwarvic / Speaker-Recognition

This repo contains my attempt to create a Speaker Recognition and Verification system using SideKit-1.3.1

speaker-recognition, speaker-verification, speaker-identification
Python 111
6 years ago
FAKEBOB-adversarial-attack / FAKEBOB

Source code for paper "Who is real Bob? Adversarial Attacks on Speaker Recognition Systems" (IEEE S&P 2021)

adversarial-attacks, speaker-identification, speaker-verification
Python 104
3 years ago
funcwj / ge2e-speaker-verification

PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification"

speaker-verification, PyTorch, speaker-identification
Python 102
6 years ago
Appen / UHV-OTS-Speech

A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.

speech-processing, speech-recognition, speaker-diarization, speaker-identification
Forth 102
2 years ago
Warma10032 / easytts

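#Natural Language Processing# Aims to be the simplest collection of TTS front ends and the simplest audiobook production workflow. It splits a novel into sentences with regular-expression rules and uses RoBERTa to identify the speaker of each line of dialogue, enabling one-click generation of multi-voice audiobooks. Multi-speaker speech synthesis and high-quality audiobook production.

artificial intelligence, audio-generation, natural language processing, pyqt, speaker-identification, tts
Python 101
3 months ago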
cvqluu / GE2E-Loss

PyTorch implementation of Generalized End-to-End Loss for speaker verification

speaker-verification, PyTorch, speaker-identification, speaker-diarization, speaker-recognition
Python 84
6 years ago
nezhar / speech-condenser

A tool for summarizing dialogues from videos or audio

asr, speaker-diarization, speaker-identification, summarization
Python 82
2 years ago
cyrta / voxceleb

Mirror of the VoxCeleb dataset, a large-scale speaker identification dataset

speaker, speaker-identification, speaker-verification, speaker-recognition, dataset, corpus, speech
Shell 72
6 years ago