GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

speech

Website
Wikipedia
https://static.github-zh.com/github_avatars/coqui-ai?size=40
coqui-ai / TTS

#计算机科学#🐸💬 - 一个深度学习的 TTS 语言合成库

Pythontext-to-speech深度学习speechPyTorchttsvocodertacotronglow-ttsmelganspeaker-encoderhifiganspeaker-encodingsmulti-speaker-ttstts-modelspeech-synthesis声音克隆voice-synthesisvoice-conversion
Python 40.73 k
10 个月前
babysor/MockingBird
https://static.github-zh.com/github_avatars/babysor?size=40
babysor / MockingBird

#计算机科学#🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time

人工智能speechPyTorch深度学习text-to-speechtts
Python 36.33 k
7 个月前
https://static.github-zh.com/github_avatars/svc-develop-team?size=40
svc-develop-team / so-vits-svc

#计算机科学#SoftVC VITS Singing Voice Conversion

人工智能audio-analysisGenerative Adversarial Networksinging-voice-conversionso-vits-svcsovitsvariational-inferencevcvitsvoicevoice-conversionvoiceconversionvoice-changerflow深度学习PyTorchspeech
Python 27.24 k
2 年前
huggingface/datasets
https://static.github-zh.com/github_avatars/huggingface?size=40
huggingface / datasets

#自然语言处理#🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools

自然语言处理数据集PyTorchTensorflowpandasNumPy机器视觉机器学习深度学习speechHacktoberfest
Python 20.27 k
2 天前
https://static.github-zh.com/github_avatars/IDEA-Research?size=40
IDEA-Research / Grounded-Segment-Anything

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

open-vocabulary-detectionopen-vocabulary-segmentationdata-generationautomatic-labeling-systemcaptionspeechimage-editing
Jupyter Notebook 16.46 k
9 个月前
https://static.github-zh.com/github_avatars/m-bain?size=40
m-bain / whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

asrspeechspeech-recognitionspeech-to-textWhisper
Python 16.24 k
7 天前
https://static.github-zh.com/github_avatars/kaldi-asr?size=40
kaldi-asr / kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

kaldiC++CUDAShellspeech-recognitionspeech-to-textspeaker-verificationspeaker-idspeech
Shell 14.9 k
2 个月前
https://static.github-zh.com/github_avatars/AIGC-Audio?size=40
AIGC-Audio / AudioGPT

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

audiogptmusicsoundspeechtalking-head
Python 10.16 k
1 年前
https://static.github-zh.com/github_avatars/mozilla?size=40
mozilla / TTS

#计算机科学#:robot: 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

深度学习text-to-speechPythonPyTorchtacotronttsspeaker-encoderdataset-analysistacotron2tensorflow2vocodermelganglow-ttsspeech
Jupyter Notebook 9.88 k
2 年前
https://static.github-zh.com/github_avatars/netease-youdao?size=40
netease-youdao / EmotiVoice

#计算机科学#EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

PyTorchspeechspeech-synthesisttsmulti-speakertext-to-speech深度学习promptemotivoice人工智能Pythonemotionstyle
Python 8.03 k
10 个月前
modelscope/modelscope
https://static.github-zh.com/github_avatars/modelscope?size=40
modelscope / modelscope

#自然语言处理#ModelScope: bring the notion of Model-as-a-Service to life.

自然语言处理cvspeechmulti-modalscience深度学习机器学习Python
Python 7.98 k
3 天前
https://static.github-zh.com/github_avatars/PaddlePaddle?size=40
PaddlePaddle / models

#自然语言处理#Officially maintained, supported by PaddlePaddle, including CV, NLP, Speech, Rec, TS, big models and so on.

paddlepaddle深度学习神经网络机器视觉自然语言处理recommendationspeechcvmodels
Python 6.93 k
5 个月前
https://static.github-zh.com/github_avatars/TalAter?size=40
TalAter / annyang

💬 Speech recognition for your site

speech-recognitionspeechspeech-to-textvoice
JavaScript 6.66 k
10 个月前
https://static.github-zh.com/github_avatars/snakers4?size=40
snakers4 / silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

voice-detectionvoice-recognitionvoice-commandsPyTorchonnxvoice-activity-detectionvoice-controlonnx-runtimeonnxruntimespeechspeech-processingvad
Python 6.05 k
4 天前
https://static.github-zh.com/github_avatars/snakers4?size=40
snakers4 / silero-models

Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple

speech-recognitionspeech-to-textsttasrpretrained-modelsenglishgermanspanishstt-benchmarkPyTorchcolabonnxtext-to-speechspeechspeech-synthesistts
Jupyter Notebook 5.34 k
2 年前
https://static.github-zh.com/github_avatars/MahmoudAshraf97?size=40
MahmoudAshraf97 / whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

asrspeaker-diarizationspeechspeech-recognitionspeech-to-textWhisper
Jupyter Notebook 4.63 k
2 个月前
https://static.github-zh.com/github_avatars/metavoiceio?size=40
metavoiceio / metavoice-src

#计算机科学#Foundational model for human-like, expressive TTS

text-to-speech人工智能深度学习PyTorchspeechspeech-synthesisttsvoice-clonezero-shot-tts
Python 4.13 k
1 年前
https://static.github-zh.com/github_avatars/huggingface?size=40
huggingface / speech-to-speech

#计算机科学#Speech To Speech: an effort for an open-sourced and modular GPT4-o

人工智能assistantlanguage-model机器学习Pythonspeechspeech-synthesisspeech-to-textspeech-translation
Python 4.06 k
2 个月前
https://static.github-zh.com/github_avatars/fixie-ai?size=40
fixie-ai / ultravox

#大语言模型#A fast multimodal LLM for real-time voice

人工智能大语言模型slmspeech
Python 4.01 k
4 个月前
https://static.github-zh.com/github_avatars/jianchang512?size=40
jianchang512 / stt

Voice Recognition to Text Tool / 一个离线运行的本地音视频转字幕工具,输出json、srt字幕、纯文字格式

speechspeech-recognitionspeech-to-textstt
Python 3.5 k
6 个月前
loading...