GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

speech-synthesis

Website
Wikipedia
https://static.github-zh.com/github_avatars/coqui-ai?size=40
coqui-ai / TTS

#计算机科学#🐸💬 - 一个深度学习的 TTS 语言合成库

Pythontext-to-speech深度学习speechPyTorchttsvocodertacotronglow-ttsmelganspeaker-encoderhifiganspeaker-encodingsmulti-speaker-ttstts-modelspeech-synthesis声音克隆voice-synthesisvoice-conversion
Python 40.74 k
10 个月前
leon-ai/leon
https://static.github-zh.com/github_avatars/leon-ai?size=40
leon-ai / leon

🧠 Leon is your open-source personal assistant.

leonpersonal-assistantNode.jsPython人工智能speech-to-texttext-to-speechspeech-recognitionspeech-synthesisfliteassistantvirtual-assistant聊天机器人Botvoice-assistant自动化offline隐私ai-assistant
TypeScript 16.37 k
21 天前
https://static.github-zh.com/github_avatars/NVIDIA?size=40
NVIDIA / NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

machine-translationspeaker-recognitionasrttsgenerative-aimultimodal深度学习neural-networksspeaker-diariazationspeech-translationspeech-synthesislarge-language-models
Python 14.8 k
10 小时前
https://static.github-zh.com/github_avatars/NVIDIA?size=40
NVIDIA / DeepLearningExamples

#自然语言处理#深度学习示例

机器视觉深度学习drug-discoveryforecastinglarge-language-modelsmxnetpaddlepaddlePyTorchrecommender-systemsspeech-recognitionspeech-synthesisTensorflowtensorflow2translation自然语言处理
Jupyter Notebook 14.33 k
10 个月前
https://static.github-zh.com/github_avatars/PaddlePaddle?size=40
PaddlePaddle / PaddleSpeech

PaddleSpeech 是基于飞桨 PaddlePaddle 的语音方向的开源模型库,用于语音和音频中的各种关键任务的开发,典型的应用包括:语音识别、语音翻译、语音合成等

transformerconformerspeech-translationstreaming-asrspeech-alignmentpunctuation-restorationstreaming-ttsspeech-synthesisttsasrspeech-recognition声音克隆vocodervoice-recognitionself-supervised-learningWhisper
Python 11.99 k
5 天前
https://static.github-zh.com/github_avatars/rhasspy?size=40
rhasspy / piper

A fast, local neural text to speech system

speech-synthesistext-to-speechtts
C++ 9.31 k
1 个月前
https://static.github-zh.com/github_avatars/espnet?size=40
espnet / espnet

#计算机科学#End-to-End Speech Processing Toolkit

深度学习end-to-endchainerPyTorchkaldispeech-recognitionspeech-synthesisspeech-translationmachine-translationvoice-conversionspeech-enhancementspeech-separationsinging-voice-synthesisspeaker-diarizationtext-to-speech
Python 9.2 k
1 天前
open-mmlab/Amphion
https://static.github-zh.com/github_avatars/open-mmlab?size=40
open-mmlab / Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, ...

audio-generationaudio-synthesisaudioldmmusic-generationnaturalspeech2singing-voice-conversionspeech-synthesistext-to-audiotext-to-speechvall-evoice-conversionauditfastspeech2vitsemiliamaskgctvocoder
Python 9.15 k
19 天前
https://static.github-zh.com/github_avatars/voicepaw?size=40
voicepaw / so-vits-svc-fork

#计算机科学#基于 so-vits-svc4.0(V1)的一个分支,支持实时推理和图形化推理界面,且兼容其模型。

sovitsvitsvoice-conversionso-vits-svchubertsoftvcrealtimevoice-changer深度学习PyTorchspeech-synthesisGenerative Adversarial Networklightningpytorch-lightningHacktoberfest
Python 9.04 k
6 天前
https://static.github-zh.com/github_avatars/rany2?size=40
rany2 / edge-tts

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

ttsspeech-synthesistext-to-speech
Python 8.45 k
1 个月前
https://static.github-zh.com/github_avatars/netease-youdao?size=40
netease-youdao / EmotiVoice

#计算机科学#EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

PyTorchspeechspeech-synthesisttsmulti-speakertext-to-speech深度学习promptemotivoice人工智能Pythonemotionstyle
Python 8.03 k
10 个月前
https://static.github-zh.com/github_avatars/jaywalnut310?size=40
jaywalnut310 / vits

#计算机科学#VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

ttstext-to-speechPyTorch深度学习speech-synthesis
Python 7.49 k
2 年前
https://static.github-zh.com/github_avatars/yl4579?size=40
yl4579 / StyleTTS2

#计算机科学#StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

深度学习PyTorchspeaker-adaptationspeech-synthesistext-to-speechttswavlmdiffusion-modelslatent-diffusionlatent-diffusion-modelsGenerative Adversarial Network
Python 5.79 k
10 个月前
https://static.github-zh.com/github_avatars/snakers4?size=40
snakers4 / silero-models

Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple

speech-recognitionspeech-to-textsttasrpretrained-modelsenglishgermanspanishstt-benchmarkPyTorchcolabonnxtext-to-speechspeechspeech-synthesistts
Jupyter Notebook 5.34 k
2 年前
https://static.github-zh.com/github_avatars/espeak-ng?size=40
espeak-ng / espeak-ng

#安卓#eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.

espeak-ngespeakAndroidtext-to-speechspeech-synthesis
C 5.16 k
4 天前
https://static.github-zh.com/github_avatars/MoonInTheRiver?size=40
MoonInTheRiver / DiffSinger

DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code

text-to-speechdiffusion-speedupttsaaai2022singing-synthesisdiffusion-modelspeech-synthesissinging-voice-synthesissinging-voicesinging-voice-databaseMIDI
Python 4.52 k
3 个月前
https://static.github-zh.com/github_avatars/WhisperSpeech?size=40
WhisperSpeech / WhisperSpeech

An Open Source text-to-speech system built by inverting Whisper.

PyTorchspeech-synthesistts
Jupyter Notebook 4.28 k
7 天前
https://static.github-zh.com/github_avatars/metavoiceio?size=40
metavoiceio / metavoice-src

#计算机科学#Foundational model for human-like, expressive TTS

text-to-speech人工智能深度学习PyTorchspeechspeech-synthesisttsvoice-clonezero-shot-tts
Python 4.13 k
1 年前
https://static.github-zh.com/github_avatars/huggingface?size=40
huggingface / speech-to-speech

#计算机科学#Speech To Speech: an effort for an open-sourced and modular GPT4-o

人工智能assistantlanguage-model机器学习Pythonspeechspeech-synthesisspeech-to-textspeech-translation
Python 4.06 k
2 个月前
https://static.github-zh.com/github_avatars/TensorSpeech?size=40
TensorSpeech / TensorFlowTTS

😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other lan...

speech-synthesistext-to-speechtensorflow2melganfastspeechreal-timettsvocodermulti-speaker-ttsfastspeech2tacotron2tflite
Python 3.94 k
1 年前
loading...