# neural-tts

keonlee9420 / DiffGAN-TTS

PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs

text-to-speech, Deep Neural Network, PyTorch, tts, speech-synthesis, generative-model, ddpm, diffusion, neural-tts, non-autoregressive, Generative Adversarial Network, hifi-gan, diffusion-models, fastspeech, multi-speaker-tts
Python 336
3 years ago
keonlee9420 / PortaSpeech

PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech

text-to-speech, normalizing-flows, generative-model, Deep Neural Network, PyTorch, tts, speech-synthesis, neural-tts, non-autoregressive, portable-tts, vae, fastspeech, hifi-gan, high-quality
Python 335
3 years ago
keonlee9420 / Comprehensive-Transformer-TTS

#Computer Science# A Non-Autoregressive Transformer-based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, aim...

text-to-speech, unsupervised, non-autoregressive, multi-speaker, tts, PyTorch, fastspeech, transformer, neural-tts, fastspeech2, hifi-gan, sota, speech-synthesis, Deep Learning
Python 326
3 years ago
KevinMIN95 / StyleSpeech

Official implementation of Meta-StyleSpeech and StyleSpeech

official, tts, meta-learning, text-to-speech, neural-tts, speech-synthesis, speech
Python 246
3 years ago
keonlee9420 / DiffSinger

PyTorch implementation of DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (focused on DiffSpeech)

text-to-speech, diffusion, ddpm, PyTorch, singing-voice, tts, speech-synthesis, english, diffusion-models, neural-tts, non-autoregressive, fastspeech, diffsinger
Python 240
3 years ago
keonlee9420 / StyleSpeech

PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation

text-to-speech, PyTorch, tts, speech-synthesis, english, style, neural-tts, non-autoregressive, fastspeech, meta-learning, speaker, speaker-adaptation
Python 194
3 years ago
keonlee9420 / Cross-Speaker-Emotion-Transfer

PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech

text-to-speech, Deep Neural Network, PyTorch, tts, speech-synthesis, generative-model, neural-tts, non-autoregressive, semi-supervised-learning
Python 194
3 years ago
keonlee9420 / Parallel-Tacotron2

PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling

neural-tts, non-autoregressive, vae, self-attention, duration, speech-synthesis, PyTorch, tts, text-to-speech, english, fastspeech
Python 190
4 years ago
keonlee9420 / Comprehensive-E2E-TTS

#Computer Science# A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimat...

Deep Learning, fastspeech2, hifi-gan, jets, multi-speaker, neural-tts, non-autoregressive, PyTorch, sota, speech-synthesis, text-to-speech, tts, unsupervised, end-to-end
Python 146
3 years ago
keonlee9420 / VAENAR-TTS

PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.

vae, glow, non-autoregressive, tts, text-to-speech, duration, PyTorch, speech-synthesis, self-attention, neural-tts, unsupervised-learning
Python 73
4 years ago
mush42 / sonata

A cross-platform inference engine for neural TTS models.

C, gRPC, neural-tts, Python, speech-synthesis, text-to-speech, tts
Rust 72
7 months ago
keonlee9420 / FastPitchFormant

PyTorch Implementation of NCSOFT's FastPitchFormant: Source-filter based Decomposed Modeling for Speech Synthesis

text-to-speech, end-to-end, neural-tts, PyTorch, tts, speech-synthesis, pitch, fastspeech, non-autoregressive
Python 72
4 years ago
keonlee9420 / WaveGrad2

PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis

text-to-speech, neural-tts, audio, synthesis, non-autoregressive, score-matching, duration, robust, PyTorch, tts, speech-synthesis, text-to-audio, end-to-end
Python 69
4 years ago
keonlee9420 / Daft-Exprt

PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis

text-to-speech, style, PyTorch, tts, speech-synthesis, english, speaker, neural-tts, non-autoregressive
Python 56
4 years ago
keonlee9420 / Comprehensive-Tacotron2

#Computer Science# PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single- and multi-speaker TTS and several techniques to ...

text-to-speech, tts, tacotron, tacotron2, PyTorch, speech-synthesis, autoregressive, multi-speaker, robustness, efficiency, neural-tts, hifi-gan, Deep Learning
Python 48
2 years ago
Mobile-Artificial-Intelligence / babylon.cpp

Babylon.cpp is a C and C++ library for grapheme-to-phoneme conversion and text-to-speech synthesis. For phonemization, an ONNX Runtime port of the DeepPhonemizer model is used. For speech synthesis VITS...

Artificial Intelligence, elevenlabs, neural-tts, onnx, onnxruntime, tts, vits, Voice Cloning, onnx-models, onnx-runtime
Python 21
10 months ago
keonlee9420 / Deep-Learning-TTS-Template

#Computer Science# This is a template for the Non-autoregressive Deep Learning-Based TTS model (in PyTorch).

text-to-speech, PyTorch, tts, speech-synthesis, Deep Learning, fastspeech, non-autoregressive, neural-tts, template
Python 15
4 years ago
QuantiusBenignus / voluble

#Computer Science# Let your GNOME desktop speak to you. Reads your desktop notifications or selected text out loud with a human-like voice using Piper. Uses a local LLM to summarize selected text.

gnome, gnome-shell-extension, neural-tts, notifications, text-to-speech, Web Accessibility (a11y), kiss, speech-synthesis, tts, autoencoder, Deep Learning, vits, gnome-extension, Machine Learning
JavaScript 9
2 months ago
yokawasa / vscode-translator-voice

VS Code extension for multi-language text translation and TTS (text-to-speech) using Azure Cognitive Services. Please [✩Star] if you're using it!

Visual Studio Code, VS Code Extension, TypeScript, translator, tts, neural-tts, azure-cognitive-services, voice, speech
TypeScript 7
4 years ago
marcel2215 / native-speaker

A simple Discord bot that synthesizes speech directly to a voice channel via text commands with support for sound effects.

audio, Bot, Discord, discord-bot, effects, sound, speech, text-to-speech, tts, voice, player, discord-api, Python, speech-synthesis, Azure, neural-tts, neural, sound-effects
Python 0
1 year ago