# neural-tts

https://static.github-zh.com/github_avatars/keonlee9420?size=40

#Computer Science# A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, aim...

Python 326
3 years ago
https://static.github-zh.com/github_avatars/KevinMIN95?size=40
Python 251
4 years ago
https://static.github-zh.com/github_avatars/keonlee9420?size=40

PyTorch implementation of DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (focused on DiffSpeech)

Python 243
4 years ago
https://static.github-zh.com/github_avatars/keonlee9420?size=40
Python 195
4 years ago
https://static.github-zh.com/github_avatars/keonlee9420?size=40

PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech

Python 194
3 years ago
https://static.github-zh.com/github_avatars/keonlee9420?size=40

PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling

Python 190
4 years ago
https://static.github-zh.com/github_avatars/keonlee9420?size=40

#Computer Science# A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimat...

Python 146
3 years ago
https://static.github-zh.com/github_avatars/keonlee9420?size=40

PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.

Python 73
4 years ago
https://static.github-zh.com/github_avatars/keonlee9420?size=40

PyTorch Implementation of NCSOFT's FastPitchFormant: Source-filter based Decomposed Modeling for Speech Synthesis

Python 72
4 years ago
https://static.github-zh.com/github_avatars/mush42?size=40

A cross-platform inference engine for neural TTS models.

Rust 71
10 months ago
https://static.github-zh.com/github_avatars/keonlee9420?size=40
Python 69
4 years ago
https://static.github-zh.com/github_avatars/keonlee9420?size=40

PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis

Python 57
4 years ago
https://static.github-zh.com/github_avatars/keonlee9420?size=40

#Computer Science# PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single-, multi-speaker TTS and several techniques to ...

Python 48
2 years ago
https://static.github-zh.com/github_avatars/Mobile-Artificial-Intelligence?size=40

Babylon.cpp is a C and C++ library for grapheme-to-phoneme conversion and text-to-speech synthesis. For phonemization, an ONNX Runtime port of the DeepPhonemizer model is used. For speech synthesis VITS...

Python 23
15 days ago
https://static.github-zh.com/github_avatars/keonlee9420?size=40
Python 15
4 years ago
https://static.github-zh.com/github_avatars/QuantiusBenignus?size=40

#Computer Science# Let your GNOME desktop speak to you. Reads your desktop notifications or selected text out loud with a human-like voice using Piper. Uses a local LLM to summarize selected text.

JavaScript 12
4 months ago
https://static.github-zh.com/github_avatars/yokawasa?size=40

VS Code extension for multi-language text translation and TTS (text-to-speech) using Azure Cognitive Services. Please [✩Star] if you're using it!

TypeScript 7
4 years ago
https://static.github-zh.com/github_avatars/mahshid1378?size=40

PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling

Python 1
6 months ago