#Computer Science# StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
Zero-Shot Speech Editing and Text-to-Speech in the Wild
A multi-voice TTS system trained with an emphasis on quality
High-quality multilingual text-to-speech library by MyShell.ai. Supports English, Spanish, French, Chinese, Japanese, and Korean.
An open-source implementation of Microsoft's VALL-E X zero-shot speech synthesis model
Open-source AI cookbook
A C# wrapper of OBSproject/libdshowcapture using .NET 8.0 with C++/CLI to utilize DirectShow on Windows
A VITS fine-tuning pipeline for fast speaker-adaptation TTS and many-to-many voice conversion
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
#Large Language Models# Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs