GitHub Chinese Community
#voice-cloning

CorentinJ / Real-Time-Voice-Cloning

#Computer Science# Real-Time-Voice-Cloning is a deep-learning speech synthesis tool that can clone a voice in 5 seconds.

deep learning, PyTorch, TensorFlow, tts, voice cloning, Python
Python 54.5 k
16 days ago
RVC-Boss / GPT-SoVITS

A powerful web UI for few-shot voice conversion and text-to-speech.

text-to-speech, tts, vits, voice-clone, voice-cloneai, voice cloning
Python 47.62 k
2 days ago
coqui-ai / TTS

#Computer Science# 🐸💬 - a deep learning library for TTS speech synthesis

Python, text-to-speech, deep learning, speech, PyTorch, tts, vocoder, tacotron, glow-tts, melgan, speaker-encoder, hifigan, speaker-encodings, multi-speaker-tts, tts-model, speech-synthesis, voice cloning, voice-synthesis, voice-conversion
Python 40.74 k
10 months ago
FunAudioLLM / CosyVoice

#Large Language Models# Multi-lingual large voice generation model, providing full-stack inference, training, and deployment capabilities.

audio-generation, gpt-4o, text-to-speech, tts, cantonese, chatbot, ChatGPT, Chinese, english, fine-grained, fine-tuning, japanese, korean, multi-lingual, natural-language-generation, Python, cosyvoice, cross-lingual, voice cloning
Python 14.54 k
4 days ago
Huanshere / VideoLingo

Netflix-level subtitle cutting, translation, alignment, and even dubbing: a one-click, fully automated AI video subtitle team.

ai-translation, dubbing, Localization (l10n), video-translation, voice cloning
Python 13.22 k
1 month ago
PaddlePaddle / PaddleSpeech

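PaddleSpeech is an open-source speech model library built on PaddlePaddle, for developing a range of key speech and audio tasks. Typical applications include speech recognition, speech translation, and speech synthesis.

transformer, conformer, speech-translation, streaming-asr, speech-alignment, punctuation-restoration, streaming-tts, speech-synthesis, tts, asr, speech-recognition, voice cloning, vocoder, voice-recognition, self-supervised-learning, Whisper
Python 11.99 k
6 days ago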
DrewThomasson / ebook2audiobook

Convert ebooks to audiobooks with chapters and metadata using dynamic AI models and voice cloning. Supports 1,107+ languages!

audiobooks, Docker, epub, Linux, macOS, tts, Windows, xtts, voice cloning, gradio, Chinese, english, multilingual, colab-notebook, kaggle, audiobook
Python 10.02 k
21 hours ago
multimodal-art-projection / YuE

#Computer Science# YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open

foundation-models, music-generation, huggingface, llama, audio-generation, voice cloning, large language models, artificial intelligence, deep learning, gpt
Python 5.08 k
11 days ago
abus-aikorea / voice-pro

Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isola...

faster-whisper, tts, Whisper, gradio, subtitles, transcription, translator, webui, speech-recognition, speech-synthesis, speech-to-text, text-to-speech, yt-dlp, voice cloning, podcasts, audiobook, voice-conversion, karaoke, whisperx
Python 3.71 k
19 days ago
Camb-ai / MARS5-TTS

MARS5 speech model (TTS) from CAMB.AI

speech, speech-synthesis, text-to-speech, voice-cloneai, voice cloning
Jupyter Notebook 2.77 k
10 months ago
IAHispano / Applio

A simple, high-quality voice conversion tool focused on ease of use and performance.

rvc, vc, vits, voice, artificial intelligence, voice cloning, voice-conversion, applio, voice-clone, PyTorch, speech, text-to-speech, tts
Python 2.42 k
8 days ago
voice-cloning-app / Voice-Cloning-App

#Computer Science# A Python/PyTorch app for easily synthesising human voices

Python, tts, text-to-speech, PyTorch, deep learning, voice cloning, tacotron2
Python 1.44 k
6 months ago
coqui-ai / open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

tts, stt, speech-to-text, text-to-speech, speech-recognition, speech-synthesis, speech-processing, voice-recognition, voice-activity-detection, voice cloning, speech-separation
1.34 k
1 year ago
gitmylo / audio-webui

A web UI for various audio-related neural networks

artificial intelligence, audioldm, bark, rvc, text-to-audio, text-to-speech, voice cloning, audiocraft, music, generative-music, tts, aio, all-in-one
Python 1.17 k
1 month ago
Tomiinek / Multilingual_Text_to_Speech

An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.

text-to-speech, speech-synthesis, multilingual, tts, voice cloning
Python 837
2 years ago
gitmylo / bark-voice-cloning-HuBERT-quantizer

The code for the bark-voicecloning model. Training and inference.

artificial intelligence, neural-networks, text-to-speech, voice cloning, voice-conversion
Python 703
2 years ago
PlayVoice / lora-svc

Singing voice conversion based on Whisper, with LoRA for singing voice cloning

singing-voice-conversion, voice-conversion, vits, voice cloning, Whisper, lora
Python 640
2 years ago
PaddlePaddle / Parakeet

PAddle PARAllel text-to-speech toolKIT (supporting Tacotron2, Transformer TTS, FastSpeech2/FastPitch, SpeedySpeech, WaveFlow and Parallel WaveGAN)

text-to-speech, speech-synthesis, tacotron2, fastspeech2, multi-speaker-tts, voice cloning
Python 611
4 years ago
jackaduma / CycleGAN-VC2

#Computer Science# Voice Conversion by CycleGAN (voice cloning/voice conversion): CycleGAN-VC2

voice-conversion, cyclegan, Generative Adversarial Network, deep learning, voice cloning, pytorch-implementation, speech-synthesis, pix2pix, aigc
Python 562
2 years ago
MiniMax-AI / MiniMax-MCP

Official MiniMax Model Context Protocol (MCP) server that enables interaction with powerful Text to Speech, image generation and video generation APIs.

image-generation, mcp, mcp-server, mcp-tools, text-to-speech, video-generation, image-to-video, text-to-image, text-to-video, voice cloning
Python 546
1 month ago