GitHub Chinese Community
#tts

CorentinJ / Real-Time-Voice-Cloning

#Computer Science# Real-Time-Voice-Cloning is a deep-learning-based speech synthesis tool that can clone a voice within 5 seconds.

deep-learning, PyTorch, TensorFlow, tts, voice-cloning, Python
Python 54.5 k
16 days ago

RVC-Boss / GPT-SoVITS

A powerful web UI for few-shot voice conversion and speech synthesis.

text-to-speech, tts, vits, voice-clone, voice-cloneai, voice-cloning
Python 47.62 k
2 days ago

coqui-ai / TTS

#Computer Science# 🐸💬 - A deep learning library for text-to-speech (TTS) synthesis.

Python, text-to-speech, deep-learning, speech, PyTorch, tts, vocoder, tacotron, glow-tts, melgan, speaker-encoder, hifigan, speaker-encodings, multi-speaker-tts, tts-model, speech-synthesis, voice-cloning, voice-synthesis, voice-conversion
Python 40.73 k
10 months ago

unslothai / unsloth

#Large Language Models# Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.

finetuning, fine-tuning, llama, llm, lora, mistral, qlora, gemma, llama3, unsloth, deepseek, deepseek-r1, gemma3, llama-4, llama4, text-to-speech, tts, qwen, qwen3
Python 40.54 k
3 days ago

2noise / ChatTTS

#Large Language Models# ChatTTS is a text-to-speech model designed specifically for dialogue scenarios, such as LLM assistant conversations. It supports both English and Chinese.

agent, text-to-speech, chat, ChatGPT, chattts, chinese, chinese-language, english, english-language, gpt, llm, llm-agent, natural-language-inference, Python, torch, tts
Python 36.8 k
23 days ago

babysor / MockingBird

#Computer Science# 🚀 AI voice mimicry: clone a voice in 5 seconds to generate arbitrary speech in real-time.

ai, speech, PyTorch, deep-learning, text-to-speech, tts
Python 36.33 k
7 months ago

mudler / LocalAI

#Large Language Models# 🤖 The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, tr...

llama, rwkv, ai, llm, stable-diffusion, API, Kubernetes, gpt4all, tts, musicgen, mamba, audio-generation, image-generation, text-generation, gemma, mistral, llama3, rerank, distributed, libp2p
Go 33.21 k
3 hours ago

myshell-ai / OpenVoice

Instant voice cloning by MIT and MyShell. Audio foundation model.

text-to-speech, tts, voice-clone, zero-shot-tts
Python 32.62 k
2 months ago

fishaudio / fish-speech

SOTA Open Source TTS

llama, transformer, tts, valle, vits, vqgan, vqvae
Python 21.75 k
3 days ago

NVIDIA / NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

machine-translation, speaker-recognition, asr, tts, generative-ai, multimodal, deep-learning, neural-networks, speaker-diariazation, speech-translation, speech-synthesis, large-language-models
Python 14.8 k
4 hours ago

FunAudioLLM / CosyVoice

#Large Language Models# Multi-lingual large voice generation model, providing full-stack inference, training, and deployment capabilities.

audio-generation, gpt-4o, text-to-speech, tts, cantonese, chatbot, ChatGPT, chinese, english, fine-grained, fine-tuning, japanese, korean, multi-lingual, natural-language-generation, Python, cosyvoice, cross-lingual, voice-cloning
Python 14.54 k
3 days ago

mastra-ai / mastra

#Large Language Models# The TypeScript AI agent framework. ⚡ Assistants, RAG, observability. Supports any LLM: GPT-4, Claude, Gemini, Llama.

agents, ai, chatbots, JavaScript, llm, Next, Node.js, React, TypeScript, workflows, evals, mcp, tts
TypeScript 14.18 k
1 day ago

pot-app / pot-desktop

🌈 A cross-platform app for selected-text translation and OCR.

translation, pot, Tauri, translate, pot-app, OCR, Linux, macOS, Windows, recognize, tts
JavaScript 12.65 k
1 month ago

PaddlePaddle / PaddleSpeech

PaddleSpeech is an open-source speech model library built on PaddlePaddle for developing a range of key speech and audio tasks. Typical applications include speech recognition, speech translation, and speech synthesis.

transformer, conformer, speech-translation, streaming-asr, speech-alignment, punctuation-restoration, streaming-tts, speech-synthesis, tts, asr, speech-recognition, voice-cloning, vocoder, voice-recognition, self-supervised-learning, Whisper
Python 11.99 k
5 days ago

DrewThomasson / ebook2audiobook

Convert ebooks to audiobooks with chapters and metadata using dynamic AI models and voice cloning. Supports 1,107+ languages!

audiobooks, Docker, epub, Linux, macOS, tts, Windows, xtts, voice-cloning, gradio, chinese, english, multilingual, colab-notebook, kaggle, audiobook
Python 10.01 k
12 hours ago

mozilla / TTS

#Computer Science# 🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

deep-learning, text-to-speech, Python, PyTorch, tacotron, tts, speaker-encoder, dataset-analysis, tacotron2, tensorflow2, vocoder, melgan, glow-tts, speech
Jupyter Notebook 9.88 k
2 years ago

rhasspy / piper

A fast, local neural text to speech system

speech-synthesis, text-to-speech, tts
C++ 9.31 k
1 month ago

readest / readest

#Android# Readest is a modern, feature-rich ebook reader designed for avid readers offering seamless cross-platform access, powerful tools, and an intuitive interface to elevate your reading experience.

ebook, ebook-reader, epub, Next, reader, Tauri, tts, Android, cross-platform, iOS, sync
TypeScript 8.65 k
5 hours ago

jianchang512 / clone-voice

A voice cloning tool with a web interface that records audio using your own voice or any other sound.

clonevoice, tts, voice-assistant, speech-analysis, sts
Python 8.59 k
6 months ago

fishaudio / Bert-VITS2

#Large Language Models# vits2 backbone with multilingual-bert

bert, bert-vits2, tts, vits, vits2, bert-vits, llm, friendly interactive shell, vocoder, agent
Python 8.47 k
5 days ago