GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

audio-generation

Website
Wikipedia
https://static.github-zh.com/github_avatars/mudler?size=40
mudler / LocalAI

#大语言模型#:robot: The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, tr...

llamarwkv人工智能大语言模型stable-diffusionAPIKubernetesgpt4allttsmusicgenmambaaudio-generationimage-generationtext-generationgemmamistralllama3rerankdistributedlibp2p
Go 33.21 k
4 小时前
https://static.github-zh.com/github_avatars/FunAudioLLM?size=40
FunAudioLLM / CosyVoice

#大语言模型#Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

audio-generationgpt-4otext-to-speechttscantonese聊天机器人ChatGPT中文englishfine-grainedfine-tuningjapanesekoreanmulti-lingualnatural-language-generationPythoncosyvoicecross-lingual声音克隆
Python 14.54 k
3 天前
open-mmlab/Amphion
https://static.github-zh.com/github_avatars/open-mmlab?size=40
open-mmlab / Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, ...

audio-generationaudio-synthesisaudioldmmusic-generationnaturalspeech2singing-voice-conversionspeech-synthesistext-to-audiotext-to-speechvall-evoice-conversionauditfastspeech2vitsemiliamaskgctvocoder
Python 9.15 k
20 天前
https://static.github-zh.com/github_avatars/multimodal-art-projection?size=40
multimodal-art-projection / YuE

#计算机科学#YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open

foundation-modelsmusic-generationhuggingfacellamaaudio-generation声音克隆大语言模型人工智能深度学习gpt
Python 5.08 k
11 天前
https://static.github-zh.com/github_avatars/haoheliu?size=40
haoheliu / AudioLDM

AudioLDM: Generate speech, sound effects, music and beyond, with text.

audio-generation
Python 2.68 k
16 天前
https://static.github-zh.com/github_avatars/haoheliu?size=40
haoheliu / AudioLDM2

Text-to-Audio/Music Generation

audio-generation
Python 2.44 k
9 个月前
rsxdalv/TTS-WebUI
https://static.github-zh.com/github_avatars/rsxdalv?size=40
rsxdalv / TTS-WebUI

A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, Mus...

gradiotext-to-speechtts人工智能audio-generationGeneratormusicmusicgenrvcmagnetgenerative-aiopenai-api
TypeScript 2.26 k
6 天前
https://static.github-zh.com/github_avatars/archinetai?size=40
archinetai / audio-diffusion-pytorch

#计算机科学#Audio generation using diffusion models, in PyTorch.

人工智能audio-generation深度学习denoising-diffusion
Python 2.06 k
2 年前
https://static.github-zh.com/github_avatars/archinetai?size=40
archinetai / audio-ai-timeline

#计算机科学#A timeline of the latest AI models for audio generation, starting in 2023!

人工智能audio-generation机器学习
1.9 k
1 年前
https://static.github-zh.com/github_avatars/lucidrains?size=40
lucidrains / soundstorm-pytorch

#计算机科学#Implementation of SoundStorm, Efficient Parallel Audio Generation from Google Deepmind, in Pytorch

人工智能audio-generation深度学习non-autoregressivetransformersattention-mechanism
Python 1.51 k
2 个月前
declare-lab/tango
https://static.github-zh.com/github_avatars/declare-lab?size=40
declare-lab / tango

A family of diffusion models for text-to-audio generation.

audio-generationdiffusiondiffusion-modelslanguage-modelslarge-language-modelstext-to-audio
Python 1.17 k
6 个月前
https://static.github-zh.com/github_avatars/FunAudioLLM?size=40
FunAudioLLM / InspireMusic

InspireMusic: Music, Song, Audio Generation.

music-generationPyTorchaudio-generationaudio-processing
Python 1.12 k
1 个月前
https://static.github-zh.com/github_avatars/NVIDIA?size=40
NVIDIA / BigVGAN

Official PyTorch implementation of BigVGAN (ICLR 2023)

audio-synthesisspeech-synthesismusic-synthesisneural-vocoderaudio-generationsinging-voice-synthesis
Python 1.04 k
9 个月前
https://static.github-zh.com/github_avatars/Yuan-ManX?size=40
Yuan-ManX / ai-audio-datasets

#数据仓库#AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio app...

aigcaudioaudio-effect数据集人工智能audio-generation深度学习机器学习music-generation
765
4 个月前
https://static.github-zh.com/github_avatars/researchmm?size=40
researchmm / MM-Diffusion

[CVPR'23] MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation

audio-generationcontent-creationdiffusion-modelsmulti-modalityvideo-generation
Python 429
1 年前
https://static.github-zh.com/github_avatars/modelscope?size=40
modelscope / FunCodec

FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation et.al.

audio-generationcodecspeech-synthesisspeech-to-texttts
Python 408
1 年前
https://static.github-zh.com/github_avatars/metame-ai?size=40
metame-ai / awesome-audio-plaza

Daily tracking of awesome audio papers, including music generation, zero-shot tts, asr, audio generation

music-generationasraudio-generationAwesome Liststtszero-shot-tts
386
5 天前
https://static.github-zh.com/github_avatars/Yuan-ManX?size=40
Yuan-ManX / audio-development-tools

#计算机科学#This is a list of sound, audio and music development tools which contains machine learning, audio generation, audio signal processing, sound synthesis, spatial audio, music information retrieval, musi...

audioaudio-processingmusicsignal-processingspeech-processing深度学习dspspeech人工智能audio-generation机器学习music-generationspeech-synthesis
372
9 个月前
https://static.github-zh.com/github_avatars/v-iashin?size=40
v-iashin / SpecVQGAN

Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)

transformervqvaeGenerative Adversarial NetworkPyTorchaudio-generationmelganmulti-modalvideo-understandingevaluation-metricsaudioVideo
Jupyter Notebook 362
1 年前
https://static.github-zh.com/github_avatars/cabralpinto?size=40
cabralpinto / modular-diffusion

#计算机科学#Python library for designing and training your own Diffusion Models with PyTorch.

diffusion-modelsmodular-designPythonaudio-generation深度学习image-generation机器学习PyTorchtext-generationtransformer
Python 283
1 年前
loading...