GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

audioldm

Website
Wikipedia
open-mmlab/Amphion
https://static.github-zh.com/github_avatars/open-mmlab?size=40
open-mmlab / Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, ...

audio-generationaudio-synthesisaudioldmmusic-generationnaturalspeech2singing-voice-conversionspeech-synthesistext-to-audiotext-to-speechvall-evoice-conversionauditfastspeech2vitsemiliamaskgctvocoder
Python 9.15 k
20 天前
https://static.github-zh.com/github_avatars/gitmylo?size=40
gitmylo / audio-webui

A webui for different audio related Neural Networks

人工智能audioldmbarkrvctext-to-audiotext-to-speech声音克隆audiocraftmusicgenerative-musicttsaioall-in-one
Python 1.17 k
1 个月前
https://static.github-zh.com/github_avatars/ivcylc?size=40
ivcylc / OpenMusic

OpenMusic: SOTA Text-to-music (TTM) Generation

人工智能diffusion-modelsmusic-generationtext-to-audioai-musicaudioldmdiffusion-transformerdithifi-ganvall-e
Python 568
2 个月前
https://static.github-zh.com/github_avatars/Dartvauder?size=40
Dartvauder / NeuroSandboxWebUI

#大语言模型#(Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) on python (Gradio interface). Translated on 3 languages

gradio大语言模型神经网络Pythonstable-diffusionttswebuiWhisperdiffusersllamacpptransformersaudioldmwav2lipcogvideoxfluxrvc
Python 96
6 天前
https://static.github-zh.com/github_avatars/olaviinha?size=40
olaviinha / NeuralTextToAudio

Text prompt steered synthetic audio generators

text2audioaudio-generationaudio-synthesisaudioldmmusic-generationvoice-synthesis声音克隆audioaudio-processingcolabcolab-notebook
Jupyter Notebook 47
2 个月前
https://static.github-zh.com/github_avatars/zelaki?size=40
zelaki / DreamSound

[ICASSP'24] Investigating Personalization Methods in Text to Music Generation

dreamboothaudioldm
Python 38
1 年前
https://static.github-zh.com/github_avatars/camenduru?size=40
camenduru / audioldm-colab

AudioLDM text to audio colab

colabcolab-notebooktext-to-audioaudioldm
Jupyter Notebook 19
2 年前
https://static.github-zh.com/github_avatars/TemporalLabsLLC-SOL?size=40
TemporalLabsLLC-SOL / TemporalPromptEngine

A comprehensive, click to install, fully open-source, Video + Audio Generation AIO Toolkit using advanced prompt engineering plus the power of CogVideox + AudioLDM2 + Python!

人工智能audioprompt-engineeringVideoaudioldmcogvideoxvideogeneration
Python 19
6 个月前
https://static.github-zh.com/github_avatars/dimitreOliveira?size=40
dimitreOliveira / GenAI-GeoGuesser

#大语言模型#Generative AI version of the GeoGuesser game.

audioldmgemmagemma-2b-itgenaigenerative-ai大语言模型stable-diffusiontext-to-audiotext-to-image
Python 4
1 年前
https://static.github-zh.com/github_avatars/Danand?size=40
Danand / audio-ldm-webui

Simple web UI for AudioLDM 2.

audiocraftaudioldmwebui
Python 1
1 年前
https://static.github-zh.com/github_avatars/Abdelhakim-gh?size=40
Abdelhakim-gh / GenAI_Fusion_Multimodale

Workshop for Multimodale media generator

audioldmgenerative-aigradiomultimodalstable-diffusiontext-to-audiotext-to-image
Jupyter Notebook 1
5 个月前
https://static.github-zh.com/github_avatars/jeanhacker28?size=40
jeanhacker28 / geo-guesser

#大语言模型#In this game, your given an image for so many seconds to view. Then you have to guess just by clicking on any point in the world that the photo was taken. NOTICE: This game is INCOMPLETE

audioldmbrowsergemma-2b-itgenerative-aigeospatial大语言模型locationReactstable-diffusionTypeScriptWebsite
JavaScript 0
17 小时前