audioldm · GitHub Topics

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, ...

audio-generation audio-synthesis audioldm music-generation naturalspeech2 singing-voice-conversion speech-synthesis text-to-audio text-to-speech vall-e voice-conversion audit fastspeech2 vits emilia maskgct vocoder

Python 9.38 k

4 个月前

gitmylo / audio-webui

A webui for different audio related Neural Networks

人工智能 audioldm bark rvc text-to-audio text-to-speech 声音克隆 audiocraft music generative-music tts aio all-in-one

Python 1.2 k

4 个月前

ivcylc / OpenMusic

OpenMusic: SOTA Text-to-music (TTM) Generation

人工智能 diffusion-models music-generation text-to-audio ai-music audioldm diffusion-transformer dit hifi-gan vall-e

Python 609

3 个月前

Dartvauder / NeuroSandboxWebUI

#大语言模型#(Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) on python (Gradio interface). Translated on 3 languages

gradio 大语言模型神经网络 Python stable-diffusion tts webui Whisper diffusers llamacpp transformers audioldm wav2lip cogvideox flux rvc

Python 101

16 天前

olaviinha / NeuralTextToAudio

Text prompt steered synthetic audio generators

text2audio audio-generation audio-synthesis audioldm music-generation voice-synthesis 声音克隆 audio audio-processing colab colab-notebook

Jupyter Notebook 49

5 个月前

zelaki / DreamSound

[ICASSP'24] Investigating Personalization Methods in Text to Music Generation

dreambooth audioldm

Python 41

1 年前

TemporalLabsLLC-SOL / TemporalPromptEngine

A comprehensive, click to install, fully open-source, Video + Audio Generation AIO Toolkit using advanced prompt engineering plus the power of CogVideox + AudioLDM2 + Python!

人工智能 audio prompt-engineering Video audioldm cogvideox videogeneration

Python 20

9 个月前

camenduru / audioldm-colab

AudioLDM text to audio colab

colab colab-notebook text-to-audio audioldm

Jupyter Notebook 19

2 年前

dimitreOliveira / GenAI-GeoGuesser

#大语言模型#Generative AI version of the GeoGuesser game.

audioldm gemma gemma-2b-it genai generative-ai 大语言模型 stable-diffusion text-to-audio text-to-image

Python 4

1 年前

Danand / audio-ldm-webui

Simple web UI for AudioLDM 2.

audiocraft audioldm webui

Python 1

2 年前

Abdelhakim-gh / GenAI_Fusion_Multimodale

Workshop for Multimodale media generator

audioldm generative-ai gradio multimodal stable-diffusion text-to-audio text-to-image

Jupyter Notebook 1

8 个月前

2025-comprehensive-design / AudioLDM-with-LoRA

Enhancing Diffusion-Based Music Generation Performance with LoRA.

audioldm diffusion-models huggingface lora

Python 1

18 天前

jeanhacker28 / geo-guesser

#大语言模型#In this game, your given an image for so many seconds to view. Then you have to guess just by clicking on any point in the world that the photo was taken. NOTICE: This game is INCOMPLETE

audioldm browser gemma-2b-it generative-ai geospatial 大语言模型 location React stable-diffusion TypeScript Website

JavaScript 0

19 天前