Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)
#计算机科学#Image Generation using VQVAE and GPT Models
#计算机科学#Interactive VQ-VAE (Vector-Quantized Variational Autoencoder) in the browser
#计算机科学#implementation of VQVAE in pytorch
#计算机科学#State of the art of generative models and in-depth study of diffusion models