#计算机科学#A Collection of Variational Autoencoders (VAE) in PyTorch.
Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)
[NeurIPS 2024]OmniTokenizer: one model and one weight for image-video joint tokenization.
A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder
Torchélie is a set of utility functions, layers, losses, models, trainers and other things for PyTorch.
(ECCV 2024) SignAvatars: A Large-scale 3D Sign Language Holistic Motion Dataset and Benchmark
Language Quantized AutoEncoders
#计算机科学#Fast and scalable search of whole-slide images via self-supervised deep learning - Nature Biomedical Engineering
#计算机科学#Voice conversion (VC) investigation using three variants of VAE
This repo implements VQVAE on mnist and as well as colored version of mnist images. It also implements simple LSTM for generating sample numbers using the encoder outputs of trained VQVAE
#计算机科学#VQ-VAE/GAN implementation in pytorch-lightning
#自然语言处理#Inverse DALL-E for Optical Character Recognition
Experimental implementation for a sparse-dictionary based version of the VQ-VAE2 paper
official training and inference code of bitwise tokenizer
#计算机科学#Large-Scale Bidirectional Training for Zero-Shot Image Captioning
#计算机科学#Image Generation using VQVAE and GPT Models
#计算机科学#Tensorflow Implementation of "Theory and Experiments on Vector Quantized Autoencoders"