vqvae · GitHub Topics

fishaudio / fish-speech

SOTA Open Source TTS

llama transformer tts valle vits vqgan vqvae

Python 22.52 k

8 天前

AntixK / PyTorch-VAE

#计算机科学#A Collection of Variational Autoencoders (VAE) in PyTorch.

PyTorch pytorch-implementation vae vae-implementation 深度学习 reproducible-research paper-implementations pytorch-vae variational-autoencoders architecture vqvae

Python 7.28 k

4 个月前

v-iashin / SpecVQGAN

Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)

transformer vqvae Generative Adversarial Network PyTorch audio-generation melgan multi-modal video-understanding evaluation-metrics audio Video

Jupyter Notebook 363

1 年前

FoundationVision / OmniTokenizer

[NeurIPS 2024]OmniTokenizer: one model and one weight for image-video joint tokenization.

auto-regressive-model image-generation tokenization vae video-generation vqvae

Python 304

1 年前

k2kobayashi / crank

A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder

speech-synthesis voice-conversion vqvae adversarial-learning vocoder

Python 170

1 年前

ZhengdiYu / SignAvatars

(ECCV 2024) SignAvatars: A Large-scale 3D Sign Language Holistic Motion Dataset and Benchmark

human-pose-estimation motion-generation smplx vqvae eccv2024

Python 115

14 天前

Vermeille / Torchelie

Torchélie is a set of utility functions, layers, losses, models, trainers and other things for PyTorch.

PyTorch utils perceptual loss Generative Adversarial Network vqvae torch

Python 110

7 个月前

haoliuhl / language-quantized-autoencoders

Language Quantized AutoEncoders

bert large-language-models multimodal roberta vqvae

Python 108

2 年前

mahmoodlab / SISH

#计算机科学#Fast and scalable search of whole-slide images via self-supervised deep learning - Nature Biomedical Engineering

pathology image-retrieval image-search-engine histopathology friendly interactive shell 深度学习 vqvae

Python 107

2 年前

hqyyqh888 / RobustSemanComm

Demo of robust semantic communication against semantic noise

mask vqvae

Python 80

2 年前

Neur-IO / OptVQ

Towards training VQ-VAE models robustly!

optimal-transport vq-vae vqgan vqvae

Python 77

17 天前

vsimkus / vae-voice-conversion

#计算机科学#Voice conversion (VC) investigation using three variants of VAE

vae voice-conversion 机器学习 vqvae

Python 59

6 年前

explainingai-code / VQVAE-Pytorch

This repo implements VQVAE on mnist and as well as colored version of mnist images. It also implements simple LSTM for generating sample numbers using the encoder outputs of trained VQVAE

PyTorch vq-vae vqvae

Python 57

1 年前

SerezD / vqvae-vqgan-pytorch-lightning

#计算机科学#VQ-VAE/GAN implementation in pytorch-lightning

深度学习 PyTorch pytorch-lightning vqgan vqvae

Python 45

9 个月前

affjljoo3581 / Inverse-DALL-E-for-Optical-Character-Recognition

#自然语言处理#Inverse DALL-E for Optical Character Recognition

dalle 自然语言处理 gpt2 huggingface image-captioning image-generation image-to-text multimodal OCR optical-character-recognition PyTorch text-to-image transformers vqvae

Python 38

3 年前

FoundationVision / BitVAE

official training and inference code of bitwise tokenizer

autoregressive-models image-generation vae vqvae

Python 37

2 个月前

amzn / sparse-vqvae

Experimental implementation for a sparse-dictionary based version of the VQ-VAE2 paper

vqvae

Python 34

2 年前

MIMICLab / BITTERS

#计算机科学#Large-Scale Bidirectional Training for Zero-Shot Image Captioning

深度学习 image-captioning PyTorch pytorch-lightning vqvae bitters transformer

Python 21

2 年前

BhanuPrakashPebbeti / Image-Generation-Using-VQVAE

#计算机科学#Image Generation using VQVAE and GPT Models

深度学习 vqvae gpt image-generation 人工智能

Jupyter Notebook 18

5 个月前

jaywalnut310 / Vector-Quantized-Autoencoders

#计算机科学#Tensorflow Implementation of "Theory and Experiments on Vector Quantized Autoencoders"

vqvae vae autoencoder transformer Tensorflow 深度学习

Python 14

6 年前