#Awesome#An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
A collection of papers on Transformers for vision. Awesome Transformer with Computer Vision (CV)
Summary of Transformer applications for computer vision tasks.
Continuous Augmented Positional Embeddings (CAPE) implementation for PyTorch
#Computer Science#We use MixedWM38, the mixed-type wafer defect pattern dataset, for wafer defect pattern recognition with visual transformers.
#Awesome#A comprehensive paper list of Transformer & Attention for Vision Recognition / Foundation Model, including papers, codes, and related websites.
Implementation of Image Classification using Visual Transformers in Amazon SageMaker based on the ideas from research paper - Visual Transformers: Token-based Image Representation and Processing for C...
This repository contains the source code for the implementation of two deep learning models concerning the audio super resolution task.
#Computer Science#Team "team-name" solution for Weather4cast Challenge
Image-Scene-Classification with 30 different classes.
A Multimodal Deep Learning Approach for Skin Cancer Classification using ViTs (Visual Transformers)
#Computer Science#Swin backbone for a UNet network for semantic segmentation
#Computer Science#Human Pose Classifier using Vision Transformers (ViT) – end-to-end pipeline for preprocessing, training, testing, and deploying models with FastAPI/Streamlit and AWS integration.
#Computer Science#Open-source project for waste detection, developed by students of the UPC postgraduate course Artificial Intelligence with Deep Learning
Soft-Transformers For Continual Learning
Fine-tune the Vision Transformer (ViT) using LoRA and Optuna for hyperparameter search.
Implementation of Visual Attention (ViT) for Image Classification using PyTorch
#Computer Science#Sea Surface Temperature Reconstruction under Cloud Occlusion
Implementing federated learning on IoT devices using the CIFAR-10 dataset
#Computer Science#A Robust Approach Towards Distinguishing Natural and Computer Generated Images using Multi-Colorspace fused and Enriched Vision Transformer