# audio-classification

https://static.github-zh.com/github_avatars/YuanGongND?size=40

#Computer Science# Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".

Jupyter Notebook 1.34 k
2 years ago
https://static.github-zh.com/github_avatars/seth814?size=40

Code for YouTube series: Deep Learning for Audio Classification

Jupyter Notebook 569
3 years ago
https://static.github-zh.com/github_avatars/aqibsaeed?size=40
Jupyter Notebook 520
3 years ago
https://static.github-zh.com/github_avatars/towhee-io?size=40

#Natural Language Processing# Analyze unstructured data with Towhee, such as reverse image search, reverse video search, audio classification, question and answer systems, molecular search, etc.

Jupyter Notebook 515
2 years ago
https://static.github-zh.com/github_avatars/RetroCirce?size=40

The official code repo of "HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection"

Python 430
1 year ago
https://static.github-zh.com/github_avatars/YuanGongND?size=40

Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event Taggers"

Python 404
2 years ago
https://static.github-zh.com/github_avatars/YuanGongND?size=40

Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".

Python 396
3 years ago
https://static.github-zh.com/github_avatars/ksanjeevan?size=40

UrbanSound classification using Convolutional Recurrent Networks in PyTorch

Python 388
4 years ago
https://static.github-zh.com/github_avatars/kkoutini?size=40
Python 351
2 years ago
https://static.github-zh.com/github_avatars/drscotthawley?size=40
Python 268
4 years ago
https://static.github-zh.com/github_avatars/cwx-worst-one?size=40
Python 186
3 months ago
https://static.github-zh.com/github_avatars/kaistmm?size=40

#Computer Science# Official implementation of the work "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning"

Python 152
10 months ago
https://static.github-zh.com/github_avatars/YuanGongND?size=40

Dataset and baseline code for the VocalSound dataset (ICASSP 2022).

Jupyter Notebook 151
3 years ago
https://static.github-zh.com/github_avatars/YuanGongND?size=40

#Computer Science# Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation".

Python 149
2 years ago
https://static.github-zh.com/github_avatars/micah5?size=40
Python 137
6 years ago