GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

conformer

Website
Wikipedia
https://static.github-zh.com/github_avatars/PaddlePaddle?size=40
PaddlePaddle / PaddleSpeech

PaddleSpeech 是基于飞桨 PaddlePaddle 的语音方向的开源模型库,用于语音和音频中的各种关键任务的开发,典型的应用包括:语音识别、语音翻译、语音合成等

transformerconformerspeech-translationstreaming-asrspeech-alignmentpunctuation-restorationstreaming-ttsspeech-synthesisttsasrspeech-recognition声音克隆vocodervoice-recognitionself-supervised-learningWhisper
Python 12.11 k
9 天前
https://static.github-zh.com/github_avatars/modelscope?size=40
modelscope / FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

conformerPyTorchspeech-recognitionparaformerpunctuationspeaker-diarizationrnntaudio-visual-speech-recognitionpretrained-modelvoice-activity-detectionWhisperdfsmnvadspeechgptspeechllm
Python 11.76 k
8 天前
https://static.github-zh.com/github_avatars/wenet-e2e?size=40
wenet-e2e / wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

e2e-modelsPyTorchasrtransformerconformerproduction-readyautomatic-speech-recognitionspeech-recognitionWhisper
Python 4.71 k
19 天前
https://static.github-zh.com/github_avatars/FireRedTeam?size=40
FireRedTeam / FireRedASR

#大语言模型#Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR benchmarks, while also offering outstanding singing lyrics recogn...

asr大语言模型Open Sourcespeech-recognitionautomatic-speech-recognitionconformerspeechllmtransformer
Python 1.2 k
4 个月前
https://static.github-zh.com/github_avatars/sooftware?size=40
sooftware / conformer

[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)

conformertransformercnntransformer-xlasrspeech-recognitionPyTorchconvolutionspeechrecognition
Python 1.05 k
2 年前
https://static.github-zh.com/github_avatars/TensorSpeech?size=40
TensorSpeech / TensorFlowASR

⚡ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords

automatic-speech-recognitionspeech-recognitionspeech-to-texttensorflow2rnn-transducerconformertflitectcTensorflow
Python 987
2 个月前
https://static.github-zh.com/github_avatars/yeyupiaoling?size=40
yeyupiaoling / PPASR

#计算机科学#基于PaddlePaddle实现端到端中文语音识别,从入门到实战,超简单的入门案例,超实用的企业项目。支持当前最流行的DeepSpeech2、Conformer、Squeezeformer模型

asrpaddlepaddle深度学习中文speech-to-textspeechspeech-recognitionstreaming-asrconformer
Python 861
2 个月前
https://static.github-zh.com/github_avatars/yeyupiaoling?size=40
yeyupiaoling / MASR

#计算机科学#Pytorch实现的流式与非流式的自动语音识别框架,同时兼容在线和离线识别,目前支持Conformer、Squeezeformer、DeepSpeech2模型,支持多种数据增强方法。

deepspeechPyTorchasr深度学习speech-recognitionspeech-to-textspeechconformer
Python 688
2 个月前
https://static.github-zh.com/github_avatars/sooftware?size=40
sooftware / kospeech

Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.

speech-recognitionasrend-to-endPyTorchseq2seqtransformerattention-is-all-you-needconformer
Python 624
2 年前
https://static.github-zh.com/github_avatars/eeyhsong?size=40
eeyhsong / EEG-Conformer

EEG Transformer 2.0. i. Convolutional Transformer for EEG Decoding. ii. Novel visualization - Class Activation Topography.

EEGtransformerconformerattention
Python 591
1 年前
https://static.github-zh.com/github_avatars/liusongxiang?size=40
liusongxiang / ppg-vc

PPG-Based Voice Conversion

voice-conversionspeech-synthesisconformer
Python 342
3 年前
https://static.github-zh.com/github_avatars/voicekit-team?size=40
voicekit-team / T-one

T-one is a high-performance streaming ASR pipeline for Russian, specialized for the telephony domain.

asrconformerrussianspeechspeech-recognitionspeech-to-textstreamingstttelephony
Python 144
6 天前
https://static.github-zh.com/github_avatars/tuanio?size=40
tuanio / noisy-student-training-asr

#计算机科学#Pytorch implementation of Noisy Student Training for Automatic Speech Recognition and Automatic Pronunciation Error Detection problem

conformerpretrainedPyTorchsemi-supervised-learningdata-augmentation深度学习机器学习speech-recognition
Python 96
2 个月前
https://static.github-zh.com/github_avatars/istupakov?size=40
istupakov / onnx-asr

Automatic Speech Recognition in Python using ONNX models

onnxPythonspeech-recognitionspeech-to-textasrsttWhisperconformerkaldi
Python 77
1 个月前
https://static.github-zh.com/github_avatars/hyperion-ml?size=40
hyperion-ml / hyperion

Python toolkit for speech processing

speaker-recognitionadversarial-attackscifarmnistPyTorchcalibrationvq-vaevaeresnetefficientnettransformerconformer
Python 69
1 个月前
https://static.github-zh.com/github_avatars/MinkaiXu?size=40
MinkaiXu / CGCF-ConfGen

🧪 Learning Neural Generative Dynamics for Molecular Conformation Generation (ICLR 2021)

MoleculeiclrconformerPyTorch
Python 46
4 年前
https://static.github-zh.com/github_avatars/sooftware?size=40
sooftware / lightning-asr

Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.

asrspeech-recognitionpytorch-lightningconformerhydra
Python 46
4 年前
https://static.github-zh.com/github_avatars/TeaPoly?size=40
TeaPoly / Conformer-Athena

Dynamic Chunk Streaming and Offline Conformer based on athena-team/Athena.

tensorflow2conformertransformerasrTensorflowspeech-recognition
Python 44
3 年前
https://static.github-zh.com/github_avatars/Rishit-dagli?size=40
Rishit-dagli / Conformer

#计算机科学#An implementation of Conformer: Convolution-augmented Transformer for Speech Recognition, a Transformer Variant in TensorFlow/Keras

机器学习深度学习人工智能Tensorflowconformerspeech-recognitionKerastransformersattention-mechanismconvolutional-neural-networks
Python 44
4 年前
https://static.github-zh.com/github_avatars/Audio-WestlakeU?size=40
Audio-WestlakeU / SAR-SSL

A python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Multi-Channel Conformer” [TASLP 2024]

self-supervised-learningconformerfine-tuningmulti-channel
Python 37
10 个月前
loading...