#自然语言处理#为 Jax、PyTorch 和 TensorFlow 打造的先进的自然语言处理
OpenAI Whisper语音识别模型,C++移植版本。
#计算机科学#DeepSpeech 是一款开源嵌入式(离线、设备上)语音识别引擎,最低可以在树莓派上运行
#计算机科学#Faster Whisper transcription with CTranslate2
🧠 Leon is your open-source personal assistant.
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
kaldi-asr/kaldi is the official location of the Kaldi project.
#自然语言处理#Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!
#安卓#Vosk 是一个离线的语言识别工具。支持 Python, Java, Node.JS, C#, C++ ,能识别20+种语言,包括中文、英语、法语等。
PaddleSpeech 是基于飞桨 PaddlePaddle 的语音方向的开源模型库,用于语音和音频中的各种关键任务的开发,典型的应用包括:语音识别、语音翻译、语音合成等
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
#计算机科学#A PyTorch-based Speech Toolkit
Speech recognition module for Python, supporting several engines and APIs, online and offline.
#自然语言处理#OpenVINO™ is an open source toolkit for optimizing and deploying AI inference
A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
#计算机科学#Facebook AI Research's Automatic Speech Recognition Toolkit
#大语言模型#Multilingual Voice Understanding Model