GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

Whisper

Whisper is an autoregressive language model developed by OpenAI. It is trained on a large corpus of text using a transformer architecture and is capable of generating high-quality natural language text. Whisper can be used for tasks such as language modeling, text completion, and text generation. It has shown impressive performance on various benchmarks and has been released by OpenAI to encourage research in the field of language modeling. Whisper is not yet available for public use, but it has the potential to transform the field of natural language processing and generate new opportunities for language-based applications.

Created by OpenAI

发布于 August 2021

Repository
openai/whisper
Website
openai.com
Wikipedia

相关主题

机器学习人工智能
ggml-org/whisper.cpp
https://static.github-zh.com/github_avatars/ggml-org?size=40
ggml-org / whisper.cpp

OpenAI Whisper语音识别模型,C++移植版本。

openaispeech-to-texttransformerWhisperinferencespeech-recognition
C++ 40.79 k
2 天前
https://static.github-zh.com/github_avatars/SYSTRAN?size=40
SYSTRAN / faster-whisper

#计算机科学#Faster Whisper transcription with CTranslate2

深度学习inferencequantizationspeech-recognitionspeech-to-texttransformerWhisperopenai
Python 16.55 k
13 天前
https://static.github-zh.com/github_avatars/m-bain?size=40
m-bain / whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

asrspeechspeech-recognitionspeech-to-textWhisper
Python 16.26 k
7 天前
https://static.github-zh.com/github_avatars/chidiwilliams?size=40
chidiwilliams / buzz

Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.

Whisper
Python 14.63 k
7 天前
https://static.github-zh.com/github_avatars/PaddlePaddle?size=40
PaddlePaddle / PaddleSpeech

PaddleSpeech 是基于飞桨 PaddlePaddle 的语音方向的开源模型库,用于语音和音频中的各种关键任务的开发,典型的应用包括:语音识别、语音翻译、语音合成等

transformerconformerspeech-translationstreaming-asrspeech-alignmentpunctuation-restorationstreaming-ttsspeech-synthesisttsasrspeech-recognition声音克隆vocodervoice-recognitionself-supervised-learningWhisper
Python 11.99 k
6 天前
https://static.github-zh.com/github_avatars/modelscope?size=40
modelscope / FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

conformerPyTorchspeech-recognitionparaformerpunctuationspeaker-diarizationrnntaudio-visual-speech-recognitionpretrained-modelvoice-activity-detectionWhisperdfsmnvadspeechgptspeechllm
Python 11 k
19 天前
https://static.github-zh.com/github_avatars/niedev?size=40
niedev / RTranslator

#安卓#Open source real-time translation app for Android that runs locally

translatorbluetooth-lerealtime-translatorAndroidonnxonnxruntimesentencepiecetransformerstranslationnllbWhispermobile-appoffline
C++ 8.14 k
3 天前
https://static.github-zh.com/github_avatars/xorbitsai?size=40
xorbitsai / inference

#大语言模型#Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any...

ggmlPyTorchchatglm部署flan-t5大语言模型wizardlm人工智能机器学习Whisperinferenceopenai-apimistralgemmallamallamacppvllmqwenllama3glm4
Python 8.03 k
5 小时前
Zackriya-Solutions/meeting-minutes
https://static.github-zh.com/github_avatars/Zackriya-Solutions?size=40
Zackriya-Solutions / meeting-minutes

#大语言模型#A free and open source, self hosted Ai based live meeting note taker and minutes summary generator that can completely run in your Local device (Mac OS and windows OS Support added. Working on adding ...

meeting-minutesmeeting-notesrecorder自动化cross-platformLinux大语言模型macOSWindowsRustWhisperwhisper-cpp人工智能livetranscripttranscription
C++ 6.36 k
25 天前
https://static.github-zh.com/github_avatars/argmaxinc?size=40
argmaxinc / WhisperKit

#IOS#On-device Speech Recognition for Apple Silicon

inferenceiOSspeech-recognitionSwiftWhispertransformersmacOSvisionOSwatchOS
Swift 4.7 k
3 天前
https://static.github-zh.com/github_avatars/MahmoudAshraf97?size=40
MahmoudAshraf97 / whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

asrspeaker-diarizationspeechspeech-recognitionspeech-to-textWhisper
Jupyter Notebook 4.63 k
2 个月前
https://static.github-zh.com/github_avatars/sanchit-gandhi?size=40
sanchit-gandhi / whisper-jax

#计算机科学#JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.

深度学习jaxspeech-recognitionspeech-to-textWhisper
Jupyter Notebook 4.6 k
1 年前
https://static.github-zh.com/github_avatars/NexaAI?size=40
NexaAI / nexa-sdk

#大语言模型#Nexa SDK is a comprehensive toolkit for supporting GGML and ONNX models. It supports text generation, image generation, vision-language models (VLM), Audio Language Model, auto-speech-recognition (ASR...

asredge-computing大语言模型on-device-aion-device-mlSDKstable-diffusiontransformersttsvlmlanguage-modelsdk-pythonWhisperaudio
Python 4.57 k
3 个月前
https://static.github-zh.com/github_avatars/wenet-e2e?size=40
wenet-e2e / wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

e2e-modelsPyTorchasrtransformerconformerproduction-readyautomatic-speech-recognitionspeech-recognitionWhisper
Python 4.56 k
5 天前
https://static.github-zh.com/github_avatars/leetcode-mafia?size=40
leetcode-mafia / cheetah

#大语言模型#Mac app for crushing tech interviews with AI

gptopenaiWhisperwhisper-cpp人工智能ChatGPTgpt-4SwiftSwiftUI
Swift 4.2 k
5 个月前
https://static.github-zh.com/github_avatars/huggingface?size=40
huggingface / distil-whisper

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

audiospeech-recognitionWhisper
Python 3.88 k
5 个月前
embarklabs/embark
https://static.github-zh.com/github_avatars/embarklabs?size=40
embarklabs / embark

#区块链#Framework for serverless Decentralized Applications using Ethereum, IPFS and other platforms

以太坊dappIPFSsmart-contractsServerlessdecentralized区块链框架Whisperswarm
JavaScript 3.79 k
1 年前
abus-aikorea/voice-pro
https://static.github-zh.com/github_avatars/abus-aikorea?size=40
abus-aikorea / voice-pro

Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isola...

faster-whisperttsWhispergradiosubtitlestranscriptiontranslatorwebuispeech-recognitionspeech-synthesisspeech-to-texttext-to-speechyt-dlp声音克隆podcastsaudiobookvoice-conversionkaraokewhisperx
Python 3.71 k
19 天前
Grt1228/chatgpt-java
https://static.github-zh.com/github_avatars/Grt1228?size=40
Grt1228 / chatgpt-java

#大语言模型#ChatGPT Java SDK支持流式输出、Gpt插件、联网。支持OpenAI官方所有接口。ChatGPT的Java客户端。OpenAI GPT-3.5-Turb GPT-4 Api Client for Java

ChatGPTgpt-35-turbogpt-4Javachatgpt-javaopenai-apiopenai-imagesopenai-whisperWhisperopenai-chatgpt
Java 3.45 k
10 个月前
https://static.github-zh.com/github_avatars/n3d1117?size=40
n3d1117 / chatgpt-telegram-bot

#大语言模型#🤖 A Telegram bot that integrates with OpenAI's official ChatGPT APIs to provide answers, written in Python

ChatGPTopenaiPythonTelegramdall-eWhisper
Python 3.3 k
12 天前
loading...