集合主题趋势排行榜

audio-to-text

pluja / whishper

Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!

人工智能 audio-to-text Go subtitles sveltekit transcription Whisper ui Web app speech-recognition speech-to-text stt Web

Svelte 2.71 k

2 个月前

SakiRinn / LiveCaptions-Translator

Lightweight and powerful real-time audio/speech translation tool based on Windows LiveCaptions.

livecaptions Windows speech-to-text audio-to-text API api-integration translation real-time

C# 1.51 k

22 天前

Saik0s / Whisperboard

#IOS#The open-source iOS app that's making quality voice transcription more accessible on mobile devices.

openai iOS speech-recognition speech-to-text SwiftUI transcription audio-to-text tca Whisper whisper-cpp

Swift 926

22 天前

URUWorks / TeroSubtitler

#编辑器#Tero Subtitler is an open source, cross-platform, and free subtitle editing software.

editor Linux macOS subtitles Windows 免费 captions Open Source transcription audio-to-text FFmpeg mpv Whisper yt-dlp 人工智能 blu-ray

Pascal 388

6 天前

Kabanosk / whisper-website

Simple web application, which can be used to convert audio to subtitles by OpenAI's Whisper model

FastAPI openai speech-to-text Whisper Python uvicorn Website audio-to-text subtitles subtitles-generator Open Source Hacktoberfest

Python 319

2 个月前

HenestrosaDev / audiotext

A desktop application that transcribes audio from files, microphone input or YouTube videos with the option to translate the content and create subtitles.

Python speech-recognition audio-to-text speech-to-text subtitles-generator whisperx FFmpeg

Python 228

1 年前

javedali99 / audio-to-text-transcription

This repository contains a Python script that allows users to download the audio from a YouTube video, transcribe it into text, detect the language and save the transcription in txt file automatically...

audio-to-text Open Source openai Python transcription Whisper audio YouTube

Python 152

6 个月前

bai0012 / Whisper_auto2lrc

Use Whisper to convert audio files into LRC subtitle files in bulk. 使用whisper实现将音频文件批量转换为lrc字幕文件

audio-to-text Python Whisper Windows PyTorch

Python 66

2 个月前

rudymohammadbali / Whisper-Transcriber

Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities.

GUI openai Whisper audio-to-text stt

Python 57

1 年前

persiandataset / PersianSpeech

Persian ASR dataset

dataset asr persian-speech-recognition audio-to-text

2 年前

xndien2004 / Speech-to-text-Realtime-with-extension

"Speech-to-Text Realtime with Extension" is a browser extension that converts speech to text in real-time. It supports multiple languages, making it ideal for note-taking, customer service, and access...

audio-to-text Django Google 云 openai-api realtime

Jupyter Notebook 37

1 年前

KostasEreksonas / Audio-transcriber

Simple Python audio transcriber using OpenAI's Whisper speech recognition model

audio openai openai-whisper text transcription Whisper audio-to-text Python pip YouTube youtube-dl

Python 34

7 个月前

Education-Victory / whisper-webui

WebUI for Whisper API

audio-to-text transcription

Python 32

1 年前

inferless / whisper-large-v3

State‑of‑the‑art speech recognition model for English, delivering transcription accuracy across diverse audio scenarios. <metadata> gpu: T4 | collections: ["CTranslate2"] </metadata>

audio-to-text

Python 17

6 个月前

thinh-vu / ur_audio_sub

Generate text captions for audio files & youtube video using OpenAI Whisper on Google Colab. Multiple languages support.

audio-to-text audio-transcription speech-recognition Whisper

Jupyter Notebook 16

2 年前

GabrieleRisso / aiyu

core shell functions building blocks for advanced AI pipelines

人工智能 audio-to-text gpt-3 stable-diffusion text-to-audio text-to-image text-to-speech tts Whisper

2 年前

markydoodled / Journal.it

#IOS#A SwiftUI App For People Who Need To Take Down Important Information Quickly.

SwiftUI texteditor camera iOS macOS audio-editing photo-editing audio-to-text Swift

Swift 13

2 年前

AzizBenAli / YouTube-AI-Assistant

Develop a python application that allows you to extract valuable insights, engage in meaningful conversations, and explore video content in a whole new way.

agents 聊天机器人 retrieval-augmented-generation Streamlit youtube-api audio-to-text embeddings openai pineconedb conversational-agents conversational-bots memory generative-ai

Python 12

2 年前

gisty-org / chrome-extension

Chrome Extension to capture captions of ongoing meetings by using webkitspeechrecognition api for all the web video conferencing platforms (for google meet, it directly extracts the captions) and send...

JavaScript audio-to-text Google Meet Chrome 插件

JavaScript 11

2 年前

gabrielsenadev / audioinsight

AudioInsight is a web application that processes audio, generates transcriptions, and allows users to ask questions about the related audio.

audio-processing audio-to-text cloudflare-ai full-stack webdev Whisper

TypeScript 8

1 年前

Website
Wikipedia