#

audio-to-text

https://static.github-zh.com/github_avatars/pluja?size=40

Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!

Svelte 2.71 k
2 个月前
https://static.github-zh.com/github_avatars/SakiRinn?size=40

Lightweight and powerful real-time audio/speech translation tool based on Windows LiveCaptions.

C# 1.51 k
22 天前
https://static.github-zh.com/github_avatars/Saik0s?size=40

#IOS#The open-source iOS app that's making quality voice transcription more accessible on mobile devices.

Swift 926
22 天前
https://static.github-zh.com/github_avatars/Kabanosk?size=40

Simple web application, which can be used to convert audio to subtitles by OpenAI's Whisper model

Python 319
2 个月前
https://static.github-zh.com/github_avatars/HenestrosaDev?size=40

A desktop application that transcribes audio from files, microphone input or YouTube videos with the option to translate the content and create subtitles.

Python 228
1 年前
https://static.github-zh.com/github_avatars/javedali99?size=40

This repository contains a Python script that allows users to download the audio from a YouTube video, transcribe it into text, detect the language and save the transcription in txt file automatically...

Python 152
6 个月前
https://static.github-zh.com/github_avatars/bai0012?size=40

Use Whisper to convert audio files into LRC subtitle files in bulk. 使用whisper实现将音频文件批量转换为lrc字幕文件

Python 66
2 个月前
https://static.github-zh.com/github_avatars/rudymohammadbali?size=40

Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities.

Python 57
1 年前
https://static.github-zh.com/github_avatars/xndien2004?size=40

"Speech-to-Text Realtime with Extension" is a browser extension that converts speech to text in real-time. It supports multiple languages, making it ideal for note-taking, customer service, and access...

Jupyter Notebook 37
1 年前
https://static.github-zh.com/github_avatars/KostasEreksonas?size=40
Python 34
7 个月前
https://static.github-zh.com/github_avatars/inferless?size=40

State‑of‑the‑art speech recognition model for English, delivering transcription accuracy across diverse audio scenarios. <metadata> gpu: T4 | collections: ["CTranslate2"] </metadata>

Python 17
6 个月前
https://static.github-zh.com/github_avatars/thinh-vu?size=40

Generate text captions for audio files & youtube video using OpenAI Whisper on Google Colab. Multiple languages support.

Jupyter Notebook 16
2 年前
https://static.github-zh.com/github_avatars/markydoodled?size=40

#IOS#A SwiftUI App For People Who Need To Take Down Important Information Quickly.

Swift 13
2 年前
https://static.github-zh.com/github_avatars/AzizBenAli?size=40

Develop a python application that allows you to extract valuable insights, engage in meaningful conversations, and explore video content in a whole new way.

Python 12
2 年前
https://static.github-zh.com/github_avatars/gisty-org?size=40

Chrome Extension to capture captions of ongoing meetings by using webkitspeechrecognition api for all the web video conferencing platforms (for google meet, it directly extracts the captions) and send...

JavaScript 11
2 年前
https://static.github-zh.com/github_avatars/gabrielsenadev?size=40

AudioInsight is a web application that processes audio, generates transcriptions, and allows users to ask questions about the related audio.

TypeScript 8
1 年前
loading...
Website
Wikipedia