Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isola...
faster_whisper GUI with PySide6
Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.
#Large Language Models#Turnkey self-hosted offline transcription and diarization service with LLM summary
A desktop application that transcribes audio from files, microphone input or YouTube videos with the option to translate the content and create subtitles.
A simple GUI to use Whisper.
#Computer Science#Open-source subtitling platform 💻 for transcribing and translating video and audio in Indic languages.
Transcription from MP3 files to HTML, with or without an embedded player
A cross-platform, customizable VLC video player that can generate subtitles using the WhisperX model
Deploy Whisper on AWS
#Computer Science#Transcribe Like a Pro, Without Paying a Penny!
A sleek, web-based audio player featuring synchronized subtitle display, speaker diarization support, and keyboard controls in a modern, responsive interface
#Natural Language Processing#This repository contains a Jupyter notebook for qualitative researchers to transcribe, diarize speakers, and convert audio or video files into various text formats (CSV, TXT, JSON, and VTT).
#Large Language Models#An AI-powered video translation and dubbing tool that performs end-to-end video dubbing.
Generate fully aligned subtitles for any Video or Audio file on your local system for free using the amazing capabilities of WhisperX.
#Large Language Models#VideoWise is a video transcription and AI-powered analysis tool that helps users easily upload, transcribe, and interact with video content. Using WhisperX for high-quality transcriptions and Ollama f...
Code for our INTERSPEECH 2024 paper: Comparing ASR Systems in the Context of Speech Disfluencies.
A tool for automatically adding subtitles to short social media videos