Instantly generate AI-powered subtitles on your device. Works standalone or connects to DaVinci Resolve.
#面试#Cutting edge AI technology for automated audio transcription. A nice GUI for OpenAIs Whisper and pyannote (speaker identification)
Open source inference code for Rev's model
Very fast, accurate speaker diarization
Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.
Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.
Official repository for Mamba-based Segmentation Model for Speaker Diarization
EchoInStone is an audio processing tool that transcribes, diarizes, and aligns speaker segments from audio files, prioritizing accuracy and reliability.
Transcription from mp3 files to html with or without embedded player
Automatically setup the AISHELL-4 and MSDWild dataset for usage with pyannote-database (and pyannote-audio)
#大语言模型#A package that can be locally executed to generate minutes in Japanese
speech to text gui for different Whisper models and backends, including whisper.cpp, mlx-whisper, faster-whisper, ctranslate2; applies pyannote for diarization
Faster Whisper with Speaker Diarization
#大语言模型#you feed in a video; it outputs context contained clips resized to 9:16, keeping speaker in center
#自然语言处理#Hobby project to transcribe audio files from meetings to transcripts with a summary
Subtitle generation w/ Speaker Diarization using Whisper and pyannote.audio
Toolkit for using Whisper to transcribe YouTube videos. Includes Whisper transcription of YouTube videos, conversion of YouTube video into HuggingFace dataset (using audio and subtitles) and evaluatio...
#自然语言处理#🎵 Complete offline audio transcription system with speaker diarization using OpenAI Whisper and PyAnnote. Features automatic audio cleaning, precise timestamps, multiple output formats (JSON/TXT/Mark...