#

pyannote

tmoroney/auto-subs
https://static.github-zh.com/github_avatars/tmoroney?size=40

Instantly generate AI-powered subtitles on your device. Works standalone or connects to DaVinci Resolve.

TypeScript 1.54 k
22 天前
https://static.github-zh.com/github_avatars/kaixxx?size=40

#面试#Cutting edge AI technology for automated audio transcription. A nice GUI for OpenAIs Whisper and pyannote (speaker identification)

Python 1.4 k
3 天前
https://static.github-zh.com/github_avatars/narcotic-sh?size=40
Python 123
2 天前
https://static.github-zh.com/github_avatars/FrenchKrab?size=40

Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.

Jupyter Notebook 88
2 年前
https://static.github-zh.com/github_avatars/clement-pages?size=40

Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.

Svelte 68
19 天前
https://static.github-zh.com/github_avatars/nttcslab-sp?size=40

Official repository for Mamba-based Segmentation Model for Speaker Diarization

Python 41
5 个月前
https://static.github-zh.com/github_avatars/jeanjerome?size=40

EchoInStone is an audio processing tool that transcribes, diarizes, and aligns speaker segments from audio files, prioritizing accuracy and reliability.

Python 35
2 个月前
https://static.github-zh.com/github_avatars/FrenchKrab?size=40

Automatically setup the AISHELL-4 and MSDWild dataset for usage with pyannote-database (and pyannote-audio)

Shell 12
3 个月前
https://static.github-zh.com/github_avatars/jumtra?size=40
Python 10
2 年前
https://static.github-zh.com/github_avatars/CrispStrobe?size=40

speech to text gui for different Whisper models and backends, including whisper.cpp, mlx-whisper, faster-whisper, ctranslate2; applies pyannote for diarization

Python 9
2 个月前
https://static.github-zh.com/github_avatars/jarvisx17?size=40

ASR (Automatic Speech Recognition) Notebooks

Jupyter Notebook 7
2 年前
https://static.github-zh.com/github_avatars/austinwmille?size=40

#大语言模型#you feed in a video; it outputs context contained clips resized to 9:16, keeping speaker in center

Python 7
2 个月前
https://static.github-zh.com/github_avatars/gillan-krishna?size=40
Python 4
3 年前
https://static.github-zh.com/github_avatars/dptools?size=40

Subtitle generation w/ Speaker Diarization using Whisper and pyannote.audio

Python 4
1 年前
https://static.github-zh.com/github_avatars/adamelkholyy?size=40

Toolkit for using Whisper to transcribe YouTube videos. Includes Whisper transcription of YouTube videos, conversion of YouTube video into HuggingFace dataset (using audio and subtitles) and evaluatio...

Python 3
1 年前
https://static.github-zh.com/github_avatars/TheSeraphim?size=40

#自然语言处理#🎵 Complete offline audio transcription system with speaker diarization using OpenAI Whisper and PyAnnote. Features automatic audio cleaning, precise timestamps, multiple output formats (JSON/TXT/Mark...

Python 3
3 个月前
loading...
Website
Wikipedia