pyannote

Instantly generate AI-powered subtitles on your device. Works standalone or connects to DaVinci Resolve.

人工智能 davinci-resolve subtitles subtitles-generator davinci openai Whisper speech-to-text transcribe pyannote speaker Linux macOS Windows

TypeScript 1.54 k

22 天前

kaixxx / noScribe

#面试#Cutting edge AI technology for automated audio transcription. A nice GUI for OpenAIs Whisper and pyannote (speaker identification)

audio-transcription 面试 pyannote qualitative-research transcription whisper-cpp

Python 1.4 k

3 天前

yinruiqing / pyannote-whisper

#大语言模型#

asr speaker-diarization Whisper pyannote ChatGPT

Python 629

6 天前

revdotcom / reverb

Open source inference code for Rev's model

speech-recognition speech-to-text asr canary Docker Whisper Open Source speechrecognition diarization pyannote huggingface speaker-diarization 深度学习神经网络

Python 429

5 个月前

narcotic-sh / senko

Very fast, accurate speaker diarization

diarization pyannote rapids silero-vad speaker-diarization

Python 123

2 天前

FrenchKrab / IS2023-powerset-diarization

Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.

pyannote speaker-diarization

Jupyter Notebook 88

2 年前

clement-pages / gryannote

Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.

annotation-processing annotation-tool audio gradio pyannote speaker-diarization speech-processing

Svelte 68

19 天前

nttcslab-sp / mamba-diarization

Official repository for Mamba-based Segmentation Model for Speaker Diarization

pyannote speaker-diarization state-space-models

Python 41

5 个月前

jeanjerome / EchoInStone

EchoInStone is an audio processing tool that transcribes, diarizes, and aligns speaker segments from audio files, prioritizing accuracy and reliability.

alignment diarization localhost pyannote Python transcribe Whisper

Python 35

2 个月前

pulijon / Sttcast

Transcription from mp3 files to html with or without embedded player

Ansible 自动化 Infrastructure as code Puppet Python Terraform transcription Vagrant Whisper aws-ec2 aws-s3 gpu diarization whisperx 人工智能 openai-api rag pyannote

Jupyter Notebook 20

25 天前

FrenchKrab / datasets-pyannote

Automatically setup the AISHELL-4 and MSDWild dataset for usage with pyannote-database (and pyannote-audio)

dataset pyannote

Shell 12

3 个月前

jumtra / agenda_maker

#大语言模型#A package that can be locally executed to generate minutes in Japanese

agenda japanese-language 大语言模型 minutes pyannote transcription Whisper

Python 10

2 年前

CrispStrobe / Susurrus

speech to text gui for different Whisper models and backends, including whisper.cpp, mlx-whisper, faster-whisper, ctranslate2; applies pyannote for diarization

ctranslate2 diarization pyannote speech-to-text stt Whisper whisper-ai whisper-cpp

Python 9

2 个月前

gorkemkaramolla / whisper-run

Faster Whisper with Speaker Diarization

faster-whisper Whisper openai pyannote speaker-diarization speech-recognition transcription

Python 8

1 年前

jarvisx17 / ASR

ASR (Automatic Speech Recognition) Notebooks

asr nemo pyannote speakers Whisper whisperx

Jupyter Notebook 7

2 年前

austinwmille / orca

#大语言模型#you feed in a video; it outputs context contained clips resized to 9:16, keeping speaker in center

diarization nltk pyannote whisperx 大语言模型 huggingface

Python 7

2 个月前

gillan-krishna / meeting_notes

#自然语言处理#Hobby project to transcribe audio files from meetings to transcripts with a summary

audio 深度学习 hobby-project 自然语言处理 pyannote speech-recognition Whisper

Python 4

3 年前

dptools / WhisperNote

Subtitle generation w/ Speaker Diarization using Whisper and pyannote.audio

pyannote speaker-diarization subtitles Whisper

Python 4

1 年前

adamelkholyy / whisper-yt

Toolkit for using Whisper to transcribe YouTube videos. Includes Whisper transcription of YouTube videos, conversion of YouTube video into HuggingFace dataset (using audio and subtitles) and evaluatio...

asr diarization pyannote transcription Whisper YouTube

Python 3

1 年前

TheSeraphim / scribe-forge-ai

#自然语言处理#🎵 Complete offline audio transcription system with speaker diarization using OpenAI Whisper and PyAnnote. Features automatic audio cleaning, precise timestamps, multiple output formats (JSON/TXT/Mark...

audio-analysis audio-processing audio-transcription diarization FFmpeg huggingface 机器学习 multi-speaker 自然语言处理 openai-whisper pyannote Python speaker-diarization speech-recognition speech-to-text Whisper

Python 3

3 个月前

Website
Wikipedia