diarization · GitHub Topics

Synchronized Translation for Videos. Video dubbing

audio-processing diarization translation translate-audio translate-video video-dubbing asr automatic-dubbing document-translator dubbing speech-to-text stt text-to-speech tts

Python 1.19 k

1 个月前

transcriptionstream / transcriptionstream

#大语言模型#turnkey self-hosted offline transcription and diarization service with llm summary

自动化 diarization 大语言模型 speaker-diarization speech-recognition transcription Whisper ollama mistral-7b whisperx

Python 874

10 个月前

microsoft / UniSpeech

UniSpeech - Large Scale Self-Supervised Learning for Speech

PyTorch speech-recognition speech-processing speech diarization speech-separation speaker-verification

Python 467

1 年前

revdotcom / reverb

Open source inference code for Rev's model

speech-recognition speech-to-text asr canary Docker Whisper Open Source speechrecognition diarization huggingface speaker-diarization 深度学习神经网络

Python 415

3 个月前

cvqluu / simple_diarizer

Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code

speech-to-text transcription diarization asr colab-notebook speaker-diarization

Python 149

1 年前

taresh18 / TTSizer

🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨

diarization speech-recognition text-to-speech audio automatic-speech-recognition

Python 97

2 个月前

desh2608 / dover-lap

Python package for combining diarization system outputs.

diarization

Python 88

2 年前

bunyaminergen / Callytics

#大语言模型#Callytics is an advanced call analytics solution that leverages speech recognition and large language models (LLMs) technologies to analyze phone conversations from customer service and call centers.

diarization llama3 大语言模型 openai Open Source speech-processing speech-recognition speech-to-text voice-activity-detection voice-recognition denoising sentiment-analysis summary topic-modeling transcription

Python 71

4 个月前

wq2012 / SimpleDER

#计算机科学#A lightweight library to compute Diarization Error Rate (DER).

speaker-diarization 监控 speech-processing speech-recognition diarization 机器学习

Python 60

2 年前

JSchmie / ScrAIbe

Tool for automatic transcription and speaker diarization based on whisper and pyannote.

diarization speech-to-text transcription

Python 52

6 个月前

Picovoice / falcon

#计算机科学#On-device speaker diarization powered by deep learning

speaker-diarization 深度学习 diarization on-device speaker-recognition

Python 52

17 天前

cvqluu / nn-similarity-diarization

Neural network based similarity scoring for diarization (pytorch implementation of "LSTM based Similarity Measurement with Spectral Clustering for Speaker Diarization")

PyTorch diarization 神经网络 speech similarity kaldi lstm speaker-recognition speaker-diarization

Python 44

5 年前

jeanjerome / EchoInStone

EchoInStone is an audio processing tool that transcribes, diarizes, and aligns speaker segments from audio files, prioritizing accuracy and reliability.

alignment diarization localhost Python transcribe Whisper

Python 30

18 天前

chimechallenge / chime-utils

Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.

diarization speech-processing speech-recognition speech-separation automatic-speech-recognition speech-enhancement

Python 23

5 个月前

shahruk10 / kaldi-tflite

Convert kaldi feature extraction and nnet3 models into Tensorflow Lite models. Currently aimed at converting kaldi's x-vector models and diarization pipelines to tensorflow models.

Tensorflow kaldi speech tflite diarization

Python 20

3 年前