An Open Source text-to-speech system built by inverting Whisper.
WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide seamless conversations with an AI.
Faster Whisper transcription with CTranslate2
Whisper realtime streaming for long speech-to-text transcription and translation
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Whisper is a general-purpose speech recognition model
Zero-Shot Speech Editing and Text-to-Speech in the Wild
Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. D...
Real-time transcription with OpenAI Whisper.
Open-Sora: a fully open-source, efficient reproduction of a Sora-like video generation solution
The #1 open-source voice interface for desktop, mobile, and ESP32 chips.
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
This project aims to reproduce Sora (OpenAI's T2V model). We hope the open-source community will contribute to this project.
A fundamental end-to-end speech recognition toolkit with open-source SOTA pretrained models, supporting speech recognition, voice activity detection, text post-processing, etc.
fabric is an open-source framework for augmenting humans using AI. It provides a modular framework for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere.