Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching
#计算机科学#[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
A simple VITS HTTP API, developed by extending Moegoe with additional features.
#大语言模型#A feature-rich portal to chat with GPT-4, Claude, Gemini, Mistral, & OpenAI Assistant APIs via a lightweight Node.js web app; supports customizable multimodality for voice, images, & files.
openai-whisper-talk is a sample voice conversation application powered by OpenAI technologies such as Whisper, Completions, Embeddings, and the latest Text-to-Speech. The application is built using Nu...
NoneBot DeepSeek 插件。接入 DeepSeek 模型,提供智能对话与问答功能
🌻 VITS ONNX TTS server designed for fast inference 🔥
Self-host the powerful Dia TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible), support for SafeTensors/BF16, voice cloning, dialogue generation, and...
AivisSpeech Engine: AI Voice Imitation System - Text to Speech Engine
#大语言模型#Open-Audio TTS: A robust web app leveraging OpenAI's powerful Text-to-Speech (TTS) models to generate natural-sounding audio from text. Built with modern web technologies for an intuitive user experie...
Streaming TTS based on Piper with optional RK3588 NPU support
An AI-powered chatbot integrated with Telegram, using OpenAI GPT-3.5 Turbo, language embeddings, and FAISS for similarity search to provide more contextually relevant responses to user queries
Simple Python script to interact with the TikTok TTS Voices.
A Non-Official ElevenLabs RESTful API Client for dotnet
not official API for Microsoft speech synthesis from Microsoft Edge web browser read aloud
#大语言模型#Twitch Streamer GPT is a NodeJS-based Twitch enhancement tool, offering interactive stream experiences with AI-powered automated responses, voice command activations, and advanced modules. It's easy t...
Text To Speech Multilingual Support (+20 Language)