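podcastfy is an open-source Python alternative to Google NotebookLM's podcast feature. It uses generative AI to turn multimodal content such as web pages, PDFs, images, and YouTube videos into engaging audio conversations.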
#Large Language Model# Free, high-quality text-to-speech API endpoint to replace OpenAI, Azure, or ElevenLabs
Aura is like Siri, but in your browser. An AI voice assistant optimized for low latency responses.
#Large Language Model# Your own personal voice assistant: Voice to Text to LLM to Speech, displayed in a web interface
#Large Language Model# A conversational assistant equipped with synthetic voices, including J.A.R.V.I.S's. Powered by OpenAI and IBM Watson APIs and a Tacotron model for voice generation.
#Large Language Model# End-to-end platform for building voice-first multimodal agents
Conversational voice AI agents
This project is a digital human that can talk and listen to you. It uses OpenAI's GPT to generate responses, OpenAI's Whisper to transcribe the audio, Eleven Labs to generate voice and Rhubarb Lip Syn...
Low-latency AI companion voice chat in 60 lines of code using faster_whisper and ElevenLabs input streaming
#Large Language Model# Lets you have an engaging and safely emotive spoken or CLI conversation with ChatGPT / GPT-4, with the option to let it remember things discussed.
#Large Language Model# Local, OpenAI-compatible text-to-speech (TTS) API using Chatterbox, enabling users to generate voice-cloned speech anywhere the OpenAI API is used (e.g. Open WebUI, AnythingLLM, etc.)
Avatar Generation For Characters and Game Assets Using Deep Fakes
The AI Podcast Studio: generate podcast scripts and their audio versions with a team of AI workers in a Podcast Studio 🎙️📜
Provides unlimited ElevenLabs API calls.
Eleven Labs text-to-speech package for Node.js. You can use the official package at: https://www.npmjs.com/package/elevenlabs
React / Vanilla JS text-to-speech that highlights the words and sentences being spoken, using audio files, a text-to-speech API, and the Web Speech Synthesis API