This project is a digital human that can talk and listen to you. It uses OpenAI's GPT to generate responses, OpenAI's Whisper to transcript the audio, Eleven Labs to generate voice and Rhubarb Lip Sync to generate the lip sync.
2024-01-11
否
2024-09-07T13:20:14Z
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
#大语言模型#Educational voice conversation partner using Chat-GPT, Whisper, and AWS Polly.
#安卓#🤖 Your Personalised AI Chat Companion With 50+ Avatars Over 10+ Categories - Powered by OpenAI's GPT-3 / ChatGPT-3.5 Turbo / GPT-4, Google's Bard-AI, Pygmalion's 6B & StabilityAI's Stable-Diffusion m...
Using a single image and just 10 seconds of sample audio, our project enables you to create a video where it appears as if you're speaking the desired text.
Wav2Lip UHQ extension for Automatic1111
The API server version of the SadTalker project. Runs in Docker, 10 times faster than the original!
This Unreal Engine sample project demonstrates how to bring Epic Games' MetaHuman digital characters to life using the Amazon Polly text-to-speech service from AWS. Use this project as a starting poin...
The fastest way to deploy a restful api's with Gin Framework with a structured project that defaults to PostgreSQL database and JWT authentication middleware stored in Redis
这是一个就GPT的语音对话助手APP
Cloned repository from Hugging Face Spaces (CVPR 2022 Demo)
Devika is now Opcode
#大语言模型#A programming framework for agentic AI
This is the official source for our ACM MM 2023 paper "SelfTalk: A Self-Supervised Commutative Training Diagram to Comprehend 3D Talking Faces""
Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
0 条讨论