eSpeak NG is an open source speech synthesizer that supports more than one hundred languages and accents.
2015-12-08
No
2025-05-28T22:22:03Z
Open-source release of the Grok-1 large model
Zero-Shot Speech Editing and Text-to-Speech in the Wild
#Large Language Models# Generate high-definition short videos with one click using large AI models.
Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. D...
#Large Language Models# 🙌 OpenHands: Code Less, Make More
Open-Sora: a fully open-source, efficient reproduction of Sora-like video generation
#Computer Science# 🐸💬 - a deep learning library for text-to-speech (TTS) synthesis
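Puter is an OS that runs in the browser. It is feature-rich, fast, and highly extensible. It can be used to build remote desktop environments, and it can also serve as an interface for cloud storage services, remote servers, web hosting platforms, and more.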
#Large Language Models# Set up and run Llama2 and other large models locally
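Pingora is a Rust framework for building fast, reliable, and programmable networked systems. Pingora is battle-tested: it has handled more than 40 million Internet requests per second.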
A flexible distributed key-value database that is optimized for caching and other realtime workloads.
whisper is a general-purpose speech recognition model
Instant voice cloning by MIT and MyShell. Audio foundation model.
Upload a screenshot and generate HTML/Tailwind/JavaScript code with GPT
Jan is an open source alternative to ChatGPT that runs 100% offline on your computer
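Crawlers for Xiaohongshu notes and comments, Douyin videos and comments, Kuaishou videos and comments, Bilibili videos and comments, Weibo posts and comments, Baidu Tieba posts and comment replies, and Zhihu Q&A articles and comments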
A fast, local neural text to speech system
Open-Source Form Builder
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
[WIP] Layer Diffusion for WebUI (via Forge)