#

multimodal

Mintplex-Labs/anything-llm
https://static.github-zh.com/github_avatars/Mintplex-Labs?size=40
JavaScript 49.15 k
1 小时前
https://static.github-zh.com/github_avatars/bytedance?size=40

Agent TARS 是一个通用的多模态 AI Agent Stack,它将 GUI Agent 和 Vision 的强大功能带入你的终端、计算机、浏览器和产品中。UI-TARS Desktop 是一个桌面应用程序,基于 UI-TARS 模型提供原生的 GUI Agent。

TypeScript 18.83 k
2 小时前
https://static.github-zh.com/github_avatars/deepseek-ai?size=40
Python 17.55 k
8 个月前
https://static.github-zh.com/github_avatars/NVIDIA-NeMo?size=40

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 15.72 k
5 小时前
https://static.github-zh.com/github_avatars/mediar-ai?size=40

#大语言模型#全天候24小时 AI 屏幕和麦克风录制。构建具有完整上下文的 AI 应用。与 Ollama 配合使用。Rewind.ai 的替代品。开放。安全。您拥有自己的数据。Rust 开发。

TypeScript 15.62 k
16 天前
https://static.github-zh.com/github_avatars/modelscope?size=40

#大语言模型#Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, P...

Python 9.94 k
14 小时前
rerun-io/rerun
https://static.github-zh.com/github_avatars/rerun-io?size=40

Visualize streams of multimodal data. Free, fast, easy to use, and simple to integrate. Built in Rust.

Rust 9.26 k
10 小时前
https://static.github-zh.com/github_avatars/apache?size=40

#大语言模型#SeaTunnel (原名为 waterdrop)是一个易用的支持海量数据实时同步的高性能分布式数据集成平台,每天可以稳定同步数百亿数据

Java 8.79 k
2 天前
bentoml/BentoML
https://static.github-zh.com/github_avatars/bentoml?size=40
Python 8.09 k
20 小时前
enricoros/big-AGI
https://static.github-zh.com/github_avatars/enricoros?size=40

#大语言模型#AI suite powered by state-of-the-art models and providing advanced AI/AGI functions. It features AI personas, AGI functions, multi-model chats, text-to-image, voice, response streaming, code highlight...

TypeScript 6.62 k
10 小时前
https://static.github-zh.com/github_avatars/SkalskiP?size=40
Python 6.18 k
1 年前
https://static.github-zh.com/github_avatars/swyxio?size=40

notes for software engineers getting up to speed on new AI developments. Serves as datastore for https://latent.space writing, and product brainstorming, but has cleaned up canonical references under ...

HTML 6.05 k
3 天前
https://static.github-zh.com/github_avatars/facebookresearch?size=40

#计算机科学#A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)

Python 5.59 k
5 个月前
https://static.github-zh.com/github_avatars/om-ai-lab?size=40
Python 5.54 k
20 天前
loading...
Website
Wikipedia