GitHub Chinese Community

#multimodal

Mintplex-Labs / anything-llm

#Large Language Models# The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more.

rag, lmstudio, localai, vector-database, ollama, local-llm, llama3, large language models, ai-agents, multimodal, agent-framework-javascript, custom-ai-agents, deepseek, deepseek-r1, mcp, mcp-servers, llm-webui, no-code, qwen3
JavaScript 45.31 k
2 days ago
haotian-liu / LLaVA

#Large Language Models# LLaVA is a large language-and-vision assistant with GPT-4V-level capabilities.

gpt-4, chatbot, ChatGPT, llama, multimodal, llava, foundation-models, instruction-tuning, multi-modality, visual-language-learning, llama-2, llama2, vision-language-model
Python 22.78 k
10 months ago
jina-ai / serve

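#Computer Science# Jina is a deep-learning-based search framework that supports many data types, such as images, video, long text, and PDFs.

neural-search, cloud-native, deep learning, machine learning, framework, gRPC, Kubernetes, multimodal, mlops, pipeline, FastAPI, generative-ai, Docker, jaeger, llmops, OpenTelemetry, cncf, microservices, orchestration, prometheus
Python 21.61 k
3 months ago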
microsoft / unilm

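#Natural Language Processing# UniLM is a family of large-scale self-supervised pre-trained models spanning tasks, languages, and modalities.

natural language processing, pre-trained-model, unilm, minilm, layoutlm, layoutxlm, beit, document-ai, trocr, beit-3, foundation-models, xlm-e, deepnet, large language models, multimodal, mllm, kosmos, kosmos-1, textdiffuser, bitnet
Python 21.39 k
12 days ago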
deepseek-ai / Janus

#Large Language Models# Janus-Series: Unified Multimodal Understanding and Generation Models

any-to-any, foundation-models, large language models, multimodal, vision-language-pretraining, unified-model
Python 17.36 k
4 months ago
mediar-ai / screenpipe

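#Large Language Models# 24/7 AI screen and microphone recording. Build AI apps that have the full context. Works with Ollama. An alternative to Rewind.ai. Open. Secure. You own your data. Built in Rust.

artificial intelligence, computer vision, large language models, machine learning, multimodal, vision, agents, agi
TypeScript 15.02 k
10 days ago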
NVIDIA / NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

machine-translation, speaker-recognition, asr, tts, generative-ai, multimodal, deep learning, neural-networks, speaker-diariazation, speech-translation, speech-synthesis, large-language-models
Python 14.8 k
8 hours ago
rerun-io / rerun

Visualize streams of multimodal data. Free, fast, easy to use, and simple to integrate. Built in Rust.

visualization, computer vision, Python, Robotics, Rust, multimodal, C++
Rust 8.57 k
11 hours ago
modelscope / ms-swift

#Large Language Models# Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, InternVL3, Llava, GLM4v, ...

large language models, lora, llama, sft, deploy, multimodal, peft, internvl, liger, qwen2-vl, rft, deepseek-r1, embedding, grpo, open-r1, megatron, omni, llama4, qwen3, qwen3-moe
Python 8.09 k
1 day ago
bentoml / BentoML

#Large Language Models# The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!

model-serving, mlops, llmops, generative-ai, llm-inference, deep learning, llm-serving, machine learning, Python, multimodal, ml-engineering, large language models
Python 7.78 k
2 days ago
enricoros / big-AGI

#Large Language Models# AI suite powered by state-of-the-art models and providing advanced AI/AGI functions. It features AI personas, AGI functions, multi-model chats, text-to-image, voice, response streaming, code highlight...

ChatGPT, generative-ai, ui, chatgpt-ui, agi, large-language-models, stable-diffusion, gpt, gpt-4, openai, openai-api, anthropic, beam, gpt-5, multimodal, groq, mistral
TypeScript 6.47 k
12 hours ago
SkalskiP / courses

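#Natural Language Processing# This repository is a curated collection of links to various courses and resources about Artificial Intelligence (AI)

computer vision, deep learning, deep neural networks, machine learning, mlops, multimodal, transformers, tutorial, natural language processing, generative-model, stable-diffusion
Python 6.06 k
1 year ago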
swyxio / ai-notes

notes for software engineers getting up to speed on new AI developments. Serves as datastore for https://latent.space writing, and product brainstorming, but has cleaned up canonical references under ...

artificial intelligence, prompt-engineering, stable-diffusion, openai, gpt, gpt-3, multimodal
HTML 5.83 k
3 days ago
facebookresearch / mmf

#Computer Science# A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)

PyTorch, vqa, pretrained-models, multimodal, deep learning, captioning, dialog, textvqa, hateful-memes, multi-tasking
Python 5.57 k
2 months ago
om-ai-lab / VLM-R1

#Large Language Models# Solve Visual Understanding with Reinforced VLMs

deepseek-r1, grpo, large language models, multimodal, vlm, qwen, reinforcement-learning
Python 5.14 k
1 month ago
PySpur-Dev / pyspur

#Large Language Models# A visual playground for agentic workflows: Iterate over your agents 10x faster

agent, artificial intelligence, large language models, Python, workflow, deepseek, framework, gemini, graph, human-in-the-loop, loops, multimodal, ollama, rag, trace, builder, tool, reasoning, agents
TypeScript 5.07 k
1 month ago
kyegomez / swarms

#Large Language Models# The Enterprise-Grade Production-Ready Multi-Agent Orchestration Framework. Website: https://swarms.ai

artificial intelligence, attention-mechanism, gpt4, langchain, machine learning, multi-modal-imaging, multi-modality, multimodal, swarms, transformer-models, agents, prompt-engineering, prompt-toolkit, prompting, tree-of-thoughts, ChatGPT, gpt4all, huggingface, langchain-python
Python 4.93 k
21 hours ago
kyegomez / tree-of-thoughts

#Large Language Models# Plug-and-play implementation of Tree of Thoughts: Deliberate Problem Solving with Large Language Models, which elevates model reasoning by at least 70%

artificial intelligence, ChatGPT, gpt4, multimodal, prompt-engineering, deep learning, prompt, prompt-learning, prompt-tuning
Python 4.5 k
8 months ago
X-PLUG / MobileAgent

#Android# Mobile-Agent: The Powerful Mobile Device Operation Assistant Family

agent, gpt4v, mllm, mobile-agents, multimodal, multimodal-large-language-models, multimodal-agent, Android, App, GUI, mobile, automation, copilot, harmony, iOS
Python 4.33 k
9 days ago
luban-agi / Awesome-AIGC-Tutorials

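#Natural Language Processing# Curated tutorials and resources for Large Language Models, AI Painting, and more.

aigc, large language models, artificial intelligence, midjourney, stable-diffusion, deep learning, tutorial, courses-resource, prompt-engineering, natural language processing, Awesome Lists, ChatGPT, multimodal
4.22 k
1 year ago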