GitHub Chinese Community

multi-modal

OpenBMB / MiniCPM-o

MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone

minicpm, minicpm-v, multi-modal
Python 19.62 k
3 days ago
activeloopai / deeplake

#Data warehouse# Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop....

datasets, deep learning, machine learning, data science, PyTorch, Tensorflow, Python, artificial intelligence, mlops, computer vision, cv, image processing, datalake, langchain, large language models, large-language-models, vector-database, vector-search, multi-modal
Python 8.66 k
5 days ago
OpenGVLab / InternVL

#Large language models# [CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance.

image-classification, image-text-retrieval, large language models, semantic-segmentation, video-classification, vision-language-model, vit-22b, vit-6b, multi-modal, gpt, gpt-4v, gpt-4o
Python 8.33 k
17 days ago
modelscope / modelscope

#Natural language processing# ModelScope: bring the notion of Model-as-a-Service to life.

natural language processing, cv, speech, multi-modal, science, deep learning, machine learning, Python
Python 7.98 k
3 days ago
modelscope / agentscope

#Large language models# Start building LLM-empowered multi-agent applications in an easier way.

agent, chatbot, gpt-4, large-language-models, large language models, llm-agent, multi-agent, distributed-agents, multi-modal, llama3, gpt-4o, drag-and-drop, mcp
Python 7.49 k
4 days ago
THUDM / CogVLM

A state-of-the-art open visual language model | multimodal pretrained model

cross-modality, language-model, multi-modal, pretrained-models, visual-language-models
Python 6.58 k
1 year ago
TEN-framework / ten-framework

Open-source framework for all AI agents.

artificial intelligence, multi-modal, real-time, Video, voice
C 6.15 k
3 days ago
lucidrains / DALLE-pytorch

#Computer science# Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in PyTorch

artificial intelligence, deep learning, attention-mechanism, text-to-image, transformers, multi-modal
Python 5.62 k
1 year ago
OFA-Sys / Chinese-CLIP

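#Natural language processing# This project is a Chinese version of the CLIP model, trained on large-scale Chinese data (~200 million image-text pairs). It aims to help users quickly carry out tasks in the Chinese domain such as image-text feature and similarity computation, cross-modal retrieval, and zero-shot image classification.

Chinese, computer vision, multi-modal-learning, natural language processing, PyTorch, vision-and-language-pre-training, image-text-retrieval, clip, pretrained-models, vision-language, deep learning, multi-modal, contrastive-loss, transformers, coreml-models
Python 5.28 k
10 months ago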
valhalla / valhalla

Open Source Routing Engine for OpenStreetMap

OpenStreetMap, dijkstra, astar, tiled, directions, isochrones, multi-modal, traveling-salesman, routing-engine, Routing (disambiguation)
C++ 4.9 k
15 hours ago
marqo-ai / marqo

#Search# Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai

deep learning, information-retrieval, machine learning, vector-search, tensor-search, clip, multi-modal, search engine, transformers, vision-language, semantic-search, visual-search, natural language processing, hnsw, knn, Hacktoberfest, ChatGPT, gpt, large-language-models
Python 4.89 k
2 hours ago
modelscope / data-juicer

#Large language models# Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

data analysis, data science, large-language-models, large language models, data visualization, instruction-tuning, pre-training, multi-modal, synthetic-data, data, data-pipeline, data-processing, foundation-models
Python 4.58 k
2 days ago
FareedKhan-dev / all-rag-techniques

#Large language models# Implementation of all RAG techniques in a simpler way

artificial intelligence, large language models, openai, Python, rag, multi-modal
Jupyter Notebook 4.21 k
3 days ago
THUDM / VisualGLM-6B

Chinese and English multimodal conversational language model

chatglm-6b, gpt, multi-modal
Python 4.16 k
10 months ago
VectorSpaceLab / OmniGen

OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340

diffusion, Image, image-generation, multi-modal, image-edit
Jupyter Notebook 4.12 k
4 months ago
zjunlp / DeepKE

#Natural language processing# [EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Construction

knowledge-graph, relation-extraction, Chinese, named-entity-recognition, attribute-extraction, low-resource, document-level, information-extraction, PyTorch, deepke, ner, natural language processing, few-shot, prompt, deep learning, multi-modal
Python 3.95 k
2 months ago
PKU-YuanGroup / Video-LLaVA

【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

instruction-tuning, large-vision-language-model, multi-modal
Python 3.27 k
6 months ago
SciSharp / LLamaSharp

#Large language models# A C#/.NET library to run LLM (🦙LLaMA/LLaVA) on your local device efficiently.

chatbot, gpt, llama, llamacpp, large language models, semantic-kernel, llava, multi-modal, llama2, llama3, llama-cpp
C# 3.23 k
20 hours ago
docarray / docarray

#Computer science# Represent, send, store and search multimodal data

docarray, data structures, multimodal, cross-modal, neural-search, deep learning, nested-data, qdrant, weaviate, nearest-neighbor-search, protobuf, elasticsearch, multi-modal, semantic-search, machine learning, PyTorch, FastAPI, pydantic
Python 3.07 k
9 days ago
open-compass / VLMEvalKit

#Large language models# Open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks

gpt-4v, large-language-models, llava, multi-modal, openai, vqa, large language models, openai-api, qwen, gpt, computer vision, PyTorch, gpt4, ChatGPT, clip, vit, evaluation, claude, gemini
Python 2.52 k
2 days ago