GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

llava

Website
Wikipedia
https://static.github-zh.com/github_avatars/ollama?size=40
ollama / ollama

#大语言模型#本地化搭建和运行 Llama2 和其他大模型

llama大语言模型llama2Goollamamistralgemmallama3llavaphi3gemma2phi4deepseekgemma3qwen
Go 143.72 k
16 小时前
https://static.github-zh.com/github_avatars/haotian-liu?size=40
haotian-liu / LLaVA

#大语言模型#LLaVA是一个具有 GPT-4V 级别功能的大语言和视觉模型助手

gpt-4聊天机器人ChatGPTllamamultimodalllavafoundation-modelsinstruction-tuningmulti-modalityvisual-language-learningllama-2llama2vision-language-model
Python 22.78 k
10 个月前
https://static.github-zh.com/github_avatars/sgl-project?size=40
sgl-project / sglang

#大语言模型#SGLang is a fast serving framework for large language models and vision language models.

CUDAinferencellamallava大语言模型llm-servingmoePyTorchtransformervlmllama3llama3-1deepseekdeepseek-llmdeepseek-v3deepseek-r1deepseek-r1-zeroqwen3llama4
Python 15.14 k
6 小时前
https://static.github-zh.com/github_avatars/Fanghua-Yu?size=40
Fanghua-Yu / SUPIR

#计算机科学#SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.

深度学习diffusion-modelsllavasdxlstable-diffusionsuper-resolutionrestorationPyTorchpytorch-lightning
Python 5.1 k
1 个月前
https://static.github-zh.com/github_avatars/InternLM?size=40
InternLM / xtuner

#大语言模型#An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

baichuanchatglm2internlmlarge-language-modelsllama2大语言模型llm-trainingpeftqwen聊天机器人conversational-aiagentchatglm3llavamixtralllama3phi3
Python 4.59 k
17 天前
https://static.github-zh.com/github_avatars/yuanzhoulvpi2017?size=40
yuanzhoulvpi2017 / zero_nlp

#自然语言处理#中文nlp解决方案(大模型、数据、模型、训练、推理)

bert自然语言处理transformersgpt2chatglm-6bclipgptPyTorchtext-generationhuggingface-transformersllama2llamallava
Jupyter Notebook 3.51 k
6 天前
https://static.github-zh.com/github_avatars/SciSharp?size=40
SciSharp / LLamaSharp

#大语言模型#A C#/.NET library to run LLM (🦙LLaMA/LLaVA) on your local device efficiently.

聊天机器人gptllamallamacpp大语言模型semantic-kernelllavamulti-modalllama2llama3llama-cpp
C# 3.23 k
1 天前
https://static.github-zh.com/github_avatars/open-compass?size=40
open-compass / VLMEvalKit

#大语言模型#Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks

gpt-4vlarge-language-modelsllavamulti-modalopenaivqa大语言模型openai-apiqwengpt机器视觉PyTorchgpt4ChatGPTclipvitevaluationclaudegemini
Python 2.52 k
2 天前
https://static.github-zh.com/github_avatars/om-ai-lab?size=40
om-ai-lab / OmAgent

#大语言模型#Build multimodal language agents for fast prototype and production

large-language-modelsmultimodal-agentvision-and-languageagentworkflow聊天机器人gpt4大语言模型multimodalragvlmgptgradiollamallavaopenaiPythongemini
Python 2.51 k
3 个月前
https://static.github-zh.com/github_avatars/chenking2020?size=40
chenking2020 / FindTheChatGPTer

#大语言模型#ChatGPT爆火,开启了通往AGI的关键一步,本项目旨在汇总那些ChatGPT的开源平替们,包括文本大模型、多模态大模型等,为大家提供一些便利

chatglmllamabellevicunaChatGPTalpacaguanacolorallavaminigpt4autogptagicevalbaichuanllama2
2.04 k
2 年前
https://static.github-zh.com/github_avatars/mbzuai-oryx?size=40
mbzuai-oryx / Video-ChatGPT

[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for ...

聊天机器人clipgpt-4llamallavavicunavision-languagevision-language-pretraining
Python 1.38 k
3 个月前
https://static.github-zh.com/github_avatars/Blaizzy?size=40
Blaizzy / mlx-vlm

#大语言模型#MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.

llava大语言模型MLXvision-transformerapple-siliconideficslocal-aipaligemmavision-frameworkvision-language-modelflorence2molmopixtral
Python 1.33 k
7 天前
unum-cloud/uform
https://static.github-zh.com/github_avatars/unum-cloud?size=40
unum-cloud / uform

#向量搜索引擎#Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️

huggingface-transformerslanguage-visionmultimodalPyTorchsemantic-searchtransformercross-attentionvector-searchbert神经网络pretrained-modelsmulti-lingualclipopenaicontrastive-learningrepresentation-learningclusteringimage-searchllava
Python 1.15 k
5 个月前
https://static.github-zh.com/github_avatars/jhc13?size=40
jhc13 / taggui

Tag manager and captioner for image datasets

image-captioningpyside6stable-diffusionllavacogvlmflorence-2
Python 1.02 k
1 个月前
https://static.github-zh.com/github_avatars/gokayfem?size=40
gokayfem / awesome-vlm-architectures

#Awesome#Famous Vision Language Models and Their Architectures

clipllavavlmmultimodalblipcogvlminternlmkosmosvision-language-modelAwesome Lists
Markdown 859
4 个月前
https://static.github-zh.com/github_avatars/mbzuai-oryx?size=40
mbzuai-oryx / LLaVA-pp

#大语言模型#🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)

conversationllama3llava大语言模型lmmsphi3vision-languagellama-3-llavallama-3-visionllama3-llavaphi-3-visionphi3-vision
Python 839
1 年前
https://static.github-zh.com/github_avatars/TinyLLaVA?size=40
TinyLLaVA / TinyLLaVA_Factory

#自然语言处理#A Framework of Small-scale Large Multimodal Models

large-multimodal-modelsllamallava自然语言处理transformersvision-language
Python 835
2 个月前
https://static.github-zh.com/github_avatars/NVlabs?size=40
NVlabs / EAGLE

#大语言模型#Eagle Family: Exploring Model Designs, Data Recipes and Training Strategies for Frontier-Class Multimodal LLMs

Demogpt4huggingfacellamallama3llavalmmmllm大语言模型large-language-models
Python 792
2 个月前
https://static.github-zh.com/github_avatars/PsyChip?size=40
PsyChip / machina

OpenCV+YOLO+LLAVA powered video surveillance system

camerallavaollama-apiOpenCVPythonrtspyolo
Python 761
5 天前
https://static.github-zh.com/github_avatars/PaddlePaddle?size=40
PaddlePaddle / PaddleMIX

Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high per...

aigcstable-diffusionclipimage-to-texttext-to-imagecontrolnetmultimodaltext-to-videoditllavasoraqwen2-vlminicpm-v
Python 653
5 天前
loading...