GitHub Chinese Community

#qwen2-vl

modelscope / ms-swift

#Large Language Models# Use PEFT or full-parameter training to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, InternVL3, Llava, GLM4v, ...

large language models, lora, llama, sft, deploy, multimodal, peft, internvl, liger, qwen2-vl, rft, deepseek-r1, embedding, grpo, open-r1, megatron, omni, llama4, qwen3, qwen3-moe
Python 8.09 k
2 days ago

langmanus / langmanus

#Large Language Models# A community-driven AI automation framework that builds upon the incredible work of the open source community. Our goal is to combine language models with specialized tools for tasks like web search, c...

agi, automation, deep-research, langchain, langgraph, large language models, qwen, qwen2-vl, agent, agents, artificial intelligence, multi-agent, multi-agent-systems, deepseek, deepseek-r1
Python 5.13 k
3 months ago

roboflow / maestro

Streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL

captioning, fine-tuning, florence-2, multimodal, objectdetection, paligemma, phi-3-vision, transformers, vision-and-language, vqa, qwen2-vl
Python 2.57 k
6 days ago

2U1 / Qwen2-VL-Finetune

An open-source implementation for fine-tuning the Qwen2-VL and Qwen2.5-VL series by Alibaba Cloud.

chatbot, multimodal, qwen2-vl, vision-language, vision-language-model, qwen2-5
Python 824
4 days ago

PaddlePaddle / PaddleMIX

Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high per...

aigc, stable-diffusion, clip, image-to-text, text-to-image, controlnet, multimodal, text-to-video, dit, llava, sora, qwen2-vl, minicpm-v
Python 653
5 days ago

NetEase-Media / grps_trtllm

#Large Language Models# Higher performance OpenAI LLM service than vLLM serve: A pure C++ high-performance OpenAI LLM service implemented with GRPS+TensorRT-LLM+Tokenizers.cpp, supporting chat and function call, AI agents, d...

large language models, openai, tensorrt-llm, chatglm, llama3, qwen2, function-call, ai-agent, llama-index, multi-modal, deepseek-r1, phi, qwq, qwen2-vl, minicpm-v, internvl, qwen3
Python 140
1 month ago

lucasjinreal / Crane

A pure Rust-based inference engine for LLMs (and any LLM-based MLLM, such as Spark-TTS), powered by the Candle framework.

llama-cpp, mllm, qwen2-vl, Rust, qwen3
Rust 123
3 months ago

drive-bench / toolkit

#Large Language Models# Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric Perspectives

autonomous-driving, ChatGPT, internvl, qwen2-vl
Python 81
4 months ago

arcstep / illufly

#Large Language Models# ✨🦋 illufly ("幻蝶"): a self-evolving agent built on memory distillation and document retrieval

agent, artificial intelligence, glm-4, gpt, large language models, multiagent, openai, qwen, qwen2, qwen2-vl, rag, growth
Python 66
11 days ago

soulteary / dify-with-qwen-vl

Video understanding: the Qwen multimodal video model & Dify

dify, qwen2, qwen2-vl
Python 59
9 months ago

fireicewolf / wd-llm-caption-cli

A Python-based CLI tool for captioning images with WD series, Joy-caption-pre-alpha, Meta Llama 3.2 Vision Instruct, and Qwen2 VL Instruct models.

qwen2-vl, florence-2
Python 37
3 months ago

see2023 / autoXHS

#Web Crawlers# An intelligent search assistant based on multimodal large models, enabling smart information retrieval and knowledge integration on the Xiaohongshu platform.

large language models, qwen2-vl, Selenium, xiaohongshu, spider
Python 19
7 months ago

col14m / cadrille

#Large Language Models# cadrille: Multi-modal CAD Reconstruction with Online Reinforcement Learning

cad, large language models, PyTorch, qwen2-vl, vlm
Python 19
4 days ago

shaadclt / Qwen2-VL-OCR-VQA

This project demonstrates how to use the Qwen2-VL model from Hugging Face for Optical Character Recognition (OCR) and Visual Question Answering (VQA). The model combines vision and language capabiliti...

optical-character-recognition, qwen2-vl, visual-question-answering
Jupyter Notebook 15
8 months ago

BUAADreamer / Qwen2-VL-History

A LLaMA-Factory fine-tuning case study for Qwen2-VL in the culture and tourism domain (historical literature and museums)

history, mllm, multimodal-large-language-models, qwen2-vl
10
9 months ago

Younis-Ahmed / qwen-ai-provider

Community-built Qwen AI Provider for Vercel AI SDK - Integrate Alibaba Cloud's Qwen models with Vercel's AI application framework

artificial intelligence, vercel-ai, vercel-ai-sdk, qwen, qwen2-5, qwen2-vl, generative-ai, Vercel, alibaba-cloud, language-model
TypeScript 10
4 days ago

ZachcZhang / Qwen2-VL-inference

An open-source server implementation for running inference with Qwen2-VL series models using FastAPI.

FastAPI, huggingface, inference, mllm, qwen2-vl
Python 9
7 months ago

Valdanitooooo / chat_with_qwen2_vl_test

qwen2-vl
Python 8
6 months ago

Kazuhito00 / Qwen2-VL-Colaboratory-Sample

A sample for trying out QwenLM/Qwen2-VL on Colaboratory

colaboratory, Python, qwen2-vl, vlm
Jupyter Notebook 7
9 months ago

zhangguanghao523 / CMMCoT

Official implementation of CMMCoT: Enhancing Complex Multi-Image Comprehension via Multi-Modal Chain-of-Thought and Memory Augmentation

chain-of-thought, cot, mllm, qwen2-vl
Python 5
2 months ago