GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

gpt-4v

Website
Wikipedia
https://static.github-zh.com/github_avatars/OpenGVLab?size=40
OpenGVLab / InternVL

#大语言模型#[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

image-classificationimage-text-retrieval大语言模型semantic-segmentationvideo-classificationvision-language-modelvit-22bvit-6bmulti-modalgptgpt-4vgpt-4o
Python 8.33 k
17 天前
https://static.github-zh.com/github_avatars/open-compass?size=40
open-compass / VLMEvalKit

#大语言模型#Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks

gpt-4vlarge-language-modelsllavamulti-modalopenaivqa大语言模型openai-apiqwengpt机器视觉PyTorchgpt4ChatGPTclipvitevaluationclaudegemini
Python 2.52 k
2 天前
https://static.github-zh.com/github_avatars/ShareGPT4Omni?size=40
ShareGPT4Omni / ShareGPT4Video

#大语言模型#[NeurIPS 2024] An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions

ChatGPTgptgpt-4vlarge-language-modelslarge-multimodal-modelslarge-vision-language-modelssoratext-to-video
Python 1.06 k
8 个月前
https://static.github-zh.com/github_avatars/RLHF-V?size=40
RLHF-V / RLAIF-V

[CVPR'25 highlight] RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V Trustworthiness

聊天机器人gpt-4vmultimodalllavaminicpm-v
Python 374
1 个月前
https://static.github-zh.com/github_avatars/tianyi-lab?size=40
tianyi-lab / HallusionBench

#大语言模型#[CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models

benchmarkvlmsgpt-4gpt-4vllavabenchmarkshallucination大语言模型lmmlarge-language-modelslarge-vision-language-models
Python 284
7 个月前
https://static.github-zh.com/github_avatars/ShareGPT4Omni?size=40
ShareGPT4Omni / ShareGPT4V

#大语言模型#[ECCV 2024] ShareGPT4V: Improving Large Multi-modal Models with Better Captions

ChatGPTgptgpt-4vgpt4vinstruction-tuninglanguage-modellarge-language-modelslarge-multimodal-modelslarge-vision-language-modelsvision-language-modeleccv2024
Python 221
1 年前
https://static.github-zh.com/github_avatars/davideuler?size=40
davideuler / awesome-assistant-api

#大语言模型#Try openai assistant api apps on Google Colab for free. Awesome assistant API Demos!

assistantChatGPTdalle-3function-callinggpt-4-turbogpt-4vassistant-apiExample
Jupyter Notebook 213
1 年前
https://static.github-zh.com/github_avatars/yachty66?size=40
yachty66 / gpt_pdf_md

🚀 gpt_pdf_md: Convert PDF to Markdown with GPT-4V & more. Extract images, upload to Google Cloud, & generate Markdown with images. Python, GPT-4V Vision, Scala. Ideal for developers, researchers. PDF...

人工智能gpt-4vMarkdownpdfPython
Scala 80
2 年前
https://static.github-zh.com/github_avatars/jiayuww?size=40
jiayuww / SpatialEval

#计算机科学#[NeurIPS'24] SpatialEval: a benchmark to evaluate spatial reasoning abilities of MLLMs and LLMs

large-language-models机器学习multimodal-deep-learningreasoningclaudefoundation-modelsgeminigpt-4ogpt-4vllama3
Python 39
5 个月前
https://static.github-zh.com/github_avatars/jameszhou-gl?size=40
jameszhou-gl / gpt-4v-distribution-shift

Code for ICLR'24 workshop ME-FoMo-How Well Does GPT-4V(ision) Adapt to Distribution Shifts? A Preliminary Investigation

人工智能gpt-4vopenaiPythonclipllavageneralizationrobustness
Jupyter Notebook 36
8 个月前
https://static.github-zh.com/github_avatars/taogoddd?size=40
taogoddd / GPT-4V-API

Self-hosted GPT-4V api

gpt-4gpt-4-apigpt-4v
JavaScript 29
2 年前
https://static.github-zh.com/github_avatars/autodistill?size=40
autodistill / autodistill-gpt-4v

GPT-4V(ision) module for use with Autodistill.

机器视觉gpt-4gpt-4vobject-detection
Python 26
1 年前
https://static.github-zh.com/github_avatars/logicalroot?size=40
logicalroot / gpt-4v-demos

🤖 GPT-4V Demos • Test the model's vision capabilities in your browser using Streamlit • Easy setup

gpt-4openaiPythonStreamlitgpt-4vgpt4gpt4v
Python 18
2 年前
https://static.github-zh.com/github_avatars/android-com-pl?size=40
android-com-pl / wp-ai-alt-generator

WordPress plugin that leverages OpenAI's Vision API to automatically generate descriptive alt text for images, enhancing accessibility and SEO.

gpt-4gpt-4vopenaiwordpress-pluginWordPress人工智能HacktoberfestPHP插件
TypeScript 16
12 天前
https://static.github-zh.com/github_avatars/ShareGPT4Omni?size=40
ShareGPT4Omni / ShareGPT4Omni

#大语言模型#ShareGPT4Omni: Towards Building Omni Large Multi-modal Models with Comprehensive Multi-modal Annotations

ChatGPTgptgpt-4ogpt-4vlarge-multimodal-modelslarge-vision-language-models
8
1 年前
https://static.github-zh.com/github_avatars/aymenfurter?size=40
aymenfurter / copilot-insurance-claim-demo

How a Picture of Car Damage Can File Your Insurance Claim

azure-openaigpt-4-visiongpt-4vJavasemantic-kernel
Java 7
1 年前
https://static.github-zh.com/github_avatars/afonso07?size=40
afonso07 / ruskin

Your own personal Ruskin.

elevenlabsFastAPIgptgpt4NextopenaiReactvisiongpt4visiongpt-4v
TypeScript 6
2 年前
https://static.github-zh.com/github_avatars/ndurner?size=40
ndurner / oai_chat

#大语言模型#Multi-modal Chatbot based on OpenAI

chat聊天机器人gpt-4gpt-4vopenaigradiogradio-interface大语言模型llm-inferencevision-language-modelvlm
Python 4
2 个月前
https://static.github-zh.com/github_avatars/gutbash?size=40
gutbash / lmm-graph-vision

How well do the GPT-4V, Gemini Pro Vision, and Claude 3 Opus models perform zero-shot vision tasks on data structures?

数据结构gpt-4-vision-previewgpt-4vopenaivisual-question-answeringvqagemini-pro-visionclaude-3
Python 3
1 年前
https://static.github-zh.com/github_avatars/aymenfurter?size=40
aymenfurter / azure-chat-with-your-photos-demo

Chatbot that comprehends uploaded images and engages in detailed conversations about their content.

gpt-4-visiongpt-4vgpt4openai
Bicep 3
1 年前
loading...