LLaVA是一个具有 GPT-4V 级别功能的大语言和视觉模型助手

关于

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

llava.hliu.cc

gpt-4 聊天机器人 ChatGPT llama multimodal llava foundation-models instruction-tuning multi-modality visual-language-learning llama-2 llama2 vision-language-model

创建时间

2023-04-17

是否国产

否

语言

Python85.7%
Shell8.5%
JavaScript2.8%
HTML2.1%
CSS0.5%
Dockerfile0.4%

该仓库已收录但尚未编辑。项目介绍及使用教程请前往 GitHub 阅读 README

0 条讨论

登录后发表评论

haotian-liu 的其他开源项目

yolact_edge

@WisconsinAIVision

The first competitive instance segmentation approach that runs on small edge devices at real-time speeds.

翻译 - 第一种竞争性实例分割方法可在小型边缘设备上以实时速度运行。

realtime real-time instance-segmentation yolactedge PyTorch

Python1.3 k

2 年前

transformers_llava

@haotian-liu

Python13

2 年前

您可能感兴趣的

grok-1

@xai-org

大模型Grok-1开源

Python50.28 k

9 个月前

Open-Sora

@hpcaitech

Open-Sora：完全开源的高效复现类Sora视频生成方案

Python26.49 k

23 天前

ollama

@ollama

#大语言模型#本地化搭建和运行 Llama2 和其他大模型

llama 大语言模型 llama2 llms Go

Go141.38 k

4 小时前

Open-Sora-Plan

@PKU-YuanGroup

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Python11.97 k

2 个月前

OpenHands

@All-Hands-AI

#大语言模型#🙌 OpenHands: Code Less, Make More

agent 人工智能大语言模型 ChatGPT claude-ai

Python55.3 k

1 小时前

LLaMA-Factory

@hiyouga

#大语言模型#Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

fine-tuning language-model llama 大语言模型 peft

Python49.45 k

2 天前

devika

@stitionai

Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. D...

Python18.77 k

8 个月前

open-webui

@open-webui

#大语言模型#ChatGPT 风格的 Ollama Web界面

ollama ollama-webui 大语言模型 webui 自托管

JavaScript95.77 k

7 小时前

llama.cpp

@ggml-org

Facebook 的 LLaMA 模型在 C/C++ 中的移植

llama ggml

C++80.72 k

1 小时前

llama

@meta-llama

LLaMA模型的推理代码

Python58.25 k

4 个月前

Awesome-Multimodal-Large-Language-Models

@BradyFU

✨✨Latest Advances on Multimodal Large Language Models

instruction-tuning instruction-following large-vision-language-model visual-instruction-tuning multi-modality

15.27 k

8 天前

Qwen-VL

@QwenLM • 阿里巴巴

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

large-language-models vision-language-model

Python5.92 k

10 个月前

FastChat

@lm-sys

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python38.62 k

2 天前

CogVLM

THUDM@THUDM

a state-of-the-art-level open visual language model | 多模态预训练模型

cross-modality language-model multi-modal pretrained-models visual-language-models

Python6.55 k

1 年前

Video-LLaVA

@PKU-YuanGroup

【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

instruction-tuning large-vision-language-model multi-modal

Python3.25 k

6 个月前

transformer-debugger

OpenAI@openai

Python4.08 k

1 年前

CLIP

OpenAI@openai

#计算机科学#CLIP（Contrastive Language-Image Pretraining），根据图像预测最相关的文本片段

深度学习机器学习

Jupyter Notebook29.06 k

10 个月前

llamafile

@Mozilla-Ocho

Distribute and run LLMs with a single file.

C++22.46 k

8 天前

vllm

@vllm-project

#大语言模型#A high-throughput and memory-efficient inference and serving engine for LLMs

gpt 大语言模型 PyTorch llmops mlops

Python47.88 k

2 小时前

LAVIS

Salesforce@salesforce

#计算机科学#LAVIS - A One-stop Library for Language-Vision Intelligence

深度学习 deep-learning-library image-captioning salesforce vision-and-language

Jupyter Notebook10.57 k

6 个月前