LLaVA是一个具有 GPT-4V 级别功能的大语言和视觉模型助手
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
2023-04-17
否
2024-08-12T09:52:38Z
该仓库已收录但尚未编辑。项目介绍及使用教程请前往 GitHub 阅读 README
The first competitive instance segmentation approach that runs on small edge devices at real-time speeds.
翻译 - 第一种竞争性实例分割方法可在小型边缘设备上以实时速度运行。
大模型Grok-1开源
Open-Sora: 完全开源的高效复现类Sora视频生成方案
#大语言模型#本地化搭建和运行 Llama2 和其他大模型
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
#大语言模型#🙌 OpenHands: Code Less, Make More
#大语言模型#Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. D...
#大语言模型#ChatGPT 风格的 Ollama Web界面
Facebook 的 LLaMA 模型在 C/C++ 中的移植
LLaMA模型的推理代码
✨✨Latest Advances on Multimodal Large Language Models
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
a state-of-the-art-level open visual language model | 多模态预训练模型
【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
#计算机科学#CLIP(Contrastive Language-Image Pretraining),根据图像预测最相关的文本片段
Distribute and run LLMs with a single file.
#大语言模型#A high-throughput and memory-efficient inference and serving engine for LLMs
#计算机科学#LAVIS - A One-stop Library for Language-Vision Intelligence