This repository has been indexed but not yet edited. For the project introduction and usage tutorial, read the README on GitHub.

About

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

generation large-language-models vision-language-model

Created

2024-03-26

Made in China?

No

Last modified

2024-05-04T14:36:51Z

Readme

Languages

Python 85.0%
Shell 11.5%
JavaScript 1.8%
HTML 1.4%
CSS 0.3%

Other open-source projects by dvlab-research

LongLoRA

@dvlab-research

#LLM# Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

fine-tuning-llm large-language-models long-context LLM lora

Python 2.69 k

1 year ago

LISA

@dvlab-research

#LLM# Project Page for "LISA: Reasoning Segmentation via Large Language Model"

LLM multi-modal segmentation

Python 2.43 k

8 months ago

VoxelNeXt

@dvlab-research

VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking (CVPR 2023)

3d-multi-object-tracking 3d-object-detection autonomous-driving lidar

Python 829

2 years ago

DeepUPE

@dvlab-research

Underexposed Photo Enhancement Using Deep Illumination Estimation

Python 564

3 years ago

You may be interested in

Open-Sora

@hpcaitech

Open-Sora: a fully open-source, efficient solution for reproducing Sora-like video generation

Python 27.28 k

5 months ago

grok-1

@xai-org

Open-source release of the Grok-1 large model

Python 50.51 k

1 year ago

Open-Sora-Plan

@PKU-YuanGroup

This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.

Python 12.03 k

4 days ago

OpenHands

@All-Hands-AI

#LLM# 🙌 OpenHands: Code Less, Make More

agent AI LLM ChatGPT claude-ai

Python 63.93 k

3 hours ago

devika

@stitionai

Devika is now Opcode

Python 19.5 k

8 days ago

VoiceCraft

@jasonppy

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Jupyter Notebook 8.4 k

7 months ago

DeepSeek-VL

@deepseek-ai

DeepSeek-VL: Towards Real-World Vision-Language Understanding

vision-language-model vision-language-pretraining foundation-models

Python 3.97 k

1 year ago

AIOS

@agiresearch

AIOS: AI Agent Operating System

Python 4.67 k

9 days ago

AniPortrait

@Zejun-Yang

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

Python 5 k

1 year ago

dust3r

NAVER @naver

DUSt3R: Geometric 3D Vision Made Easy

Python 6.65 k

8 days ago

MoneyPrinterTurbo

@harry0703

#LLM# Generate high-definition short videos with one click using large AI models.

shortvideo automation ChatGPT moviepy Python

Python 45.47 k

4 months ago

TripoSR

@VAST-AI-Research

TripoSR: Fast 3D Object Reconstruction from a Single Image

Python 5.78 k

1 year ago

openui

Weights and Biases @wandb

OpenUI lets you describe UI using your imagination, then see it rendered live.

AI generative-ai html-css-javascript Tailwind CSS

TypeScript 21.74 k

2 days ago

sd-forge-layerdiffuse

@lllyasviel

[WIP] Layer Diffusion for WebUI (via Forge)

Python 4.1 k

1 year ago

Mora

@lichao-sun

Mora: More like Sora for Generalist Video Generation

Python 1.57 k

1 year ago

LaVague

@lavague-ai

#LLM# Large Action Model framework to develop AI Web Agents

AI browser large-action-model LLM Open Source

Python 6.18 k

8 months ago

transformer-debugger

OpenAI @openai

Python 4.1 k

1 year ago

champ

@fudan-generative-vision

[ECCV 2024] Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance

human-animation video-generation image-animatioln

Python 4.23 k

1 year ago

BrushNet

@TencentARC

[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"

diffusion diffusion-models image-inpainting text-to-image eccv

Python 1.67 k

10 months ago

dbrx

Databricks @databricks

#LLM# Code examples and resources for DBRX, a large language model developed by Databricks

databricks gen-ai generative-ai LLM llm-inference

Python 2.57 k

1 year ago

dvlab-research / MGM
