Easily fine-tune, evaluate, and deploy gpt-oss, Qwen3, DeepSeek-R1, or any open-source LLM / VLM!
#Natural Language Processing# An on-premises, OCR-free toolkit for unstructured data extraction, Markdown conversion, and benchmarking. (https://idp-leaderboard.org/)
Official repository for VisionZip (CVPR 2025)
#Large Language Model# [CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models
#Large Language Model# Scala client for the OpenAI API and other major LLM providers
[CVPR 2024] Official implementation of "ViTamin: Designing Scalable Vision Models in the Vision-language Era"
[NeurIPS 2024] AWT: Transferring Vision-Language Models via Augmentation, Weighting, and Transportation
This repository collects research papers on large foundation models for scenario generation and analysis in autonomous driving. It is continuously updated to track the latest work.
[CVPR 2025] SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories
[NeurIPS'24] Official PyTorch Implementation of Seeing the Image: Prioritizing Visual Correlation by Contrastive Alignment
[ACL 2025 🔥] A Comprehensive Multi-Domain Benchmark for Arabic OCR and Document Understanding
#Computer Science# A hub for researchers exploring VLMs and multimodal learning :)
Benchmarking Vision-Language Models on OCR tasks in Dynamic Video Environments
[ICASSP 2024] The official repo for "Harnessing the Power of Large Vision Language Models for Synthetic Image Detection"
[COLM 2025] JailDAM: Jailbreak Detection with Adaptive Memory for Vision-Language Model
Convert documents and images to high-quality Markdown using vision LLMs; built for RAG ingestion pipelines (see the sketch after this list).
[NAACL 2025] Guiding Large Language Models in Code Execution with Fine-grained Multimodal Chain-of-Thought Reasoning
We introduce VLM-Mamba, the first Vision-Language Model built entirely on State Space Models (SSMs), specifically leveraging the Mamba architecture.
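To make the VLM-Mamba entry concrete: Mamba-style models build on the linear state-space recurrence h_t = A h_{t-1} + B x_t, y_t = C h_t. Below is a minimal NumPy sketch of that sequential core, not code from the VLM-Mamba repository; the matrices, dimensions, and `ssm_scan` helper are made-up illustrations, and Mamba itself additionally makes A, B, C input-dependent ("selective") and replaces this loop with a hardware-aware parallel scan.

```python
import numpy as np

def ssm_scan(A, B, C, xs):
    """Run the linear SSM recurrence over a sequence xs of shape (T, d_in):
       h_t = A @ h_{t-1} + B @ x_t,  y_t = C @ h_t."""
    h = np.zeros(A.shape[0])
    ys = []
    for x in xs:
        h = A @ h + B @ x   # state update
        ys.append(C @ h)    # readout
    return np.stack(ys)

# Toy example with random parameters (hypothetical sizes).
rng = np.random.default_rng(0)
d_in, d_state, d_out, T = 4, 8, 4, 16
A = rng.normal(scale=0.1, size=(d_state, d_state))
B = rng.normal(size=(d_state, d_in))
C = rng.normal(size=(d_out, d_state))
xs = rng.normal(size=(T, d_in))
print(ssm_scan(A, B, C, xs).shape)  # (16, 4)
```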
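For the VLM-based document-to-Markdown converter listed above, the core pattern is sending a page image to a vision-capable LLM with a transcription prompt. Here is a minimal sketch using the official `openai` Python client; the model name, prompt, and `page_to_markdown` helper are hypothetical placeholders, not that repository's actual API.

```python
import base64
from openai import OpenAI  # pip install openai

client = OpenAI()  # reads OPENAI_API_KEY from the environment

def page_to_markdown(image_path: str, model: str = "gpt-4o-mini") -> str:
    """Ask a vision LLM to transcribe one page image as Markdown."""
    with open(image_path, "rb") as f:
        b64 = base64.b64encode(f.read()).decode()
    resp = client.chat.completions.create(
        model=model,
        messages=[{
            "role": "user",
            "content": [
                {"type": "text",
                 "text": "Transcribe this page as clean Markdown. "
                         "Preserve headings, lists, and tables."},
                {"type": "image_url",
                 "image_url": {"url": f"data:image/png;base64,{b64}"}},
            ],
        }],
    )
    return resp.choices[0].message.content
```

In a RAG ingestion pipeline, the returned Markdown would then be chunked and embedded like any other text source.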