GitHub Chinese Community

clip

mikel-brostrom / boxmot

#Computer Science# BoxMOT: pluggable SOTA tracking modules for segmentation, object detection and pose estimation models

strongsort, bytetrack, ocsort, osnet, deep-learning, segmentation, tensorrt, tracking-by-detection, yolo, botsort, deepocsort, multi-object-tracking, mot, mots, multi-object-tracking-segmentation, improvedassociation, boosttrack, clip, oriented-bounding-box-tracking, machine-learning
Python 7.41 k
1 day ago
CVHub520 / X-AnyLabeling

#Large Language Models# Effortless data labeling with AI support from Segment Anything and other awesome models.

labeling-tool, paddle, PyTorch, resnet, sam, yolo, deep-learning, onnx, clip, llm, annotation-tool, classification, depth-estimation, grounding-dino, image-segmentation, matting, object-detection, pose-estimation, vlm
Python 5.77 k
20 hours ago
OFA-Sys / Chinese-CLIP

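#Natural Language Processing# A Chinese version of the CLIP model, trained on large-scale Chinese data (~200 million image-text pairs). It is intended to help users quickly carry out image-text feature and similarity computation, cross-modal retrieval, and zero-shot image classification in the Chinese domain.

chinese, computer-vision, multi-modal-learning, natural-language-processing, PyTorch, vision-and-language-pre-training, image-text-retrieval, clip, pretrained-models, vision-language, deep-learning, multi-modal, contrastive-loss, transformers, coreml-models
Python 5.28 k
10 months ago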
marqo-ai / marqo

#Search# Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai

deep-learning, information-retrieval, machine-learning, vector-search, tensor-search, clip, multi-modal, search-engine, transformers, vision-language, semantic-search, visual-search, natural-language-processing, hnsw, knn, Hacktoberfest, ChatGPT, gpt, large-language-models
Python 4.89 k
8 hours ago
easychen / pushdeer

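Open-source push notification service that needs no app: on iOS 14+, scan a code and it is ready to use. Also supports Quick Apps, iOS and Mac clients, Android clients, and self-built devices.

App, push, clip, notification-service
C 4.84 k
3 months ago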
open-mmlab / mmpretrain

#Computer Science# OpenMMLab Pre-training Toolbox and Benchmark

image-classification, resnet, mobilenet, PyTorch, deep-learning, swin-transformer, beit, clip, constrastive-learning, convnext, masked-image-modeling, moco, pretrained-models, self-supervised-learning, vision-transformer, multimodal
Python 3.68 k
7 months ago
yuanzhoulvpi2017 / zero_nlp

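#Natural Language Processing# Chinese NLP solutions (large models, data, models, training, inference)

bert, natural-language-processing, transformers, gpt2, chatglm-6b, clip, gpt, PyTorch, text-generation, huggingface-transformers, llama2, llama, llava
Jupyter Notebook 3.51 k
6 days ago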
pharmapsychotic / clip-interrogator

Image to prompt with BLIP and CLIP

clip, PyTorch
Python 2.85 k
1 year ago
jingyi0000 / VLM_survey

#Computer Science# Collection of AWESOME vision-language models for vision tasks

computer-vision, deep-learning, knowledge-distillation, survey, transfer-learning, vision-language-model, clip
2.77 k
22 days ago
rom1504 / clip-retrieval

#Computer Science# Easily compute clip embeddings and build a clip retrieval system with them

semantic-search, deep-learning, multimodal, artificial-intelligence, clip, knn
Jupyter Notebook 2.57 k
1 year ago
open-compass / VLMEvalKit

#Large Language Models# Open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks

gpt-4v, large-language-models, llava, multi-modal, openai, vqa, llm, openai-api, qwen, gpt, computer-vision, PyTorch, gpt4, ChatGPT, clip, vit, evaluation, claude, gemini
Python 2.52 k
3 days ago
RuffianZhong / RWidgetHelper

Rapid Android UI development; a cure for the many shortcomings of native widgets

state, selector, circle, textview, imageview, gradient, shape, ripper, shadow, clip
Java 1.94 k
1 year ago
cambrian-mllm / cambrian

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

chatbot, clip, computer-vision, dino, instruction-tuning, large-language-models, llm, mllm, multimodal-large-language-models, representation-learning
Python 1.91 k
8 months ago
QIN2DIM / hcaptcha-challenger

#Large Language Models# 🥂 Gracefully face hCaptcha challenges with a multimodal large language model.

hcaptcha, hcaptcha-solver, yolo, Playwright, clip, agent, gemini, llm, ai-agents, ChatGPT, openai, captcha-solver, captcha, captcha-solving
Python 1.76 k
5 days ago
roboflow / awesome-openai-vision-api-experiments

#Large Language Models# Must-have resource for anyone who wants to experiment with and build on the OpenAI vision API 🔥

ChatGPT, computer-vision, openai, classification, clip, zero-shot, grounding-dino, open-vocabulary-detection, open-vocabulary-segmentation, segment-anything
Python 1.68 k
5 months ago
mbzuai-oryx / Video-ChatGPT

[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for ...

chatbot, clip, gpt-4, llama, llava, vicuna, vision-language, vision-language-pretraining
Python 1.38 k
3 months ago
yzhuoning / Awesome-CLIP

Awesome list for research on CLIP (Contrastive Language-Image Pre-Training).

clip, contrastive-learning, pre-training
1.2 k
1 year ago
unum-cloud / uform

#Vector Search Engine# Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️

huggingface-transformers, language-vision, multimodal, PyTorch, semantic-search, transformer, cross-attention, vector-search, bert, neural-network, pretrained-models, multi-lingual, clip, openai, contrastive-learning, representation-learning, clustering, image-search, llava
Python 1.15 k
5 months ago
SkalskiP / vlms-zero-to-hero

#Natural Language Processing# This series will take you on a journey from the fundamentals of NLP and Computer Vision to the cutting edge of Vision-Language Models.

bert-model, clip, computer-vision, embeddings, gpt, gpt-2, lora, natural-language-processing, seq2seq, vision-language-model, word2vec
Jupyter Notebook 1.09 k
5 months ago
EdVince / Stable-Diffusion-NCNN

#Android# Stable Diffusion in NCNN with C++, supporting txt2img and img2img

clip, C++, diffusion, mnn, ncnn, onnx, stable-diffusion, tensorrt, tnn, Android, executable, img2img, txt2img
C++ 1.04 k
2 years ago