GitHub Chinese Community
#grounding-dino

open-mmlab / mmdetection

OpenMMLab Detection Toolbox and Benchmark

object-detection, instance-segmentation, fast-rcnn, faster-rcnn, mask-rcnn, cascade-rcnn, ssd, retinanet, PyTorch, panoptic-segmentation, rtmdet, swin-transformer, transformer, vision-transformer, yolo, convnext, detr, grounding-dino
Python 31.17 k
10 months ago

CVHub520 / X-AnyLabeling

#Large Language Models# Effortless data labeling with AI support from Segment Anything and other awesome models.

labeling-tool, paddle, PyTorch, resnet, sam, yolo, deep-learning, onnx, clip, llm, annotation-tool, classification, depth-estimation, grounding-dino, image-segmentation, matting, object-detection, pose-estimation, vlm
Python 5.77 k
19 hours ago

autodistill / autodistill

#Computer Science# Images to inference with no labeling (use foundation models to train supervised models).

computer-vision, auto-labeling, deep-learning, foundation-models, grounding-dino, image-annotation, image-classification, instance-segmentation, labeling-tool, machine-learning, multimodal, object-detection, PyTorch, segment-anything, yolov5, yolov8
Python 2.29 k
1 month ago

roboflow / awesome-openai-vision-api-experiments

#Large Language Models# Must-have resource for anyone who wants to experiment with and build on the OpenAI vision API 🔥

ChatGPT, computer-vision, openai, classification, clip, zero-shot, grounding-dino, open-vocabulary-detection, open-vocabulary-segmentation, segment-anything
Python 1.68 k
5 months ago

IDEA-Research / Grounding-DINO-1.5-API

Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series

grounding-dino, object-detection, open-set, foundation-model, open-vocabulary-detection, open-world, zero-shot-object-detection
Python 964
5 months ago

SkalskiP / awesome-foundation-and-multimodal-models

#Natural Language Processing# 👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]

blip, clip, foundational-models, grounding-dino, llava, multimodal, segment-anything, computer-vision, natural-language-processing, open-vocabulary-detection, open-vocabulary-segmentation, image-captioning
Python 621
1 year ago

light-and-ray / sd-webui-replacer

A tab for sd-webui for replacing objects in pictures or videos using detection prompt

grounding-dino, inpainting, segment-anything, stable-diffusion, stable-diffusion-webui, controlnet, video-generation, video-inpainting
Python 232
6 months ago

securade / hub

#Face Recognition# Securade.ai HUB - A generative AI based edge platform for computer vision that connects to existing CCTV cameras and makes them smart.

artificial-intelligence, computer-vision, face-detection, generative-ai, grounding-dino, jetson, machine-learning, model-zoo, Nvidia, object-detection, video-analytics
Python 183
4 months ago

jamjamjon / usls

A Rust library integrated with ONNXRuntime, providing a collection of Computer Vision and Vision-Language models.

CUDA, tensorrt, yolov8, OCR, yolo, sam, grounding-dino, onnxruntime, florence2, clip, yolov10, onnx
Rust 175
12 hours ago

patrick-tssn / Streaming-Grounded-SAM-2

Grounded Tracking for Streaming Videos

grounding-dino, segment-anything, streaming-video, tracking
Jupyter Notebook 104
8 months ago

autodistill / autodistill-grounded-sam

GroundedSAM Base Model plugin for Autodistill

grounding-dino
Python 51
1 year ago

rhysdg / vision-at-a-clip

#Computer Science# Low-latency ONNX and TensorRT based zero-shot classification and detection with contrastive language-image pre-training based prompts

clip, foundation-models, machine-learning, onnx, tensorrt, zero-shot-classification, grounding-dino, zero-shot-object-detection
Jupyter Notebook 41
10 months ago

VectorInstitute / SegMate

SegMate: A Segmentation Toolkit

computer-vision, foundation-model, segment-anything-model, grounding-dino, image-segmentation, prompt-tuning
Jupyter Notebook 23
1 year ago

autodistill / autodistill-grounding-dino

Grounding DINO module for use with Autodistill.

grounding-dino
Python 21
1 year ago

pooya-mohammadi / easy_image_inpainting

Generative AI based image editing/inpainting made super easy to work with.

generative-ai, diffusion-models, glide, grounding-dino, PyTorch, image-editing, image-inpainting
Jupyter Notebook 20
2 years ago

CoffeeVampir3 / grounding-sam2-demo

#Computer Science# A simple demo for utilizing grounding dino and segment anything v2 models together

demo-app, gradio, grounding, grounding-dino, machine-learning, segment, segment-anything, segmentation
Python 20
1 year ago

tyrerodr / real-time-drowsy-driving-detection

The Drowsiness Detection System uses YOLOv8 models to monitor drowsiness in real-time by detecting eye states and yawning. Built with Python and leveraging the GroundingDINO library for bounding box g...

computer-vision, grounding-dino, yolov8
Jupyter Notebook 14
2 months ago

JayyShah / End-to-end-Computer-Vision

#Computer Science# Explore the cutting edge of computer vision with this comprehensive repository, showcasing a spectrum from classical machine learning to state-of-the-art transformer models.

computer-vision, deep-learning, image-classification, image-segmentation, machine-learning, neural-networks, object-detection, transformer, yolo, clip, grounding-dino, rcnn, ssd
Jupyter Notebook 8
1 year ago

heitorrapela / ModPrompt

[ArXiv2024] ModPrompt: Visual Modality Prompt for Adapting Vision-Language Object Detectors

adaptation, grounding-dino, objectdetection, visual-prompt, yolo-world
8
6 months ago

abdur75648 / V-Zen

#Large Language Models# V-Zen: Efficient GUI Understanding and Precise Grounding With A Novel Multimodal LLM Resources

ChatGPT, gpt, grounding-dino, GUI, large-language-models, llama, llm, mistral, multimodal-large-language-models, vicuna, agi, superagi
6
1 year ago