OpenMMLab Detection Toolbox and Benchmark
#Large Language Models#Effortless data labeling with AI support from Segment Anything and other awesome models.
#Computer Science#Images to inference with no labeling (use foundation models to train supervised models).
#Large Language Models#Must-have resource for anyone who wants to experiment with and build on the OpenAI vision API 🔥
Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series
#Natural Language Processing#👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]
VisioFirm: Cross-Platform AI-assisted Annotation Tool for Computer Vision
A tab for sd-webui for replacing objects in pictures or videos using a detection prompt
#Face Recognition#Securade.ai HUB - A generative AI based edge platform for computer vision that connects to existing CCTV cameras and makes them smart.
A Rust library integrated with ONNXRuntime, providing a collection of Computer Vision and Vision-Language models.
Grounded Tracking for Streaming Videos
#Computer Science#Low-latency ONNX and TensorRT based zero-shot classification and detection with contrastive language-image pre-training based prompts
SegMate: A Segmentation Toolkit
The Drowsiness Detection System uses YOLOv8 models to monitor drowsiness in real-time by detecting eye states and yawning. Built with Python and leveraging the GroundingDINO library for bounding box g...
Grounding DINO module for use with Autodistill.
Generative AI based image editing/inpainting made super easy to work with.
#Computer Science#A simple demo for using the Grounding DINO and Segment Anything v2 models together
[ICCV2025] ModPrompt: Visual Modality Prompt for Adapting Vision-Language Object Detectors
#Computer Science#Explore the cutting edge of computer vision with this comprehensive repository, showcasing a spectrum from classical machine learning to state-of-the-art transformer models.