OpenMMLab Detection Toolbox and Benchmark
#Large Language Model# Effortless data labeling with AI support from Segment Anything and other awesome models.
#Computer Science# Images to inference with no labeling (use foundation models to train supervised models).
#Large Language Model# Must-have resource for anyone who wants to experiment with and build on the OpenAI vision API 🔥
Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series
#Natural Language Processing# 👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]
A tab for sd-webui for replacing objects in pictures or videos using a detection prompt
#Face Recognition# Securade.ai HUB - A generative-AI-based edge platform for computer vision that connects to existing CCTV cameras and makes them smart.
A Rust library integrated with ONNXRuntime, providing a collection of Computer Vision and Vision-Language models.
Grounded Tracking for Streaming Videos
#Computer Science# Low-latency ONNX- and TensorRT-based zero-shot classification and detection with prompts based on contrastive language-image pre-training
SegMate: A Segmentation Toolkit
Grounding DINO module for use with Autodistill.
Generative-AI-based image editing and inpainting, made easy to work with.
#Computer Science# A simple demo that uses the Grounding DINO and Segment Anything v2 models together
The Drowsiness Detection System uses YOLOv8 models to monitor drowsiness in real time by detecting eye states and yawning. Built with Python and leveraging the GroundingDINO library for bounding box g...
#Computer Science# Explore the cutting edge of computer vision with this comprehensive repository, showcasing a spectrum from classical machine learning to state-of-the-art transformer models.
[ArXiv2024] ModPrompt: Visual Modality Prompt for Adapting Vision-Language Object Detectors
#Large Language Model# V-Zen: Efficient GUI Understanding and Precise Grounding With A Novel Multimodal LLM Resources