GitHub Chinese Community

florence-2

roboflow / maestro

streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL

captioning, fine-tuning, florence-2, multimodal, objectdetection, paligemma, phi-3-vision, transformers, vision-and-language, vqa, qwen2-vl
Python 2.57 k
6 days ago

jhc13 / taggui

Tag manager and captioner for image datasets

image-captioning, pyside6, stable-diffusion, llava, cogvlm, florence-2
Python 1.02 k
1 month ago

D-Ogi / WatermarkRemover-AI

AI-Powered Watermark Remover using Florence-2 and LaMA Models: A Python application leveraging state-of-the-art deep learning models to effectively remove watermarks from images with a user-friendly P...

florence-2, lama-cleaner, watermark-remover, dataset-creation, inpainting
Python 576
1 month ago

autodistill / autodistill-grounded-sam-2

Use Segment Anything 2, grounded with Florence-2, to auto-label data for use in training vision models.

florence-2
Python 124
10 months ago

Ravi-Teja-konda / Surveillance_Video_Summarizer

#Large Language Models# VLM-driven tool that processes surveillance videos, extracts frames, and generates insightful annotations using a fine-tuned Florence-2 Vision-Language Model. Includes a Gradio-based interface for que...

Artificial Intelligence, ChatGPT, florence-2, gpt-4, gradio, gradio-python-llm, huggingface, summarization, Video, vision-and-language, vlm
Python 115
9 days ago

Damarcreative / rem-wm

Watermark remover tool that leverages the capabilities of Microsoft Florence and Lama Cleaner models.

florence-2, lama-cleaner, watermark
Python 79
5 months ago

retkowsky / florence-2

Florence-2

Azure, florence-2
Jupyter Notebook 68
4 months ago

anyantudre / Florence-2-Vision-Language-Model

#Computer Science# Florence-2 is a novel vision foundation model with a unified, prompt-based representation for a variety of computer vision and vision-language tasks.

Computer Vision, Deep Learning, florence-2, huggingface, vision-language, vision-language-model, vision-transformer, vision-transformer-models
Jupyter Notebook 66
1 year ago

autodistill / autodistill-florence-2

Use Florence 2 to auto-label data for use in training fine-tuned object detection models.

florence-2, object-detection, zero-shot-object-detection
Python 64
10 months ago

fireicewolf / wd-llm-caption-cli

A Python-based CLI tool for captioning images with WD series, Joy-caption-pre-alpha, Meta Llama 3.2 Vision Instruct, and Qwen2 VL Instruct models.

qwen2-vl, florence-2
Python 37
3 months ago

sayedmohamedscu / Vision-language-models-VLM

Vision language model fine-tuning notebooks & use cases (paligemma - florence .....)

colab-notebook, Computer Vision, finetuning, multimodal, paligemma, vlm, florence-2
Jupyter Notebook 27
1 day ago

Iteranya / AktivaAI

Local LLM Discord Bot

Artificial Intelligence, discord-bot, florence-2, llama, multimodal, roleplay, Chatbot
Python 16
2 months ago

jacobmarks / fiftyone_florence2_plugin

Run SOTA Vision-Language Model Florence-2 on your data!

Computer Vision, florence-2, Machine Learning, transformer, vision-language-model
Jupyter Notebook 10
3 months ago

mithunparab / text2segment_video

Simple Video Summarization using Text-to-Segment Anything (Florence2 + SAM2). This project provides a video processing tool that utilizes advanced AI models, specifically Florence2 and SAM2, to detect...

florence-2, optical-flow, raft, segment-anything
Python 10
4 months ago

nguyennpa412 / simple-multimodal-ai

#Large Language Models# Simple Gradio application integrated with Hugging Face multimodal models to support a visual question answering chatbot and more features

Computer Vision, Docker, gradio, text-to-speech, visual-question-answering, vlm, Large Language Models, mllm, florence-2
Python 5
10 months ago

sitamgithub-MSIT / TextSnap

TextSnap: Demo for Florence 2 model used in OCR tasks to extract and visualize text from images.

Artificial Intelligence, florence-2, gradio, gradio-interface, huggingface-spaces, huggingface-transformers, optical-character-recognition, vision-language-model, Python
Python 4
2 months ago

regiellis / ecko-cli

ecko-cli is a simple CLI tool that streamlines the process of processing images in a directory, generating captions, and saving them as text files. Additionally, it provides functionalities to create ...

Artificial Intelligence, Command-Line Interface, florence-2, generative-ai, huggingface-transformers, image-classification, Image Processing, onnxruntime
Python 4
7 months ago

Rm1n90 / Florence2Onnx

ONNX deployment for the Florence 2 visual multimodal model

florence-2, onnx, onnxruntime, inference
Python 4
4 months ago

PranayLendave / text2video_synopsis

Video Synopsis: Intelligent Video Object Summarization using Florence/OWL-ViT and SAM. It uses OWL-ViT or Florence 2 for object detection, SAM for segmentation, and a custom video synopsis algorithm t...

florence-2, sam
Python 2
6 months ago

Kazuhito00 / Florence-2-Colaboratory-Sample

Sample for running Florence-2, Microsoft's lightweight VLM, on Colaboratory

colaboratory, florence-2, Python, vlm
Jupyter Notebook 2
10 months ago