GitHub Chinese Community

florence-2

roboflow / maestro

streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL

captioning, fine-tuning, florence-2, multimodal, objectdetection, paligemma, phi-3-vision, transformers, vision-and-language, vqa, qwen2-vl
Python 2.6k
4 days ago
jhc13 / taggui

Tag manager and captioner for image datasets

image-captioning, pyside6, stable-diffusion, llava, cogvlm, florence-2
Python 1.07k
2 months ago
D-Ogi / WatermarkRemover-AI

AI-Powered Watermark Remover using Florence-2 and LaMA Models: A Python application leveraging state-of-the-art deep learning models to effectively remove watermarks from images with a user-friendly P...

florence-2, lama-cleaner, watermark-remover, dataset-creation, inpainting
Python 658
7 hours ago
autodistill / autodistill-grounded-sam-2

Use Segment Anything 2, grounded with Florence-2, to auto-label data for use in training vision models.

florence-2
Python 126
1 year ago
Ravi-Teja-konda / Surveillance_Video_Summarizer

#Large language models# VLM-driven tool that processes surveillance videos, extracts frames, and generates insightful annotations using a fine-tuned Florence-2 Vision-Language Model. Includes a Gradio-based interface for que...

artificial intelligence, ChatGPT, florence-2, gpt-4, gradio, gradio-python-llm, huggingface, summarization, Video, vision-and-language, vlm
Python 119
2 months ago
anyantudre / Florence-2-Vision-Language-Model

#Computer science# Florence-2 is a novel vision foundation model with a unified, prompt-based representation for a variety of computer vision and vision-language tasks.

computer vision, deep learning, florence-2, huggingface, vision-language, vision-language-model, vision-transformer, vision-transformer-models
Jupyter Notebook 83
1 year ago
Damarcreative / rem-wm

Watermark remover tool that leverages the capabilities of Microsoft Florence and Lama Cleaner models.

florence-2, lama-cleaner, watermark
Python 80
6 months ago
retkowsky / florence-2

Florence-2

Azure, florence-2
Jupyter Notebook 68
6 months ago
autodistill / autodistill-florence-2

Use Florence 2 to auto-label data for use in training fine-tuned object detection models.

florence-2, object-detection, zero-shot-object-detection
Python 65
1 year ago
sayedmohamedscu / Vision-language-models-VLM

vision language models finetuning notebooks & use cases (Medgemma - paligemma - florence .....)

colab-notebook, computer vision, finetuning, multimodal, paligemma, vlm, florence-2, lora, Medical imaging, qlora
Jupyter Notebook 46
1 month ago
fireicewolf / wd-llm-caption-cli

A Python-based CLI tool for captioning images with WD series, Joy-caption-pre-alpha, Meta Llama 3.2 Vision Instruct, and Qwen2 VL Instruct models.

qwen2-vl, florence-2
Python 38
4 months ago
Iteranya / AktivaAI

Local LLM Discord Bot

artificial intelligence, discord-bot, florence-2, llama, multimodal, roleplay, chatbot
Python 17
1 month ago
jacobmarks / fiftyone_florence2_plugin

Run SOTA Vision-Language Model Florence-2 on your data!

computer vision, florence-2, machine learning, transformer, vision-language-model
Jupyter Notebook 12
4 months ago
mithunparab / text2segment_video

Simple Video Summarization using Text-to-Segment Anything (Florence2 + SAM2). This project provides a video processing tool that utilizes advanced AI models, specifically Florence2 and SAM2, to detect...

florence-2, optical-flow, raft, segment-anything
Python 10
5 months ago
nguyennpa412 / simple-multimodal-ai

#Large language models# Simple Gradio application integrated with Hugging Face multimodal models to support a visual question answering chatbot and more features

computer vision, Docker, gradio, text-to-speech, visual-question-answering, vlm, large language models, mllm, florence-2
Python 6
1 year ago
sitamgithub-MSIT / TextSnap

TextSnap: Demo for Florence 2 model used in OCR tasks to extract and visualize text from images.

artificial intelligence, florence-2, gradio, gradio-interface, huggingface-spaces, huggingface-transformers, optical-character-recognition, vision-language-model, Python
Python 5
4 months ago
regiellis / ecko-cli

ecko-cli is a simple CLI tool that streamlines processing images in a directory, generating captions, and saving them as text files. Additionally, it provides functionalities to create ...

artificial intelligence, command-line interface, florence-2, generative-ai, huggingface-transformers, image-classification, image processing, onnxruntime
Python 5
10 days ago
PRITHIVSAKTHIUR / Florence-2-Image-Caption

This application utilizes the powerful Florence-2 vision-language model from Microsoft to generate comprehensive captions for images. The model is capable of understanding visual content and expressin...

florence-2, gradio, huggingface, image-captioning, image processing, pillow, timm, torch, transformers, vision-language-model
Python 5
5 days ago
Rm1n90 / Florence2Onnx

ONNX deployment for the Florence 2 visual multimodal model

florence-2, onnx, onnxruntime, inference
Python 5
6 months ago
jkawamoto / mcp-florence2

An MCP server for processing images using Florence-2

florence-2, mcp-server, Python
Python 4
11 days ago