GitHub Chinese Community
#paligemma

roboflow / notebooks

#Computer Science# A collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-edge models like YOLO11, RT-DETR, SAM 2, ...

computer vision, deep learning, deep neural networks, image-classification, image-segmentation, object-detection, yolov5, PyTorch, tutorial, yolov8, google-colab, machine learning, zero-shot-classification, open-vocabulary-detection, automatic-labeling-system, open-vocabulary-segmentation, paligemma, qwen, vlm
Jupyter Notebook 8.37 k
19 days ago
roboflow / maestro

Streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL

captioning, fine-tuning, florence-2, multimodal, objectdetection, paligemma, phi-3-vision, transformers, vision-and-language, vqa, qwen2-vl
Python 2.63 k
2 days ago
google-gemini / gemma-cookbook

A collection of guides and examples for the Gemma open models from Google.

codegemma, gemma, paligemma, recurrentgemma
Jupyter Notebook 2.17 k
5 days ago
Blaizzy / mlx-vlm

#LLM# MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.

llava, LLM, MLX, vision-transformer, apple-silicon, idefics, local-ai, paligemma, vision-framework, vision-language-model, florence2, molmo, pixtral
Python 1.63 k
10 days ago
adithya-s-k / YoloGemma

Testing and evaluating the capabilities of vision-language models (PaliGemma) in performing computer vision tasks such as object detection and segmentation.

gemma, paligemma, vlm
Python 84
1 year ago
sayedmohamedscu / Vision-language-models-VLM

Vision language model fine-tuning notebooks and use cases (MedGemma, PaliGemma, Florence, ...)

colab-notebook, computer vision, finetuning, multimodal, paligemma, vlm, florence-2, lora, Medical imaging, qlora
Jupyter Notebook 49
2 months ago
BUAADreamer / MLLM-Finetuning-Demo

Demo of fine-tuning multimodal LLMs with LLaMA-Factory

llava, mllm, paligemma, finetune-llm, lora, transformers, pretraining
Python 49
1 year ago
autodistill / autodistill-paligemma

Use PaliGemma to auto-label data for use in training fine-tuned vision models.

computer vision, zero-shot-object-detection, paligemma
Python 12
1 year ago
MaxLSB / mini-paligemma2

#Computer Science# Minimalist implementation of PaliGemma 2 and PaliGemma VLM from scratch

deep learning, machine learning, paligemma, Python, PyTorch, vision-language-model, vlm
Python 10
7 months ago
anamabo / SegmentWaterWithPaligemma

Segmentation of water in satellite images using PaliGemma

computer vision, paligemma, remote-sensing, satellite-imagery
Jupyter Notebook 7
9 months ago
shaadclt / Fine-tune-PaliGemma-Image-Captioning

This project demonstrates how to fine-tune the PaliGemma model for image captioning. The PaliGemma model, developed by Google Research, is designed to handle images and generate corresponding captions.

fine-tuning, image-captioning, paligemma
Jupyter Notebook 6
10 months ago
kornia / kornia-paligemma

Rust implementation of Google PaliGemma with Candle

paligemma, Rust, visual-language-models
Rust 6
4 months ago
GURPREETKAURJETHRA / PaliGemma-Inference-and-Fine-Tuning

#LLM# PaliGemma inference and fine-tuning

gemma, generative-ai, Google, large-language-models, LLM, paligemma, finetuning, llm-inference
Jupyter Notebook 5
1 year ago
GURPREETKAURJETHRA / PaliGemma-FineTuning

PaliGemma fine-tuning

fine-tuning, generative-ai, large-language-models, LLM, openai, paligemma
Jupyter Notebook 5
1 year ago
tristandb8 / PyTorch-PaliGemma-2

#Computer Science# PyTorch implementation of PaliGemma 2

computer vision, deep learning, paligemma, PyTorch, visual-language-models, vlm
Python 3
5 months ago
Mreeb / Finetune_PaliGemma

Fine-tuning PaliGemma

fine-tuning, paligemma, Python
Jupyter Notebook 3
1 year ago
kmk2977 / VLM-paligemma

Notes for the Vision Language Model implementation by Umar Jamil

gemma, paligemma, pytorch-implementation, transformer, vision-language-model
Python 2
1 year ago
Khalidelommali / Foundation-Model-Tutorial

#Computer Science# Foundation-Models chat app tutorial for iOS with on-device LLMs, tools, and chat. Shows on-device inference with FoundationModels and calendar tool use. 🐙

API, automatic-labeling-system, Amazon Web Services, computer vision, deep learning, deep neural networks, foundation-models, google-colab, image-captioning, multimodal, open-vocabulary-detection, open-vocabulary-segmentation, paligemma, representation-learning, robustness, speech-processing, tutorial, zero-shot-classification
Swift 2
2 days ago
6DEADSHOT9 / Pali-pa-Jamma

#Computer Science# PyTorch implementation of Google’s PaliGemma VLM with SigLIP image encoder, KV caching, rotary embeddings, and grouped-query attention. Modular, research-friendly, and easy to extend for experimentati...

deep learning, gemma, Google, huggingface, paligemma, Python, PyTorch, pytorch-implementation
Python 2
3 months ago
AHMEDSANA / PaliGemma-flickr8k-finetuning

#Natural Language Processing# This repository contains code for fine-tuning Google's PaliGemma vision-language model on the Flickr8k dataset for image captioning tasks.

computer vision, deep learning, fine-tuning, flax, image-captioning, image processing, jax, kaggle, machine learning, natural language processing, paligemma, Python, vision-language-model, artificial intelligence, compter-vision, image-annotation, PyTorch, transfer-learning
Jupyter Notebook 2
4 months ago