#大语言模型#Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. https://huggingface.co/space...
A length-controllable and non-autoregressive image captioning model.
#计算机科学#PyTorch implementation of a Controllable Image Captioning model with a language-driven mechanism for advancing the region pointer state that keeps it in sync with the state of the language model. Code...
#计算机科学#Pipeline model for controllable image captioning with user preference settings. Code and model output for the paper Show, Prefer and Tell: Incorporating User Preferences into Image Captioning (Lindh e...