image-to-text · GitHub Topics

thiagoalessio / tesseract-ocr-for-php

A wrapper to work with Tesseract OCR inside PHP.

OCR tesseract PHP text-recognition image-to-text

PHP 3.01 k

6 个月前

killkimno / MORT

MORT 번역기 프로젝트 - Real-time game translator with OCR

OCR auto-translation translation translate game game-translation tesseract-ocr image-to-text

C# 1.2 k

8 天前

lucidrains / CoCa-pytorch

#计算机科学#Implementation of CoCa, Contrastive Captioners are Image-Text Foundation Models, in Pytorch

人工智能 attention-mechanism contrastive-learning 深度学习 multimodal transformers image-to-text

Python 1.18 k

2 年前

PaddlePaddle / PaddleMIX

Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high per...

aigc stable-diffusion clip image-to-text text-to-image controlnet multimodal text-to-video dit llava sora qwen2-vl minicpm-v

Python 697

12 天前

Flame-Code-VLM / Flame-Code-VLM

#前端开发#Flame is an open-source multimodal AI system designed to translate UI design mockups into high-quality React code. It leverages vision-language modeling, automated data synthesis, and structured train...

code-generation frontend-development vision-language-model 人工智能深度学习前端 multimodal Open Source React vlm deepseek design-to-code front-end image-to-text 大语言模型 Vue.js

Python 540

6 个月前

zapolnoch / node-tesseract-ocr

A Node.js wrapper for the Tesseract OCR API

tesseract OCR text-recognition image-to-text

JavaScript 313

2 年前

google / imageinwords

Data release for the ImageInWords (IIW) paper.

evaluation image-captioning image-to-text dataset dataset-generation

JavaScript 220

10 个月前

Yushi-Hu / tifa

TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering

image-to-text large-language-models text-to-image visual-question-answering

Python 173

1 年前

NormXU / nougat-latex-ocr

Codebase for fine-tuning / evaluating nougat-based image2latex generation models

image-to-text

Python 157

1 年前

shoryasethia / markdrop

#大语言模型#A Python package for converting PDFs to markdown while extracting images and tables, generate descriptive text descriptions for extracted tables/images using several LLM clients. And many more functio...

Open Source pypi-package image-to-text 大语言模型 pdf-to-markdown pdf-to-text table-to-text agents

Python 151

2 个月前

yardstick17 / image_text_reader

The module extracts text from image using the tesseract-OCR engine. Generally, text present in the images are blur or are of uneven sizes. The image is pre-processed for better comprehension by OCR. T...

OCR image-to-text tesseract-ocr

Python 147

6 年前

nateshmbhat / card-scanner-flutter

A flutter package for Fast, Accurate and Secure Credit card & Debit card scanning

Flutter Dart 机器学习人工智能 credit-card 图像处理 image-to-text

Swift 126

7 个月前

mshdabiola / NotePad

Notepad is multi module Jetpack compose note taking app with sketch pad, voice recorder, image capturing app

Android Actions Jetpack Compose Kotlin image-to-text room-persistence-library

Kotlin 114

13 天前

BEPb / image_to_ascii

Everything is very simple: you either download a picture file or specify its link when running a python script, and output you get a text file, and you can immediately view on the command line how it ...

cmd image-to-text conversion convert converter Python Script

Python 113

2 年前