# data-extraction

D4Vinci/Scrapling
Python 7.31 k
7 hours ago
vi3k6i5
Python 5.68 k
5 months ago
JonathanLink

Converts a pdf file into a text file while keeping the layout of the original pdf. Useful to extract the content from a table in a pdf file for instance. This is a subclass of PDFTextStripper class (f...

Java 1.6 k
2 years ago
brightdata
JavaScript 1.32 k
3 days ago
thinh-vu

A beginner-friendly yet powerful Python toolkit for financial analysis and automation — built to make modern investing accessible to everyone

Python 965
1 day ago
adrienjoly
HTML 691
8 months ago
trustgraph-ai

The agentic AI platform for enterprise. Built by data engineers for data engineers. Complete context engineering and LLM orchestration infrastructure. Run anywhere - local, cloud, or bare metal.

Python 591
8 hours ago
a-maliarov

Pure Python, lightweight, Pillow-based solver for Amazon's text captcha.

Python 482
1 month ago
py-pdf
Python 311
3 months ago
ScrapeGraphAI

🤖 AI-powered web scraping editor with visual workflow builder. Build, test & deploy web scrapers using natural language. Powered by ScrapeGraphAI & LangGraph.

Python 176
1 month ago
molybdenum-99

Wikipedia information extraction library

Ruby 175
2 years ago
dilawar
Python 150
1 year ago