GitHub Chinese Community
#extract

YaoFANGUK / video-subtitle-extractor

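#Computer science# Extracts hard-coded subtitles from videos and generates srt files. No third-party API is required; text recognition runs locally. A deep-learning-based video subtitle extraction framework that covers subtitle region detection and subtitle content extraction. A GUI tool for extracting hard-coded subtitle (hardsub) from videos and generating srt files.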

Deep learning, OCR, subtitles, srt, hardsub, extract, ripper, subrip
Python 7.4 k
1 month ago
scinfu / SwiftSoup

SwiftSoup: Pure Swift HTML Parser, with best of DOM, CSS, and jquery (Supports Linux, iOS, Mac, tvOS, watchOS)

Swift, swiftsoup, Parsing, html-document, Document Object Model (DOM), extract, selector, HTML
Swift 4.84 k
4 days ago
mholt / archiver

DEPRECATED. Please use mholt/archives instead.

tar, extract, Zip, gzip, xz, Go, rar, lz4, bzip2, archives, snappy, zstandard, brotli, compression, decompression, streaming, streams, 7zip
Go 4.41 k
7 months ago
torakiki / pdfsam

PDFsam, a desktop application to split, merge, mix, rotate PDF files and extract pages

pdf-extractor, extract, split, JavaFX, Java, merge, splitter, combine, rotate, pdf, pdf-manipulation
Java 3.81 k
7 days ago
dlt-hub / dlt

data load tool (dlt) is an open source Python library that makes data loading easy 🛠️

data, Python, data-engineering, data-lake, data-loading, data-warehouse, elt, extract, load, transform
Python 3.72 k
4 days ago
atlanhq / camelot

Camelot: PDF Table Extraction for Humans

pdf, table, extract, for-humans
Python 3.68 k
2 years ago
Wisser / Jailer

#Front-end development# Database Subsetting and Relational Data Browsing Tool.

Database, subsetting, Front end, export, extract, SQL, jdbc, Testing, Java, subsetter, db2, GUI, sql-server, MySQL, Oracle Database, PostgreSQL, redshift
Java 3.01 k
10 days ago
mafaca / UtinyRipper

GUI and API library to work with Engine assets, serialized and bundle files

Unity, asset, bundle, assetbundle, Hackathon-Kit, unpack, ripper, extract, source, Project, debug, viewer
C# 2.96 k
3 years ago
CatchTheTornado / text-extract-api

#Large language models# Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents. Remove PII. Convert any document or picture to structured JSO...

API, extract, JSON, Large language models, pdf, anonymization, OCR, ocr-python, pii
Python 2.61 k
2 months ago
retroplasma / earth-reverse-engineering

Reversing Google's 3D satellite mode

google-earth, Reverse engineering, Google Maps, extract, Geographic Information System, 3D, client, exporter
C 2.27 k
4 years ago
DonJayamanne / pythonVSCode

#Editor# This extension is now maintained in the Microsoft fork.

Python, TypeScript, Visual Studio Code, editor, Terminal, python-terminal, intellisense, refactorings, autopep8, pylint, Testing, extract, Jupyter Notebook, scientific, linter
TypeScript 2.1 k
3 months ago
dompdf / php-font-lib

A library to read, parse, export and make subsets of different types of font files.

Fonts, font-files, PHP, extract, truetype, woff, ttf
PHP 1.78 k
6 months ago
extractus / article-extractor

#Web crawler# To extract main article from given URL with Node.js

Node.js, article-parser, readability, article, article-extractor, Crawler, extract, scraper
JavaScript 1.71 k
1 month ago
camelot-dev / excalibur

A web interface to extract tabular data from PDFs

pdf, table, extract, for-humans
Python 1.68 k
5 months ago
j4k0xb / webcrack

Deobfuscate obfuscator.io, unminify and unpack bundled javascript

Parsing, bundle, extract, Reverse engineering, unpack, Webpack, deobfuscation, deobfuscator, JavaScript, browserify, javascript-obfuscator
TypeScript 1.61 k
1 month ago
JonathanLink / PDFLayoutTextStripper

Converts a pdf file into a text file while keeping the layout of the original pdf. Useful to extract the content from a table in a pdf file for instance. This is a subclass of PDFTextStripper class (f...

layout, text, Java, pdf, extract, data-extraction, pdfbox
Java 1.59 k
1 year ago
activescott / lessmsi

A tool to view and extract the contents of a Windows Installer (.msi) file.

msi, chocolatey, install, C#, install-script, Windows, extract
C# 1.47 k
12 days ago
wix-incubator / vscode-glean

The extension provides refactoring tools for your React codebase

Visual Studio Code, refactoring, VS Code Extension, React, extract, JSX (JavaScript XML), clean-code
TypeScript 1.47 k
2 years ago
kevva / download

#Downloader# Download and extract files

HTTP, Promise, async, download, extract, stream, Node.js
JavaScript 1.3 k
2 years ago
laktak / extrakto

extrakto for tmux - quickly select, copy/insert/complete text without a mouse

tmux, extract, complete, autocomplete, copy-paste, clipboard, completion
Python 968
6 months ago