GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

html-to-markdown

Website
Wikipedia
https://static.github-zh.com/github_avatars/mendableai?size=40
mendableai / firecrawl

#网络爬虫#Firecrawl 是一种 API 服务,它爬取URL并将其转换为清洗过的 markdown 或结构化数据

人工智能爬虫dataMarkdownscraperhtml-to-markdown大语言模型ragscrapingweb-crawlerai-scrapingwebscraping
TypeScript 39.99 k
2 天前
https://static.github-zh.com/github_avatars/ScrapeGraphAI?size=40
ScrapeGraphAI / Scrapegraph-ai

#网络爬虫#Python scraper based on AI

scrapingscraping-pythonautomated-scraper大语言模型人工智能web-crawlerweb-scrapingai-scraping爬虫html-to-markdownMarkdownrag
Python 20 k
2 天前
https://static.github-zh.com/github_avatars/mixmark-io?size=40
mixmark-io / turndown

一个 HTML 转 Markdown 的 JavaScript 库

JavaScriptHTMLMarkdownhtml-to-markdownbrowserNode.jscommonmarkgfm
HTML 9.85 k
1 年前
https://static.github-zh.com/github_avatars/adbar?size=40
adbar / trafilatura

#网络爬虫#Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML

web-scrapingtext-extraction自然语言处理text-mining爬虫text-preprocessingarticle-extractorreadabilityscrapinghtml-to-markdowncorpus-toolsrss-feednews-aggregatorrag大语言模型
Python 4.36 k
16 天前
JohannesKaufmann/html-to-markdown
https://static.github-zh.com/github_avatars/JohannesKaufmann?size=40
JohannesKaufmann / html-to-markdown

⚙️ Convert HTML to Markdown. Even works with entire websites and can be extended through rules.

GoHTMLhtml-to-markdownMarkdownconverter命令行界面
Go 2.88 k
6 天前
https://static.github-zh.com/github_avatars/vsch?size=40
vsch / flexmark-java

CommonMark/Markdown Java parser with source level AST. CommonMark 0.28, emulation of: pegdown, kramdown, markdown.pl, MultiMarkdown. With HTML to MD, MD to PDF, MD to DOCX conversion modules.

commonmarkJavamarkdown-parsermarkdown-processormarkdown-flavorsMarkdownmarkdown-to-htmlhtml-to-markdownmarkdown-to-pdf
Java 2.43 k
2 个月前
https://static.github-zh.com/github_avatars/helloworld-Co?size=40
helloworld-Co / html2md

helloworld 开发者社区开源的一个轻量级,强大的 html 一键转 md 工具,支持多平台文章一键转换,并保存下载到本地。

HTMLMarkdownVue.jsJavaScriptmarkdown-to-htmlcsdnjuejinNode.jsjsdomNuxt.jshtml-to-markdown
JavaScript 755
1 年前
https://static.github-zh.com/github_avatars/philschmid?size=40
philschmid / clipper.js

#自然语言处理#HTML to Markdown converter and crawler.

crawlhtml-to-markdownMarkdown自然语言处理retrieval-augmented-generationsearch
TypeScript 562
1 年前
https://static.github-zh.com/github_avatars/breakdance?size=40
breakdance / breakdance

It's time for your markup to get down! HTML to markdown converter. Breakdance is a highly pluggable, flexible and easy to use.

MarkdownHTMLconvertParsingcompilerenderhtml-to-markdownconvertermarkupgfmcommonmarkremarkablemarkedmarkdown-it
JavaScript 533
3 年前
https://static.github-zh.com/github_avatars/mendableai?size=40
mendableai / firecrawl-app-examples

#大语言模型#🔥 This repository contains complete application examples, including websites and other projects, developed using Firecrawl.

人工智能ai-scrapingdataExamplehtml-to-markdown大语言模型Markdownragweb-crawlertemplates
Jupyter Notebook 409
13 天前
https://static.github-zh.com/github_avatars/paulpierre?size=40
paulpierre / markdown-crawler

#大语言模型#A multithreaded 🕸️ web crawler that recursively crawls a website and creates a 🔽 markdown file for each page, designed for LLM RAG

html-to-markdown大语言模型llmopsMarkdownmarkdown-parserragweb-scraper
Python 388
10 个月前
https://static.github-zh.com/github_avatars/mrusme?size=40
mrusme / reader

reader is for your command line what the “readability” view is for modern browsers: A lightweight tool offering better readability of web pages (and EML files!) on the CLI.

命令行界面tuicommand-line-toolHTMLMarkdownhtml-to-markdownasciiascii-artreadabilityWeb终端terminal-basedreader
Go 351
1 个月前
https://static.github-zh.com/github_avatars/notlmn?size=40
notlmn / copy-as-markdown

📋 Browser extension to copy text as Markdown (with GFM and MathML support)

Markdownhtml-to-markdownbrowser-extensionChrome 插件Firefox 插件
JavaScript 345
6 天前
https://static.github-zh.com/github_avatars/inhumantsar?size=40
inhumantsar / slurp

Slurps webpages and saves them as clean, uncluttered Markdown. Think Pocket, but better.

html-to-markdownObsidian
TypeScript 225
6 个月前
https://static.github-zh.com/github_avatars/0x6b?size=40
0x6b / copy-selection-as-markdown

Firefox add-on to copy selection as Markdown

Firefox 插件Markdownhtml-to-markdown
JavaScript 200
7 天前
https://static.github-zh.com/github_avatars/web3gautam?size=40
web3gautam / medium-2-md

A CLI tool that converts exported Medium posts (html) to Jekyll/Hugo compatible markdown with front matter.

MediumMarkdownHTMLhtml-to-markdownJekyllHugofrontmatter
JavaScript 147
1 年前
https://static.github-zh.com/github_avatars/bevacqua?size=40
bevacqua / domador

😼 Dependency-free and lean DOM parser that outputs Markdown

Markdownhtml-to-markdown
JavaScript 87
3 年前
https://static.github-zh.com/github_avatars/inaridiy?size=40
inaridiy / webforai

#网络爬虫#The best HTML to Markdown library, A esm-native & Useful Utilities with simple, lightweight and epic quality.

article-extractorextractorreadabilityscrapingtext-mininghtml-to-markdown
TypeScript 66
2 个月前
https://static.github-zh.com/github_avatars/EvitanRelta?size=40
EvitanRelta / htmlarkdown

HTML-to-Markdown converter that adaptively preserves HTML when needed (eg. when center-aligning, or resizing images)

converterhtml-to-markdownTypeScriptcommonmarkgfmhtml-converterJavaScriptNode.js
TypeScript 65
2 年前
https://static.github-zh.com/github_avatars/tim-gromeyer?size=40
tim-gromeyer / html2md

Transform your HTML into clean, easy-to-read markdown with html2md.

html-to-markdownHTMLMarkdownC++cpp-libraryPython
C++ 62
1 个月前
loading...