GitHub Chinese Community
#ai-scraping

firecrawl / firecrawl

#Web crawler# Firecrawl is an API service that crawls a URL and converts it into clean Markdown or structured data.

AI, crawler, data, Markdown, scraper, html-to-markdown, LLM, rag, scraping, web-crawler, ai-scraping, webscraping
TypeScript 57.2 k
3 hours ago
ScrapeGraphAI / Scrapegraph-ai

#Web crawler# Python scraper based on AI

scraping, scraping-python, automated-scraper, LLM, AI, web-crawler, web-scraping, ai-scraping, crawler, html-to-markdown, Markdown, rag
Python 21.29 k
1 month ago
D4Vinci / Scrapling

#Web crawler# 🕷️ An undetectable, powerful, flexible, high-performance Python library that makes web scraping as easy and effortless as it should be!

crawler, crawling, crawling-python, Playwright, Python, scraping, selectors, stealth-game, web-scraper, web-scraping, web-scraping-python, webscraping, xpath, automation, AI, ai-scraping, data, data-extraction, mcp, mcp-server
Python 7.29 k
1 hour ago
any4ai / AnyCrawl

#Web crawler# AnyCrawl 🚀: A Node.js/TypeScript crawler that turns websites into LLM-ready data and extracts structured SERP results from Google/Bing/Baidu/etc. Native multi-threading for bulk processing.

aitools, crawl, scrape, webscraper, ai-scraping, data, html-to-markdown, rag, scraping
TypeScript 2.14 k
9 hours ago
itsOwen / CyberScraper-2077

#Web crawler# A powerful web scraper powered by LLMs | OpenAI, Gemini & Ollama

ai-scraping, LLM, openai, scraper, webscraping, gemini-api, web-scraper
Python 1.77 k
1 month ago
raznem / parsera

#Web crawler# Lightweight library for scraping websites with LLMs

data-extraction, LLM, scraping, Python, Open Source, webscraping, AI, ai-scraping, Playwright
Python 1.22 k
20 days ago
firecrawl / firecrawl-app-examples

#Large language model# 🔥 This repository contains complete application examples, including websites and other projects, developed using Firecrawl.

AI, ai-scraping, data, Example, html-to-markdown, LLM, Markdown, rag, web-crawler, templates
Jupyter Notebook 543
3 months ago
ArchiveBox / abx-dl

#Web crawler# ⬇️ A simple all-in-one CLI tool to download EVERYTHING from a URL (like youtube-dl/yt-dlp, forum-dl, gallery-dl, simpler ArchiveBox). 🎭 Uses headless Chrome to get HTML, JS, CSS, images/video/audio/s...

Chrome, crawling, cURL, downloader, headless, Playwright, Puppeteer, scraping, wget, youtube-dl, yt-dlp, cli-tool, command-line interface, http-client, ai-scraping
JavaScript 82
25 days ago
WeebDataHoarder / go-away

[Mirror] Self-hosted abuse detection and rule enforcement against low-effort mass AI scraping and bots.

ai-scraping, http-proxy, security, mirror
Go 76
10 days ago
kaymen99 / ai-web-scraper

#Web crawler# AI web scraper built with Crawl4AI for extracting structured lead data from websites.

ai-agents, ai-scraping, crawl4ai, LLM, scraper, web-scraper, web-scraping
Python 46
7 months ago
spider-rs / web-crawling-guides

#Web crawler# How-to guides on web crawling or scraping

agents, ai-agents, ai-scraping, crawler, html-to-markdown, scraper, web-scraping
23
5 months ago
spider-rs / spider-clients

#Web crawler# Python, JavaScript, and Rust libraries for the Spider Cloud API.

AI, ai-agents, ai-scraping, crawler, html-to-markdown, scraper, spider, web-scraping, Supabase
Python 19
17 days ago
Chakszzz / NB-Scraper

#Web crawler# All scraper resources available here! Give us stars 🌟

ai-scraping, facebook-scraper, scraper, Open Source, youtube-downloader, ytdl
TypeScript 15
2 months ago
L1shed / Turbo

Fastest and cheapest distributed residential proxy network.

ai-scraping, web-scraping, payment-gateway, iaas, collaborate
TypeScript 9
14 days ago
kaymen99 / google-maps-lead-generator

Extract Google Maps business leads and enrich contact details using AI & web scraping

ai-agents, ai-scraping, Google Maps, google-maps-api, web-scraping
Python 5
3 months ago
oxylabs / oxylabs-ai-studio-py

Oxylabs AI Studio Python SDK

ai-scraping, ai-search, ai-tools, web-scraping, web-scraping-python
Python 4
1 month ago
GitRectify / scrapegraph-ai

#Web crawler# ScrapeGraphAI is a Python-based web-scraping framework that pairs large-language-model reasoning with a graph-style pipeline engine to turn websites (or local XML/HTML/JSON/Markdown files) into struct...

AI, ai-scraping, automated-scraper, crawler, html-to-markdown, LLM, Markdown, rag, scraping, scraping-python, web-crawler, web-scraping
Python 4
3 months ago
drisskhattabi6 / AI-Scraper

#Web crawler# AI Scraper: scrape and extract data from websites in any format (CSV, JSON, HTML...) using Selenium or Crawl4ai, with Ollama or the Sambanova API, and a Streamlit chatbot UI

ai-scraping, crawl4ai, crawler, crawling, ollama, ollama-api, openrouter, openrouter-api, scraper, scraping, Selenium, selenium-python, Streamlit, streamlit-webapp
Python 3
4 months ago
nathabonfim59 / md-fetch

#Web crawler# A CLI tool and REST API that converts web content to clean Markdown, bypassing anti-scraping measures using headless browsers. Perfect for AI/LLM applications

ai-scraping, Go, scraper
Go 3
7 months ago
vonuyvicoo / crava

#Large language model# AI-powered web scraper using JavaScript/TypeScript.

ai-scraping, LLM, webscraping
TypeScript 2
3 months ago