GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

scraping-websites

Website
Wikipedia
MontFerret/ferret
https://static.github-zh.com/github_avatars/MontFerret?size=40
MontFerret / ferret

#网络爬虫#Declarative web scraping

Goquery-languagedata-miningscrapingscraping-websitesdslcdpcrawlingscraper爬虫Chrome命令行界面工具Library
Go 5.82 k
4 天前
https://static.github-zh.com/github_avatars/Anorov?size=40
Anorov / cloudflare-scrape

A Python module to bypass Cloudflare's anti-bot page.

Cloudflareanti-bot-pagescrapescraping-websites
Python 3.47 k
2 年前
https://static.github-zh.com/github_avatars/elixir-crawly?size=40
elixir-crawly / crawly

#网络爬虫#Crawly, a high-level web crawling & scraping framework for Elixir.

ElixirErlangscraperscrapingscraping-websitesextract-dataspider爬虫crawling
Elixir 1.03 k
9 个月前
https://static.github-zh.com/github_avatars/gildas-lormeau?size=40
gildas-lormeau / single-file-cli

#网络爬虫#CLI tool for saving a faithful copy of a complete web page in a single HTML file (based on SingleFile)

命令行界面Node.jssingle-fileweb-archivingweb-scraperweb-scrapingarchivingscraping-websites爬虫web-crawlerDenoDockerfile
JavaScript 849
13 天前
https://static.github-zh.com/github_avatars/AmmeySaini?size=40
AmmeySaini / Edu-Mail-Generator

#网络爬虫#Generate Free Edu Mail(s) within minutes

SeleniumPythonscrapingscraping-websitesmailselenium-python
Python 825
3 年前
https://static.github-zh.com/github_avatars/Python-World?size=40
Python-World / Python_and_the_Web

Build Bots, Scrape a website or use an API to solve a problem.

Pythonscraping-websitesAPIfunBotHacktoberfest
Python 690
2 年前
https://static.github-zh.com/github_avatars/slotix?size=40
slotix / dataflowkit

#网络爬虫#Extract structured data from web sites. Web sites scraping.

Gogolang-libraryextract-datascraping-websitescrawlingscraperscrapingcdpheadless
Go 686
2 年前
https://static.github-zh.com/github_avatars/josephlimtech?size=40
josephlimtech / linkedin-profile-scraper-api

#网络爬虫#🕵️‍♂️ LinkedIn profile scraper returning structured profile data in JSON.

PuppeteerNode.jsscraperscrapingscraping-websiteswebsite-scraperJSONlinkedin爬虫crawlingspiderExpress
TypeScript 646
1 年前
https://static.github-zh.com/github_avatars/KTZgraph?size=40
KTZgraph / sarenka

OSINT tool - gets data from services like shodan, censys etc. in one app

DjangoReactosint-pythonreconnaissanceOSINTdjango-rest-frameworkshodan-apicwescraping-websitesreact-reduxDockerPythonCommon Vulnerabilities and Exposures (CVE)cve-search
Python 644
2 年前
https://static.github-zh.com/github_avatars/spekulatius?size=40
spekulatius / PHPScraper

#网络爬虫#A universal web-util for PHP.

PHPscraping-websitesscraperscrapingweb-scraperweb-scrapingbeautifulsoupscrapyPuppeteerpyppeteerChromiumheadless-chrome
PHP 565
1 年前
https://static.github-zh.com/github_avatars/avidLearnerInProgress?size=40
avidLearnerInProgress / python-automation-scripts

#网络爬虫#Simple yet powerful automation stuffs.

comic-downloaderinstagram-scraperscraping-websites爬虫quoraImageSeleniumbeautifulsoupInstagrampdf-converterpdfimdb-webscrapping
Python 548
4 年前
https://static.github-zh.com/github_avatars/oxylabs?size=40
oxylabs / quick-start-guide

#网络爬虫#Python quick start guides to get the most out of Oxylabs' Web Scraper API free trial.

scraperweb-scraperscrapingscraping-websitesweb-scraping
521
2 个月前
https://static.github-zh.com/github_avatars/baptisteArno?size=40
baptisteArno / tinking

#网络爬虫#🧶 Extract data from any website without code, just clicks.

scrapperPuppeteerscrapingscraping-websitesscrapping
TypeScript 423
4 年前
https://static.github-zh.com/github_avatars/unixfox?size=40
unixfox / pupflare

A webpage proxy that request through Chromium (puppeteer) - can be used to bypass Cloudflare anti bot / anti ddos on any application (like curl)

cloudflare-bypassPuppeteerKoaproxyDockerCloudflareanti-bot-pagescrapescraping-websitescloudflare-scrapeChromium
JavaScript 396
7 天前
https://static.github-zh.com/github_avatars/crwlrsoft?size=40
crwlrsoft / crawler

#网络爬虫#Library for Rapid (Web) Crawler and Scraper Development

crawlingPHPscraperscrapingscraping-websitesweb-crawlerweb-crawlingweb-scrapingHacktoberfest爬虫web-scraper
PHP 364
5 天前
https://static.github-zh.com/github_avatars/lkuffo?size=40
lkuffo / web-scraping

#网络爬虫#Más de 50 ejemplos de web scraping utilizando: Requests | Scrapy | Selenium | LXML | BeautifulSoup

scrapingscraping-pythonscraping-websiteswebscrapingSeleniumselenium-pythonscrapybeautifulsoup4beautifulsoupweb-scraping
Python 362
8 个月前
https://static.github-zh.com/github_avatars/Go-phie?size=40
Go-phie / gophie

An Aggregator Engine for searching and downloading movies free - NO ADs!

命令行界面scraping-websitesweb-apino-ads下载器download-videos免费moviesfree-softwareanime-downloaderstream
Go 319
2 年前
https://static.github-zh.com/github_avatars/kennethreitz?size=40
kennethreitz / requests-html

#网络爬虫#Pythonic HTML Parsing for Humans™

requestsHTMLscrapingscraping-websitesscraping-framework
Python 311
1 年前
https://static.github-zh.com/github_avatars/driscoll42?size=40
driscoll42 / ebayMarketAnalyzer

Scrape all eBay sold listings to determine average/median pricing, plot listings over time with trend lines, and extract to excel

ebayscraping-websitesPythonwebscraping
Python 224
3 年前
https://static.github-zh.com/github_avatars/m92vyas?size=40
m92vyas / llm-reader

#网络爬虫#Turn Webpage to LLM friendly input text. Similar to Firecrawl and Jina Reader API. Makes RAG, AI web scraping, image & webpage links extraction easy.

extract-data大语言模型llm-agentscraperscrapingscraping-websiteswebscrapingai-agent-toolsai-agentsfirecrawlrag
Python 196
16 天前
loading...