GitHub Chinese Community
©2025 GitHub Chinese Community

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#webscraper

jaypyles / Scraperr

#Web crawler# Self-hosted webscraper.

Open Source, self-hosted, webscraper, Docker, helm, Kubernetes, Playwright, Python, scraping, web-scraper, web-scrapers, web-scraping, webscraping
TypeScript 3.34 k
6 days ago
anaskhan96 / soup

Web Scraper in Go, similar to BeautifulSoup

Go, webscraper, webscraping, beautifulsoup, web-scraper, html-node
Go 2.2 k
2 years ago
benibela / xidel

#Web crawler# Command line tool to download and extract data from HTML/XML pages or JSON-APIs, using CSS, XPath 3.0, XQuery 3.0, JSONiq or pattern matching. It can also create new or transformed XML/HTML/JSON docu...

xquery, XML, HTML, JSON, xpath, command-line interface, HTTP, Web, REST API, css-selector, wget, cURL, httpie, webscraper, webscraping, scraper, datascraping, data-processing
Pascal 806
4 months ago
scrapfly / scrapfly-scrapers

#Web crawler# Scalable Python web scraping scripts for 40+ popular domains

crawling, Python, crawler, scraping, web-scraping, web-scraping-python, antibot, automation, datascraping, proxies, python-scraper, scraper, scraping-python, spider, web-crawler, webscraper, webscraping
Python 533
5 days ago
rootVIII / proxy_requests

A class that uses scraped proxies to make HTTP GET/POST requests (Python requests)

Python, requests-module, requests, proxy, proxy-server, proxy-list, webscraping, webscraper, recursion, HTTP, http-proxy, python-requests
Python 390
5 years ago
salimk / Rcrawler

#Web crawler# An R web crawler and scraper

R, crawler, scraper, webcrawler, webscraping, webscraper, webscrapping, crawlers
R 355
3 years ago
onepointAI / onepoint

#Large language model# An AI assistant tool that integrates coding, writing, and reading functions. For better alternatives see https://monica.im/desktop

artificial intelligence, Electron, ChatGPT, all-in-one, macOS, toolkit, React, webscraper, Code, reading, gpt-35-turbo
TypeScript 314
2 years ago
toby-p / rightmove_webscraper.py

Python class to scrape data from rightmove.co.uk and return listings in a pandas DataFrame object

webscraper, pandas, pandas-dataframe, CSV, Python, data science, data analysis, data-mining
Python 271
1 year ago
serpapi / lego-ai-parser

#Web crawler# Lego AI Parser is an open-source application that uses OpenAI to parse visible text of HTML elements.

artificial intelligence, classification, datascience, gpt-3, HTML, machine learning, openai, Parser, Parsing, parser-library, Python, scraper, tools, Web app, webscraper, webscraping
Python 233
1 year ago
intergalacticalvariable / reader

#Web crawler# 📚 This is an adapted version of Jina AI's Reader for local deployment using Docker. Convert any URL to an LLM-friendly input with a simple prefix http://127.0.0.1:3000/https://website-to-scrape.com/

Docker, large language model, proxy, rag, scraper, self-hosted, webscraper, webscraping, website-screenshot, website-screenshot-capturer
TypeScript 215
8 months ago
mehmetozkaya / DotnetCrawler

#Web crawler# DotnetCrawler is a straightforward, lightweight web crawling/scraping library for Entity Framework Core output based on dotnet core. This library is designed like other strong crawler libraries like Web...

.NET, crawler, crawling, scraping, scrapy, entity-framework-core, ddd-architecture, C#, webcrawler, webscraping, webscraper, htmlagilitypack
C# 176
2 years ago
TBosak / mkfd

#Web crawler# RSS feed builder created with Bun🥖 and Hono🔥. Builds feeds from webpages, email folders, and REST API calls.

Bun, feed, Hono, RSS, TypeScript, contributors-welcome, help-wanted, rss-generator, scraper, self-hosted, webscraper, Docker, Dockerfile, dockerhub
TypeScript 170
24 days ago
MichaelYochpaz / iSubRip

#Web crawler# A Python command-line tool for scraping and downloading subtitles from AppleTV and iTunes movie pages.

Python, Script, itunes, appletv, m3u8, pypi-package, subtitles, webscraper, scraper
Python 162
2 days ago
AliAkhtari78 / SpotifyScraper

#Web crawler# Spotify scraper that extracts all the information from Spotify and downloads MP3s with the song's cover

webscraping, webscraper, spotify-downloader, spotify-scraping, scraper, crawler, Python, free
Makefile 157
7 days ago
bitsummation / pickaxe

SQL-based DSL web scraper/screen scraper

webscraper
C# 154
4 years ago
chuanenlin / shutterscrape

#Web crawler# Web scraper for Shutterstock

webscraper, scraper, chromedriver, Selenium, beautifulsoup, Python
Python 152
5 years ago
dwallach1 / Stocker

Financial Web Scraper & Sentiment Classifier

data science, sentiment-analysis, finance, investing, webscraper
Python 152
5 years ago
CuriousLearner / GeeksForGeeksScrapper

Scrapes GeeksforGeeks (g4g) and creates a PDF

scrapper, pdf, geeksforgeeks, webscraping, webscraper, Hacktoberfest
Python 147
5 years ago
JesseVent / crypto

#Blockchain# Cryptocurrency Historical Market Data R Package

coinmarketcap, cryptocurrencies, kaggle, crypto, dataset, cryptocurrency, webscraper, financial, Open Data, market-data
R 142
5 years ago
hedii / php-crawler

#Web crawler# A PHP crawler that finds emails on the internet

crawler, PHP, Laravel, Vue.js, webscraping, webscraper, webcrawler
PHP 135
4 years ago