GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

scraper

Website
Wikipedia
https://static.github-zh.com/github_avatars/huginn?size=40
huginn / huginn

#自动化#你的代理人,随时待命。Huginn 是一个用于构建自动化任务的web平台。

自动化notificationsscraperwebscrapingfeedgeneratorRSSagent监控feedtwitter-streaminghuginnX (Twitter)
Ruby 46.45 k
3 天前
https://static.github-zh.com/github_avatars/mendableai?size=40
mendableai / firecrawl

#网络爬虫#Firecrawl 是一种 API 服务,它爬取URL并将其转换为清洗过的 markdown 或结构化数据

人工智能爬虫dataMarkdownscraperhtml-to-markdown大语言模型ragscrapingweb-crawlerai-scrapingwebscraping
TypeScript 39.99 k
2 天前
https://static.github-zh.com/github_avatars/NaiboWang?size=40
NaiboWang / EasySpider

#前端开发#A visual no-code/code-free web crawler/spider易采集:一个可视化浏览器自动化测试/数据采集/爬虫软件,可以无代码图形化的设计和执行爬虫任务。别名:ServiceWrapper面向Web应用的智能化服务封装系统。

code-free爬虫GUIlaymanspiderparametersWebinput-parameters前端HTMLbatch-processingbatch-scriptvisual可视化visualprogrammingscraperdata-collectionrpaRobotics
JavaScript 39.06 k
21 天前
https://static.github-zh.com/github_avatars/iawia002?size=40
iawia002 / lux

#网络爬虫#一个Go语言开发命令行视频下载工具

下载器Go爬虫scraperVideo哔哩哔哩YouTubeyoukuiqiyitumblrqqdownload
Go 29.73 k
1 个月前
https://static.github-zh.com/github_avatars/cheeriojs?size=40
cheeriojs / cheerio

#网络爬虫#一个运行在服务端的 jQuery 实现,用于解析和操作 HTML 及 XML

cheeriojQueryhtmlparser2Document Object Model (DOM)htmlparserselectorscraperParserHTMLHacktoberfest
TypeScript 29.52 k
3 天前
https://static.github-zh.com/github_avatars/feder-cr?size=40
feder-cr / Jobs_Applier_AI_Agent_AIHawk

#网络爬虫#AIHawk aims to easy job hunt process by automating the job application process. Utilizing artificial intelligence, it enables users to apply for multiple jobs in a tailored way.

自动化BotChatGPTgptjobjobsearchjobseekeropeaiPythonresumescraperscrapingapplication-resumeSeleniumChromehuman-resourcesjobsagent人工智能
Python 28.31 k
18 天前
https://static.github-zh.com/github_avatars/gocolly?size=40
gocolly / colly

#爬虫框架#一个快速优雅的Golang爬虫框架

Goscraper框架爬虫scrapingcrawlingspider
Go 24.32 k
5 天前
apify/crawlee
https://static.github-zh.com/github_avatars/apify?size=40
apify / crawlee

#网络爬虫#Crawlee - 一个用于Node.js 开发的网页爬虫和浏览器自动化库

web-scrapingweb-crawlingnpmheadless-chromePuppeteer自动化apifyscrapingcrawling爬虫headlessscraperweb-crawlerJavaScriptNode.jsPlaywrightTypeScript
TypeScript 17.92 k
2 天前
https://static.github-zh.com/github_avatars/codelucas?size=40
codelucas / newspaper

#网络爬虫#一个Python数据采集框架,能自动提取新闻、文章的标题、关键词、作者、摘要、正文等元数据

Pythonnews爬虫crawlingscrapernews-aggregator
HTML 14.61 k
3 个月前
Evil0ctal/Douyin_TikTok_Download_API
https://static.github-zh.com/github_avatars/Evil0ctal?size=40
Evil0ctal / Douyin_TikTok_Download_API

#网络爬虫#🚀「Douyin_TikTok_Download_API」是一个开箱即用的高性能异步抖音、快手、TikTok、Bilibili数据爬取工具,支持API调用,在线批量解析及下载。

PythonpywebioTikTokdouyinAPIscraperFastAPIno-watermarkonline-parsingasyncdouyin-tiktok-apidouyin-tiktok-download爬虫spiderweb-scrapingtiktok-scraperdouyin-scraperdouyin-apitiktok-apitiktok-signature
Python 13.03 k
3 个月前
https://static.github-zh.com/github_avatars/getmaxun?size=40
getmaxun / maxun

#网络爬虫#一个可视化,通过鼠标点击完成数据采集的爬虫平台

自动化无代码scraperweb-automationweb-scraperweb-scrapingAPIbrowserbrowser-automationPlaywright自托管website-to-apirobotic-process-automationrpano-code-web-scraperagentsweb-agentdata-extractionweb-scraping-agentwebscraping
TypeScript 13.03 k
2 天前
https://static.github-zh.com/github_avatars/pwxcoo?size=40
pwxcoo / chinese-xinhua

#网络爬虫#📙 中华新华字典数据库。包括歇后语,成语,词语,汉字。

datascraperchinese-traditionalPython中文chinese-characterschinese-nlpchinese-languagechinese-simplifiedjson-datasetJSONjson-data
Python 11.23 k
1 年前
https://static.github-zh.com/github_avatars/guyueyingmu?size=40
guyueyingmu / avbook

#网络爬虫#AV 电影管理系统, avmoo , javbus , javlibrary 爬虫,线上 AV 影片图书馆,AV 磁力链接数据库,Japanese Adult Video Library,Adult Video Magnet Links - Japanese Adult Video Database

javbusavmoojavlibraryspider爬虫Laravelscraperadultmagnet-linkmagnet数据库adult-videoguzzlehttp
PHP 9.71 k
1 年前
https://static.github-zh.com/github_avatars/TeamWiseFlow?size=40
TeamWiseFlow / wiseflow

#网络爬虫#Use LLMs to dig out what you care about from massive amounts of information and a variety of sources daily.

爬虫information-gathering大语言模型scraper
Python 7.56 k
10 小时前
https://static.github-zh.com/github_avatars/arc298?size=40
arc298 / instagram-scraper

#网络爬虫#instagram-scraper 是一个Python开发instagram爬虫,用于爬取 instagram 用户的图片和照片

Instagraminstagram-scraperinstagram-user-photosPythonscraperinstagram-clientinstagram-api
Python 6.96 k
3 年前
https://static.github-zh.com/github_avatars/BruceDone?size=40
BruceDone / awesome-crawler

#网络爬虫#A collection of awesome web crawler,spider in different languages

web-crawler爬虫web-scraperspiderscraperAwesome Lists
6.8 k
1 年前
alirezamika/autoscraper
https://static.github-zh.com/github_avatars/alirezamika?size=40
alirezamika / autoscraper

#网络爬虫#A Smart, Automatic, Fast and Lightweight Web Scraper for Python

scrapingscraperscrapewebscraping爬虫web-scraping人工智能Pythonwebautomation自动化机器学习
Python 6.79 k
6 天前
https://static.github-zh.com/github_avatars/go-rod?size=40
go-rod / rod

#网络爬虫#Rod 是一个直接基于 DevTools Protocol 高级驱动程序。 它是为网页自动化和爬虫而设计的,既可用于高级应用开发也可用于低级应用开发,高级开发人员可以使用低级包和函数来轻松地定制或建立他们自己的Rod版本,高级函数只是建立Rod默认版本的例子。

cdpchrome-headlesschrome-devtoolschrome-devtools-protocolheadlessweb-scraping自动化scraperdevtoolsdevtools-protocolrodGoTestingWebgorodcrawling
Go 5.98 k
6 个月前
MontFerret/ferret
https://static.github-zh.com/github_avatars/MontFerret?size=40
MontFerret / ferret

#网络爬虫#Declarative web scraping

Goquery-languagedata-miningscrapingscraping-websitesdslcdpcrawlingscraper爬虫Chrome命令行界面工具Library
Go 5.82 k
4 天前
https://static.github-zh.com/github_avatars/apify?size=40
apify / crawlee-python

#网络爬虫#Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works...

apify自动化beautifulsoup爬虫crawlingheadlessheadless-chromepipPlaywrightPythonscraperscrapingweb-crawlerweb-crawlingweb-scrapingHacktoberfest
Python 5.73 k
3 天前
loading...