#Automation# Puppeteer: a Node.js library for controlling Chrome or Chromium. It can be used for automated testing, data scraping, and similar tasks.
A high-level browser automation library.
#Web crawler# Crawlee - a web scraping and browser automation library for Node.js
🖥 Chrome automation made simple. Runs locally or headless on AWS Lambda.
Web page PDF/PNG rendering done right. Self-hosted service for rendering receipts, invoices, or any content.
💯 Teach puppeteer new tricks through plugins.
A Headless Chrome rendering solution
#Web crawler# Crawlee - A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works...
A node.js library for testing modern web applications
Headless chrome/chromium automation library (unofficial port of puppeteer)
Puppeteer Pool, run a cluster of instances in parallel
Puppeteer example scripts for running Headless Chrome from Node.
#Web crawler# A powerful browser crawler for web vulnerability scanners
🌐 Run headless Chrome/Chromium on AWS Lambda
Playwright for Go: a browser automation library to control Chromium, Firefox and WebKit with a single API.
🤖 A Node queue API for generating PDFs using headless Chrome. Comes with a CLI, S3 storage and webhooks for notifying subscribers about generated PDFs
#Web crawler# A curated list of awesome puppeteer resources.
Headless Chromium-based web performance metrics collector and monitoring tool
Run Lighthouse in CI, as a web service, using Docker. Pass/Fail GH pull requests.