SingleFile 是一个用来保存完整HTML网页的浏览器插件,只会生成一个文件,图片不会丢失。支持Chrome, Firefox,Edge,Vivaldi, Brave, Waterfox, Yandex,Opera等浏览器
#网络爬虫#Crawlee - 一个用于Node.js 开发的网页爬虫和浏览器自动化库
Chrome extension that records your browser interactions and generates a Playwright or Puppeteer script.
The AI Browser Automation Framework
Docker 中运行chrome ,实现浏览器headless自动化任务
Proxy server to bypass Cloudflare protection
A developer-friendly API for converting numerous document formats into PDF files, and more!
Lightpanda: the headless browser designed for AI and automation
Web page PDF/PNG rendering done right. Self-hosted service for rendering receipts, invoices, or any content.
💯 Teach puppeteer new tricks through plugins.
Venom is a high-performance system developed with JavaScript to create a bot for WhatsApp, support for creating any interaction, such as customer service, media sending, sentence recognition based on ...
A Headless Chrome rendering solution
#网络爬虫#Turn any webpage into structured data using LLMs
A command-line tool to turn web pages into readable PDF, EPUB, HTML, or Markdown docs.
Pydoll is a library for automating chromium-based browsers without a WebDriver, offering realistic interactions.
#网络爬虫#Analysis of Bot Protection systems with available countermeasures 🚿. How to defeat anti-bot system 👻 and get around browser fingerprinting scripts 🕵️♂️ when scraping the web?
Scan your entire site with Google Lighthouse in 2 minutes (on average). Open source, fully configurable with minimal setup.
Headless chrome/chromium automation library (unofficial port of puppeteer)