GitHub Chinese Community
©2025 GitHub Chinese Community

#webscraping

huginn / huginn

#Automation# Your agents, standing by. Huginn is a web platform for building automated tasks.

automation, notifications, scraper, webscraping, feedgenerator, RSS, agent, monitoring, feed, twitter-streaming, huginn, X (Twitter)
Ruby 46.45 k
3 days ago
mendableai / firecrawl

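#Web crawler# Firecrawl is an API service that crawls a URL and converts it into clean markdown or structured data.

artificial intelligence, crawler, data, Markdown, scraper, html-to-markdown, large language model, rag, scraping, web-crawler, ai-scraping, webscraping
TypeScript 39.99 k
2 days ago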
assafelovic / gpt-researcher

LLM based autonomous agent that conducts deep local and web research on any topic and generates a long report with citations.

artificial intelligence, Python, agent, automation, research, search, webscraping, large language model, mcp, mcp-server
Python 21.87 k
2 days ago
getmaxun / maxun

#Web crawler# A visual scraping platform that collects data through point-and-click.

automation, no-code, scraper, web-automation, web-scraper, web-scraping, API, browser, browser-automation, Playwright, self-hosted, website-to-api, robotic-process-automation, rpa, no-code-web-scraper, agents, web-agent, data-extraction, web-scraping-agent, webscraping
TypeScript 13.03 k
2 days ago
pystardust / ani-cli

A CLI tool to browse and play anime

Shell, command-line interface, Anime, posix, steamdeck, Termux, webscraping, fzf, Linux, macOS, rofi, terminal, Windows
Shell 9.44 k
4 days ago
lorien / awesome-web-scraping

#Web crawler# List of libraries, tools and APIs for web scraping and data processing.

web-scraping, captcha-recaptcha, crawling, scraping, scraping-framework, scraping-python, scraping-tool, webscraping, crawler, spider
Makefile 7.04 k
6 months ago
alirezamika / autoscraper

#Web crawler# A Smart, Automatic, Fast and Lightweight Web Scraper for Python

scraping, scraper, scrape, webscraping, crawler, web-scraping, artificial intelligence, Python, webautomation, automation, machine learning
Python 6.79 k
6 days ago
D4Vinci / Scrapling

#Web crawler# 🕷️ An undetectable, powerful, flexible, high-performance Python library to make Web Scraping Easy and Effortless as it should be!

crawler, crawling, Hacktoberfest, Playwright, Python, scraping, selectors, stealth-game, web-scraper, web-scraping, web-scraping-python, webscraping, xpath, automation, artificial intelligence, ai-scraping, data, data-extraction
Python 5.4 k
15 days ago
autoscrape-labs / pydoll

Pydoll is a library for automating Chromium-based browsers without a WebDriver, offering realistic interactions.

asynchronous, bypass, captcha, captcha-breaking, cdp, Chromium, Playwright, Puppeteer, Python, Selenium, selenium-python, webscraping, webdriver, browser-automation, anti-detection, bot-detection
Python 4.31 k
5 days ago
niespodd / browser-fingerprinting

#Web crawler# Analysis of Bot Protection systems with available countermeasures 🚿. How to defeat anti-bot system 👻 and get around browser fingerprinting scripts 🕵️‍♂️ when scraping the web?

Bot, detection, Chromium, stealth-game, Puppeteer, scraper, webscraping, Web, automation, chromium-browser, bot-detection, chromedriver, fingerprinting, crawler, recaptcha, spider, browser-fingerprinting
JavaScript 4.31 k
1 year ago
jaypyles / Scraperr

#Web crawler# Self-hosted webscraper.

Open Source, self-hosted, webscraper, Docker, helm, Kubernetes, Playwright, Python, scraping, web-scraper, web-scrapers, web-scraping, webscraping
TypeScript 3.34 k
6 days ago
daijro / camoufox

#Web crawler# 🦊 Anti-detect browser

antidetect, antidetect-browser, fingerprint, Firefox, Playwright, webscraping, Network, scraping
C++ 2.33 k
3 months ago
scrapoxy / scrapoxy

Scrapoxy is a super proxies manager that orchestrates all your proxies into one place, rather than spreading management across multiple scrapers. It manages IP rotation and fingerprinting, and smartly...

antibot, proxies, webscraping
TypeScript 2.28 k
9 days ago
anaskhan96 / soup

Web Scraper in Go, similar to BeautifulSoup

Go, webscraper, webscraping, beautifulsoup, web-scraper, html-node
Go 2.2 k
2 years ago
itsOwen / CyberScraper-2077

#Web crawler# A powerful web scraper powered by LLM | OpenAI, Gemini & Ollama

ai-scraping, large language model, openai, scraper, webscraping, gemini-api, web-scraper
Python 1.71 k
2 days ago
reworkd / tarsier

Vision utilities for web interaction agents 👀

OCR, Playwright, Selenium, webscraping, pypi-package, gpt4v, large language model, Python
Jupyter Notebook 1.69 k
7 months ago
TheWebScrapingClub / webscraping-from-0-to-hero

The web scraping open project repository aims to share knowledge and experiences about web scraping with Python

Playwright, Python, scrapy, webscraping
1.64 k
1 year ago
jamesturk / scrapeghost

👻 Experimental library for scraping websites using OpenAI's GPT API.

gpt, webscraping, openai-api
Python 1.44 k
8 months ago
requests-cache / requests-cache

Persistent HTTP cache for Python requests

cache, dynamodb, HTTP, MongoDB, performance, Redis, requests, SQLite, Web, webscraping
Python 1.43 k
8 days ago
m8sec / CrossLinked

LinkedIn enumeration tool to extract valid employee names from an organization through search engine scraping

webscraping, Python, OSINT, enumeration, pentest-tool, pentest-scripts
Python 1.39 k
7 months ago