GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

ai-scraping

Website
Wikipedia
https://static.github-zh.com/github_avatars/mendableai?size=40
mendableai / firecrawl

#网络爬虫#Firecrawl 是一种 API 服务,它爬取URL并将其转换为清洗过的 markdown 或结构化数据

人工智能爬虫dataMarkdownscraperhtml-to-markdown大语言模型ragscrapingweb-crawlerai-scrapingwebscraping
TypeScript 39.99 k
2 天前
https://static.github-zh.com/github_avatars/ScrapeGraphAI?size=40
ScrapeGraphAI / Scrapegraph-ai

#网络爬虫#Python scraper based on AI

scrapingscraping-pythonautomated-scraper大语言模型人工智能web-crawlerweb-scrapingai-scraping爬虫html-to-markdownMarkdownrag
Python 20 k
2 天前
D4Vinci/Scrapling
https://static.github-zh.com/github_avatars/D4Vinci?size=40
D4Vinci / Scrapling

#网络爬虫#🕷️ An undetectable, powerful, flexible, high-performance Python library to make Web Scraping Easy and Effortless as it should be!

爬虫crawlingHacktoberfestPlaywrightPythonscrapingselectorsstealth-gameweb-scraperweb-scrapingweb-scraping-pythonwebscrapingxpath自动化人工智能ai-scrapingdatadata-extraction
Python 5.4 k
15 天前
https://static.github-zh.com/github_avatars/itsOwen?size=40
itsOwen / CyberScraper-2077

#网络爬虫#A Powerful web scraper powered by LLM | OpenAI, Gemini & Ollama

ai-scraping大语言模型openaiscraperwebscrapinggemini-apiweb-scraper
Python 1.71 k
2 天前
https://static.github-zh.com/github_avatars/raznem?size=40
raznem / parsera

#网络爬虫#Lightweight library for scraping web-sites with LLMs

data-extraction大语言模型scrapingPythonOpen Sourcewebscraping人工智能ai-scrapingPlaywright
Python 1.11 k
13 天前
https://static.github-zh.com/github_avatars/mendableai?size=40
mendableai / firecrawl-app-examples

#大语言模型#🔥 This repository contains complete application examples, including websites and other projects, developed using Firecrawl.

人工智能ai-scrapingdataExamplehtml-to-markdown大语言模型Markdownragweb-crawlertemplates
Jupyter Notebook 409
13 天前
https://static.github-zh.com/github_avatars/ArchiveBox?size=40
ArchiveBox / abx-dl

#网络爬虫#⬇️ A simple all-in-one CLI tool to download EVERYTHING from a URL (like youtube-dl/yt-dlp, forum-dl, gallery-dl, simpler ArchiveBox). 🎭 Uses headless Chrome to get HTML, JS, CSS, images/video/audio/s...

ChromecrawlingcURL下载器headlessPlaywrightPuppeteerscrapingwgetyoutube-dlyt-dlpcli-tool命令行界面http-clientai-scraping
JavaScript 74
6 个月前
https://static.github-zh.com/github_avatars/WeebDataHoarder?size=40
WeebDataHoarder / go-away

[Mirror] Self-hosted abuse detection and rule enforcement against low-effort mass AI scraping and bots.

ai-scrapinghttp-proxy安全mirror
Go 47
6 天前
https://static.github-zh.com/github_avatars/kaymen99?size=40
kaymen99 / ai-web-scraper

#网络爬虫#AI web scraper built with Crawl4AI for extracting structured leads data from websites.

ai-agentsai-scrapingcrawl4ai大语言模型scraperweb-scraperweb-scraping
Python 30
4 个月前
https://static.github-zh.com/github_avatars/spider-rs?size=40
spider-rs / web-crawling-guides

#网络爬虫#How to guides on web-crawling or scraping

agentsai-agentsai-scraping爬虫html-to-markdownscraperweb-scraping
20
2 个月前
https://static.github-zh.com/github_avatars/spider-rs?size=40
spider-rs / spider-clients

#网络爬虫#Python, Javascript, and Rust libraries for the Spider Cloud API.

人工智能ai-agentsai-scraping爬虫html-to-markdownscraperspiderweb-scrapingSupabase
Python 15
7 天前
https://static.github-zh.com/github_avatars/any4ai?size=40
any4ai / AnyCrawl

#网络爬虫#AnyCrawl 🚀: A Node.js/TypeScript crawler that turns websites into LLM-ready data and extracts structured SERP results from Google/Bing/Baidu/etc. Native multi-threading for bulk processing.

aitoolscrawlscrapewebscraperai-scrapingdatahtml-to-markdownragscraping
TypeScript 14
7 天前
https://static.github-zh.com/github_avatars/L1shed?size=40
L1shed / Turbo

Fastest and cheapest distributed residential proxy network.

ai-scrapingweb-scrapingpayment-gatewayiaascollaborate
Go 7
5 天前
https://static.github-zh.com/github_avatars/drisskhattabi6?size=40
drisskhattabi6 / AI-Scraper

#网络爬虫#AI Scraper : scrap and extract data from website in any format (CSV, JSON, HTML...) using Selenium or Crawl4ai, and using Ollama or Sambanova API, and using Streamlit for UI as chatbot

ai-scrapingcrawl4ai爬虫crawlingollamaollama-apiopenrouterscraperscrapingSeleniumselenium-pythonStreamlitstreamlit-webapp
Python 2
24 天前
https://static.github-zh.com/github_avatars/nathabonfim59?size=40
nathabonfim59 / md-fetch

#网络爬虫#A CLI tool and REST API that converts web content to clean Markdown, bypassing anti-scraping measures using headless browsers. Perfect for AI/LLM applications

ai-scrapingGoscraper
Go 2
4 个月前
https://static.github-zh.com/github_avatars/HaroonBrokha?size=40
HaroonBrokha / Ai-Webpage-Analyzer-Api-Free

AI Webpage Analyzer** is a powerful API service that extracts only the visible text content from any given URL and analyzes it using the **Haroon AI API**. It intelligently removes hidden elements, sc...

人工智能ai-apiai-scraping免费REST API
1
4 个月前
https://static.github-zh.com/github_avatars/GitRectify?size=40
GitRectify / scrapegraph-ai

#网络爬虫#ScrapeGraphAI is a Python-based web-scraping framework that pairs large-language-model reasoning with a graph-style pipeline engine to turn websites (or local XML/HTML/JSON/Markdown files) into struct...

人工智能ai-scrapingautomated-scraper爬虫html-to-markdown大语言模型Markdownragscrapingscraping-pythonweb-crawlerweb-scraping
Python 1
11 天前
https://static.github-zh.com/github_avatars/luminati-io?size=40
luminati-io / llama-3-web-scraping

Use LLaMA 3 and Python to extract structured data from websites like Amazon, leveraging LLM-powered parsing for resilient, AI-driven web scraping.

ai-scrapingdata-collectionllama-3Pythonpython-scraperweb-scraping
0
2 个月前
https://static.github-zh.com/github_avatars/luminati-io?size=40
luminati-io / openai-sdk-with-web-unlocker

Integrating OpenAI Agents SDK with Bright Data Web Unlocker, enabling AI agents to access, extract, and process structured data from protected web pages

ai-agentai-scrapingweb-scraper
0
2 个月前
https://static.github-zh.com/github_avatars/skrapeai?size=40
skrapeai / examples

#大语言模型#This repository contains complete application examples, developed using Skrape.ai

人工智能html-to-markdown大语言模型Markdownai-scrapingragweb-crawler
0
5 个月前
loading...