GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

data-collection

Website
Wikipedia
https://static.github-zh.com/github_avatars/NaiboWang?size=40
NaiboWang / EasySpider

#前端开发#A visual no-code/code-free web crawler/spider易采集:一个可视化浏览器自动化测试/数据采集/爬虫软件,可以无代码图形化的设计和执行爬虫任务。别名:ServiceWrapper面向Web应用的智能化服务封装系统。

code-free爬虫GUIlaymanspiderparametersWebinput-parameters前端HTMLbatch-processingbatch-scriptvisual可视化visualprogrammingscraperdata-collectionrpaRobotics
JavaScript 42.44 k
22 天前
https://static.github-zh.com/github_avatars/airbytehq?size=40
airbytehq / airbyte

Airbyte 开源 EL(T) 平台,帮助用户将数据从应用程序,API 和数据库中同步到数据仓库

datapipeline数据分析data-engineeringJavaPythonetlchange-data-capturedata-collectiondata-integrationeltBigQueryredshiftsnowflakedata-pipelinesql-serverMySQLPostgreSQLs3自托管
Python 19.5 k
1 天前
snowplow/snowplow
https://static.github-zh.com/github_avatars/snowplow?size=40
snowplow / snowplow

The leader in Customer Data Infrastructure

analyticsdatadata-pipelinedata-collectionproduct-analytics
Scala 6.96 k
3 个月前
https://static.github-zh.com/github_avatars/cloudquery?size=40
cloudquery / cloudquery

一个高性能ELT 框架,powered by Apache Arrow

Amazon Web ServicesGoogle 云AzureSQLdata-integrationeltetletl-frameworkBigQuerydata-collectiondata-engineeringKubernetesdataairbyteGitHub API数据分析GoogleGocspmattack-surface-management
Go 6.2 k
3 天前
https://static.github-zh.com/github_avatars/firecrawl?size=40
firecrawl / firecrawl-mcp-server

🔥 Official Firecrawl MCP Server - Adds powerful web scraping and search to Cursor, Claude and any other LLM clients.

batch-processingclaudecontent-extractiondata-collectionfirecrawlfirecrawl-aillm-toolsmcp-servermodel-context-protocolsearch-apiweb-crawlerweb-scrapingjavascript-renderingmcp
JavaScript 4.5 k
2 天前
jitsucom/jitsu
https://static.github-zh.com/github_avatars/jitsucom?size=40
jitsucom / jitsu

Jitsu is an open-source Segment alternative. Fully-scriptable data ingestion engine for modern data teams. Set-up a real-time data pipeline in minutes, not days

data-integrationclickhouseGoBigQuerydata-collectionredshiftsnowflakePostgreSQL
TypeScript 4.4 k
2 天前
https://static.github-zh.com/github_avatars/pyper-dev?size=40
pyper-dev / pyper

Concurrent Python made simple

asyncioconcurrencyPythonthreadingdata-pipelinesdata-processingmultiprocessingparallel-computingdatadata-collectiondata-engineering
Python 1.47 k
7 个月前
https://static.github-zh.com/github_avatars/brightdata?size=40
brightdata / brightdata-mcp

#网络爬虫#A powerful Model Context Protocol (MCP) server that provides an all-in-one solution for public web access.

大语言模型mcpmodelcontextprotocolscrapingai-agentsbrowser-automationdata-collectiondata-extractionmcp-serverstructured-dataweb-crawlingweb-scraping
JavaScript 1.3 k
4 天前
https://static.github-zh.com/github_avatars/Decodo?size=40
Decodo / Decodo

#网络爬虫#HTTP(S)/SOCKS5 rotating residential proxies - code examples & general information.

proxyproxieshttps-proxyproxy-serverPythondata-collectionproxy-listpython-scraperscrapingweb-scrapingdata-gatheringhttp-proxyip-rotationsocks5-proxy
Java 1.15 k
3 个月前
https://static.github-zh.com/github_avatars/plan-player-analytics?size=40
plan-player-analytics / Plan

Player Analytics plugin for Minecraft Server platforms - View player activity of your server with ease. 📆

analytics统计可视化MySQLSQLitewebserverdata-collectionsponge-pluginspigot-pluginbungeecord-pluginbukkit-pluginHacktoberfestFabricMC
Java 955
3 天前
https://static.github-zh.com/github_avatars/getodk?size=40
getodk / collect

#安卓#ODK Collect is an Android app for filling out forms. It's been used to collect billions of data points in challenging environments around the world. Contribute and make the world a better place! ✨📋✨

odkglobal-developmentdata-collectionglobal-healthmhealthxformsAndroidJavasocial-impactmobile-data-collection
Kotlin 745
3 天前
https://static.github-zh.com/github_avatars/chaoss?size=40
chaoss / augur

Python library and web service for Open Source Software Health and Sustainability metrics & data collection. You can find our documentation and new contributor information easily here: https://oss-aug...

LinuxOpen SourceGitHub数据可视化facadeGit监控sustainabilityhealthpython-librarydata-collectiondata-modelingUnixresearchHacktoberfesthacktoberfest2020
Python 654
3 天前
https://static.github-zh.com/github_avatars/pnoker?size=40
pnoker / iot-dc3

IoT DC3 is a fully open-source distributed Internet of Things (IoT) platform built on Spring Cloud. It accelerates IoT project development and simplifies IoT device management, offering a comprehensiv...

Internet of thingsspring-clouddata-collectionmulti-protocolMQTTrtspJavaDockergatewayopc-uatcpsocketplcdcs远程过程调用 (RPC)modbuss7lwm2m
Java 597
2 天前
https://static.github-zh.com/github_avatars/zhaoyachao?size=40
zhaoyachao / zdh_web

大数据采集,抽取平台,zdh_web是zdh系列服务的可视化管理平台,包含数据采集,调度,权限,审批流,私域营销等模块

etlApache Sparkdata-collectionbigdatadatacollectionscheduler
Java 526
1 个月前
https://static.github-zh.com/github_avatars/chapmanjacobd?size=40
chapmanjacobd / library

99+ CLI tools to build, browse, and blend your media library

SQLitedatacurationmediaplaylistVideomusicdata-collectionFFmpegmpvyt-dlp命令行界面filesfolders
Python 441
4 天前
https://static.github-zh.com/github_avatars/K3V1991?size=40
K3V1991 / Disable-Firefox-Telemetry-and-Data-Collection

How to disable Firefox Telemetry and Data Collection

browserconfigurationdatadata-collectionFirefoxhow-tolistMozillamozilla-firefoxoptions隐私reporting安全Serversettingstelemetryblocking教程
413
1 年前
https://static.github-zh.com/github_avatars/ScriptSmith?size=40
ScriptSmith / reaper

#网络爬虫#Social media scraping / data collection tool for the Facebook, Twitter, Reddit, YouTube, Pinterest, and Tumblr APIs

scrapingFacebookX (Twitter)RedditYouTubepinteresttumblrAPIsocialmediadata-miningdata-collectionGUI
Python 389
7 年前
https://static.github-zh.com/github_avatars/graphlit?size=40
graphlit / graphlit-mcp-server

Model Context Protocol (MCP) Server for Graphlit Platform

claudecontent-extractiondata-collectionllm-toolsmcp-servermodel-context-protocolsearch-apiunstructured-dataweb-crawlerweb-scraping
TypeScript 357
14 天前
https://static.github-zh.com/github_avatars/elbwalker?size=40
elbwalker / walkerOS

Open source tag management and event data collection

data-collectionprivacy-by-designdata-pipelineanalyticscomponent-drivenevent-trackingweb-analyticsproduct-analyticsdata-integrationtagging
TypeScript 302
2 天前
https://static.github-zh.com/github_avatars/wq?size=40
wq / wq

📱🌐📋 wq: a modular framework supporting web / native apps for mobile surveys and geospatial data collection. Powered by Django REST Framework, Redux, React, and Material UI.

Pythoncrowdsourcingmobile-appoffline-firstdata-collectionCitizen sciencesurvey框架DjangoREST APIReactGeographic Information Systemoffline
JavaScript 259
6 个月前
loading...