GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

data-collection

Website
Wikipedia
https://static.github-zh.com/github_avatars/NaiboWang?size=40
NaiboWang / EasySpider

#前端开发#A visual no-code/code-free web crawler/spider易采集:一个可视化浏览器自动化测试/数据采集/爬虫软件,可以无代码图形化的设计和执行爬虫任务。别名:ServiceWrapper面向Web应用的智能化服务封装系统。

code-free爬虫GUIlaymanspiderparametersWebinput-parameters前端HTMLbatch-processingbatch-scriptvisual可视化visualprogrammingscraperdata-collectionrpaRobotics
JavaScript 39.06 k
21 天前
https://static.github-zh.com/github_avatars/airbytehq?size=40
airbytehq / airbyte

Airbyte 开源 EL(T) 平台,帮助用户将数据从应用程序,API 和数据库中同步到数据仓库

datapipeline数据分析data-engineeringJavaPythonetlchange-data-capturedata-collectiondata-integrationeltBigQueryredshiftsnowflakedata-pipelinesql-serverMySQLPostgreSQLs3自托管
Python 18.43 k
13 小时前
snowplow/snowplow
https://static.github-zh.com/github_avatars/snowplow?size=40
snowplow / snowplow

The leader in Customer Data Infrastructure

analyticsdatadata-pipelinedata-collectionproduct-analytics
Scala 6.93 k
11 天前
https://static.github-zh.com/github_avatars/cloudquery?size=40
cloudquery / cloudquery

一个高性能ELT 框架,powered by Apache Arrow

Amazon Web ServicesGoogle 云AzureSQLdata-integrationeltetletl-frameworkBigQuerydata-collectiondata-engineeringKubernetesdataairbyteGitHub API数据分析GoogleGocspmattack-surface-management
Go 6.12 k
2 天前
jitsucom/jitsu
https://static.github-zh.com/github_avatars/jitsucom?size=40
jitsucom / jitsu

Jitsu is an open-source Segment alternative. Fully-scriptable data ingestion engine for modern data teams. Set-up a real-time data pipeline in minutes, not days

data-integrationclickhouseGoBigQuerydata-collectionredshiftsnowflakePostgreSQL
TypeScript 4.31 k
6 天前
https://static.github-zh.com/github_avatars/mendableai?size=40
mendableai / firecrawl-mcp-server

Official Firecrawl MCP Server - Adds powerful web scraping to Cursor, Claude and any other LLM clients.

batch-processingclaudecontent-extractiondata-collectionfirecrawlfirecrawl-aillm-toolsmcp-servermodel-context-protocolsearch-apiweb-crawlerweb-scrapingjavascript-rendering
JavaScript 3.42 k
11 天前
https://static.github-zh.com/github_avatars/pyper-dev?size=40
pyper-dev / pyper

Concurrent Python made simple

asyncioconcurrencyPythonthreadingdata-pipelinesdata-processingmultiprocessingparallel-computingdatadata-collectiondata-engineering
Python 1.43 k
4 个月前
https://static.github-zh.com/github_avatars/Decodo?size=40
Decodo / Decodo

#网络爬虫#HTTP(S)/SOCKS5 rotating residential proxies - code examples & general information.

proxyproxieshttps-proxyproxy-serverPythondata-collectionproxy-listpython-scraperscrapingweb-scrapingdata-gatheringhttp-proxyip-rotationsocks5-proxy
Java 1.13 k
12 天前
https://static.github-zh.com/github_avatars/plan-player-analytics?size=40
plan-player-analytics / Plan

Player Analytics plugin for Minecraft Server platforms - View player activity of your server with ease. 📆

analytics统计可视化MySQLSQLitewebserverdata-collectionsponge-pluginspigot-pluginbungeecord-pluginbukkit-pluginHacktoberfestFabricMC
Java 932
7 天前
https://static.github-zh.com/github_avatars/getodk?size=40
getodk / collect

#安卓#ODK Collect is an Android app for filling out forms. It's been used to collect billions of data points in challenging environments around the world. Contribute and make the world a better place! ✨📋✨

odkglobal-developmentdata-collectionglobal-healthmhealthxformsAndroidJavasocial-impactmobile-data-collection
Java 732
12 天前
https://static.github-zh.com/github_avatars/chaoss?size=40
chaoss / augur

Python library and web service for Open Source Software Health and Sustainability metrics & data collection. You can find our documentation and new contributor information easily here: https://oss-aug...

LinuxOpen SourceGitHub数据可视化facadeGit监控sustainabilityhealthpython-librarydata-collectiondata-modelingUnixresearchHacktoberfesthacktoberfest2020
Python 635
5 天前
https://static.github-zh.com/github_avatars/brightdata?size=40
brightdata / brightdata-mcp

#网络爬虫#A powerful Model Context Protocol (MCP) server that provides an all-in-one solution for public web access.

大语言模型mcpmodelcontextprotocolscrapingai-agentsbrowser-automationdata-collectiondata-extractionmcp-serverstructured-dataweb-crawlingweb-scraping
JavaScript 614
10 天前
https://static.github-zh.com/github_avatars/pnoker?size=40
pnoker / iot-dc3

IoT DC3 is a 100% open-source, distributed Internet of Things (IoT) platform built on Spring Cloud. It accelerates IoT project development and simplifies IoT device management, offering a comprehensiv...

Internet of thingsspring-clouddata-collectionmulti-protocolMQTTrtspJavaDockergatewayopc-uatcpsocketplcdcs远程过程调用 (RPC)modbuss7lwm2m
Java 578
6 天前
https://static.github-zh.com/github_avatars/zhaoyachao?size=40
zhaoyachao / zdh_web

大数据采集,抽取平台,zdh_web是zdh系列服务的可视化管理平台,包含数据采集,调度,权限,审批流,私域营销等模块

etlApache Sparkdata-collectionbigdatadatacollectionscheduler
Java 515
21 天前
https://static.github-zh.com/github_avatars/chapmanjacobd?size=40
chapmanjacobd / library

99+ CLI tools to build, browse, and blend your media library

SQLitedatacurationmediaplaylistVideomusicdata-collectionFFmpegmpvyt-dlp命令行界面filesfolders
Python 428
14 天前
https://static.github-zh.com/github_avatars/K3V1991?size=40
K3V1991 / Disable-Firefox-Telemetry-and-Data-Collection

How to disable Firefox Telemetry and Data Collection

browserconfigurationdatadata-collectionFirefoxhow-tolistMozillamozilla-firefoxoptions隐私reporting安全Serversettingstelemetryblocking教程
394
1 年前
https://static.github-zh.com/github_avatars/ScriptSmith?size=40
ScriptSmith / reaper

#网络爬虫#Social media scraping / data collection tool for the Facebook, Twitter, Reddit, YouTube, Pinterest, and Tumblr APIs

scrapingFacebookX (Twitter)RedditYouTubepinteresttumblrAPIsocialmediadata-miningdata-collectionGUI
Python 384
6 年前
https://static.github-zh.com/github_avatars/graphlit?size=40
graphlit / graphlit-mcp-server

Model Context Protocol (MCP) Server for Graphlit Platform

claudecontent-extractiondata-collectionllm-toolsmcp-servermodel-context-protocolsearch-apiunstructured-dataweb-crawlerweb-scraping
TypeScript 301
5 天前
https://static.github-zh.com/github_avatars/elbwalker?size=40
elbwalker / walkerOS

Open-source event data collection and tag management (gtag.js/GTM alternative)

measurementtaggingdata-collectionconsent-managementgdpr
TypeScript 296
5 天前
https://static.github-zh.com/github_avatars/ProjectNeura?size=40
ProjectNeura / LEADS

#计算机科学#Enable your racing car with powerful, data-driven instrumentation, control, and analysis systems, all wrapped up in a gorgeous look.

dashboardesc人工智能机器学习data-collection
Python 259
16 天前
loading...