GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

hdfs

Website
Wikipedia
seaweedfs/seaweedfs
https://static.github-zh.com/github_avatars/seaweedfs?size=40
seaweedfs / seaweedfs

SeaweedFS 是一个用于blob、对象、文件和数据湖的分布式存储系统,可快速存储和服务数十亿个文件

distributed-storagedistributed-systemss3hdfsfusedistributed-file-systemhadoop-hdfsposixtiered-file-systemKubernetesreplicationobject-storages3-storageseaweedfserasure-codingblob-storagecloud-drive
Go 24.8 k
2 天前
https://static.github-zh.com/github_avatars/heibaiying?size=40
heibaiying / BigData-Notes

大数据入门指南 ⭐

hadoophdfsYarnmapreducehiveApache SparkstormhbaseScalakafkazookeeperflumeazkabansqoopphoenixbigdatabig-data
Java 16.49 k
1 年前
ceph/ceph
https://static.github-zh.com/github_avatars/ceph?size=40
ceph / ceph

Ceph is a distributed object, block, and file storage platform

storagesoftware-defined-storagedistributed-storages3block-storagedistributed-file-systemobject-storenfshighly-availableiscsicloud-storageKuberneteshdfssmbhigh-performancefuseposixerasure-codingreplicationnvme-over-fabrics
C++ 15.12 k
14 小时前
juicedata/juicefs
https://static.github-zh.com/github_avatars/juicedata?size=40
juicedata / juicefs

为开发者设计的云文件系统。为云环境设计,兼容 POSIX、HDFS 和 S3 协议的分布式文件系统

filesystemcloud-nativeGoRedisdistributed-systemsstorageobject-storageposixhdfss3bigdata
Go 11.7 k
3 天前
https://static.github-zh.com/github_avatars/wangzhiwubigdata?size=40
wangzhiwubigdata / God-Of-BigData

专注大数据学习面试,大数据成神之路开启。Flink/Spark/Hadoop/Hbase/Hive...

flinkApache Sparkhadoophdfshivehbasekafkazookeeperbigdataflumeazkaban
10.14 k
2 年前
https://static.github-zh.com/github_avatars/piskvorky?size=40
piskvorky / smart_open

Utils for streaming large files (S3, HDFS, gzip, bz2...)

Pythons3hdfsbotostreamingfilestreaming-databz2Hacktoberfest
Python 3.32 k
5 天前
https://static.github-zh.com/github_avatars/TileDB-Inc?size=40
TileDB-Inc / TileDB

The Universal Storage Engine

tiledbarraysstorage-enginescientific-computing数据分析hdfss3s3-storage数据科学sparse-datadataframes
C++ 1.95 k
2 天前
https://static.github-zh.com/github_avatars/collabH?size=40
collabH / bigdata-growth

大数据知识仓库涉及到数据仓库建模、实时计算、大数据、数据中台、系统设计、Java、算法等。

flinkkafkahivemapreduceApache Sparkolaphadoophbasedebeziumhdfsbigdatahudi
Shell 1.63 k
1 个月前
https://static.github-zh.com/github_avatars/water8394?size=40
water8394 / BigData-Interview

#面试#🎯 🌟[大数据面试题]分享自己在网络上收集的大数据相关的面试题以及自己的答案总结.目前包含Hadoop/Hive/Spark/Flink/Hbase/Kafka/Zookeeper框架的面试题知识总结

bigdataApache SparkkafkahbaseflinkhadoophdfsmapreduceYarn面试
1.62 k
4 年前
https://static.github-zh.com/github_avatars/colinmarc?size=40
colinmarc / hdfs

A native go client for HDFS

hdfsGo命令行界面
Go 1.39 k
5 个月前
wgzhao/Addax
https://static.github-zh.com/github_avatars/wgzhao?size=40
wgzhao / Addax

A fast and versatile ETL tool that can transfer data between RDBMS and NoSQL seamlessly

hadoophive数据库clickhouseinfluxdbMySQLsql-servertrinoexcelimpalaOracle 数据库PostgreSQLetlhdfs
Java 1.29 k
3 天前
https://static.github-zh.com/github_avatars/spotify?size=40
spotify / snakebite

A pure python HDFS client

hdfsPython
Python 857
3 年前
https://static.github-zh.com/github_avatars/HariSekhon?size=40
HariSekhon / DevOps-Python-tools

80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Functions, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML...

cloudformationPythonhbaseJSONavroparquetApache Sparkpysparktravis-cielasticsearchsolrhadoophdfsdockerhubDockerLinuxAmazon Web ServicesDevOpsGoogle 云gcf
Python 798
2 个月前
https://static.github-zh.com/github_avatars/sunnyandgood?size=40
sunnyandgood / BigData

💎🔥大数据学习笔记

hadoophivehbasehdfszookeepersqoopmapreduceflumeMySQLLinuxShell
Java 682
6 年前
https://static.github-zh.com/github_avatars/Stratio?size=40
Stratio / sparta

Real Time Analytics and Data Pipelines based on Spark Streaming

streaming-dataScalaApache Sparkstreamingspark-streamingolapkafkahdfsworkflowanalyticsreal-timesparksqllambdatriggers
Scala 526
6 年前
https://static.github-zh.com/github_avatars/lensesio?size=40
lensesio / kafka-connect-ui

Deprecated - See Lenses.io Community Edition

kafkakafka-connectelasticsearchApache Cassandras3documentdbRedisMQTThdfsinfluxdbX (Twitter)
JavaScript 513
1 个月前
https://static.github-zh.com/github_avatars/dromara?size=40
dromara / CloudEon

CloudEon uses Kubernetes to install and deploy open-source big data components, enabling the containerized operation of an open-source big data platform. This allows you to reduce your focus on underl...

hadoopKubernetesdorishdfsYarnbigdatacloudnative
FreeMarker 469
3 个月前
https://static.github-zh.com/github_avatars/fabiogjardim?size=40
fabiogjardim / bigdata_docker

Big Data Ecosystem Docker

hadoophdfshbasehiveprestoApache SparkJupyter NotebookhueMongoDBmetabasenifiMySQLzookeeper
VBA 417
2 年前
https://static.github-zh.com/github_avatars/uber?size=40
uber / storagetapper

StorageTapper is a scalable realtime MySQL change data streaming, logical backup and logical replication service

MySQLkafkaavrocdcetlJSONmsgpackhdfss3PostgreSQLclickhouse
Go 355
2 年前
https://static.github-zh.com/github_avatars/tirthajyoti?size=40
tirthajyoti / Spark-with-Python

#计算机科学#Fundamentals of Spark with Python (using PySpark), code examples

pysparkApache Sparkdataframe机器学习big-data数据库map-reducePythonhdfsanalyticshadoopdistributed-computingparallel-computingSQLapache
Jupyter Notebook 350
3 年前
loading...