GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

mapreduce

Website
Wikipedia
https://static.github-zh.com/github_avatars/donnemartin?size=40
donnemartin / data-science-ipython-notebooks

#计算机科学#Python 数据科学学习笔记:深度学习 (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, 大数据 (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python 核心, AWS, Linux命令

Python机器学习深度学习数据科学big-dataAmazon Web ServicesTensorflowtheanocaffescikit-learnkaggleApache SparkmapreducehadoopmatplotlibpandasNumPySciPyKeras
Python 28.28 k
1 年前
https://static.github-zh.com/github_avatars/heibaiying?size=40
heibaiying / BigData-Notes

大数据入门指南 ⭐

hadoophdfsYarnmapreducehiveApache SparkstormhbaseScalakafkazookeeperflumeazkabansqoopphoenixbigdatabig-data
Java 16.49 k
1 年前
PowerJob/PowerJob
https://static.github-zh.com/github_avatars/PowerJob?size=40
PowerJob / PowerJob

新一代分布式任务调度与计算框架,支持CRON、API、固定频率、固定延迟等调度策略,提供工作流来编排任务解决依赖关系

schedulerworkflowdistributedmapreduceJavacronjobjob-scheduler
Java 7.5 k
5 个月前
https://static.github-zh.com/github_avatars/douban?size=40
douban / dpark

Python clone of Spark, a MapReduce alike framework in Python

bigdatamapreducedparkstream-processingApache SparkPython
Python 2.68 k
4 年前
https://static.github-zh.com/github_avatars/collabH?size=40
collabH / bigdata-growth

大数据知识仓库涉及到数据仓库建模、实时计算、大数据、数据中台、系统设计、Java、算法等。

flinkkafkahivemapreduceApache Sparkolaphadoophbasedebeziumhdfsbigdatahudi
Shell 1.63 k
1 个月前
https://static.github-zh.com/github_avatars/water8394?size=40
water8394 / BigData-Interview

#面试#🎯 🌟[大数据面试题]分享自己在网络上收集的大数据相关的面试题以及自己的答案总结.目前包含Hadoop/Hive/Spark/Flink/Hbase/Kafka/Zookeeper框架的面试题知识总结

bigdataApache SparkkafkahbaseflinkhadoophdfsmapreduceYarn面试
1.62 k
4 年前
https://static.github-zh.com/github_avatars/mahmoudparsian?size=40
mahmoudparsian / data-algorithms-book

#计算机科学# MapReduce, Spark, Java, and Scala for Data Algorithms Book

hadoop-mapreduceJavadistributed-computingScalamapreducePython机器学习pysparkApache Sparkdesign-patterns
Java 1.08 k
8 个月前
https://static.github-zh.com/github_avatars/microsoft?size=40
microsoft / Mobius

C# and F# language binding and extensions to Apache Spark

Apache SparkdataframedatasetstreamingC#spark-streamingF#bigdatamapreduce
C# 940
1 年前
https://static.github-zh.com/github_avatars/happyer?size=40
happyer / distributed-computing

distributed_computing include mapreduce kvstore etc.

raftmapreduceconsistency
Go 840
5 年前
https://static.github-zh.com/github_avatars/cdapio?size=40
cdapio / cdap

An open source framework for building data analytic applications.

unifiedintegrationplatformdatasetmapreduceApache Sparkspark-streamingJavacdapPythonmiddleware
Java 778
3 天前
https://static.github-zh.com/github_avatars/bcongdon?size=40
bcongdon / corral

🐎 A serverless MapReduce framework written for AWS Lambda

aws-lambdamapreduceServerless
Go 694
4 年前
https://static.github-zh.com/github_avatars/sunnyandgood?size=40
sunnyandgood / BigData

💎🔥大数据学习笔记

hadoophivehbasehdfszookeepersqoopmapreduceflumeMySQLLinuxShell
Java 682
6 年前
https://static.github-zh.com/github_avatars/grailbio?size=40
grailbio / bigslice

A serverless cluster computing system for the Go programming language

clustercomputingGomapreducebigdata机器学习etl
Go 554
2 年前
https://static.github-zh.com/github_avatars/apache?size=40
apache / uniffle

Uniffle is a high performance, general purpose Remote Shuffle Service.

mapreduceshuffleApache Sparkremote-shuffle-serviceRSStez
Java 417
4 天前
https://static.github-zh.com/github_avatars/CamDavidsonPilon?size=40
CamDavidsonPilon / tdigest

t-Digest data structure in Python. Useful for percentiles and quantiles, including distributed enviroments like PySpark

Pythonestimatepysparkdistributed-computingmapreduce
Python 395
2 年前
https://static.github-zh.com/github_avatars/cubefs?size=40
cubefs / compass

Compass is a task diagnosis platform for bigdata

bigdataApache SparkhadoopflinkmapreduceschedulerSQLairflowdolphinscheduler
Java 387
7 个月前
https://static.github-zh.com/github_avatars/RedisGears?size=40
RedisGears / RedisGears

Dynamic execution framework for your Redis data

Redismapreducestream-processinganalytics
Rust 378
6 个月前
https://static.github-zh.com/github_avatars/cwensel?size=40
cwensel / cascading

Cascading is a feature rich API for defining and executing complex and fault tolerant data processing flows locally or on a cluster.

hadoopJavamapreducetez
Java 350
2 个月前
https://static.github-zh.com/github_avatars/datawhalechina?size=40
datawhalechina / juicy-bigdata

🎉🎉🐳 Datawhale大数据处理导论教程 | 大数据技术方向的开篇课程🎉🎉

bigdatahadoophivehbasehdfsApache Sparkmapreduce
Python 318
2 年前
https://static.github-zh.com/github_avatars/DigitalPebble?size=40
DigitalPebble / behemoth

#自然语言处理#Behemoth is an open source platform for large scale document analysis based on Apache Hadoop.

hadoopJava自然语言处理mapreduce
Java 282
7 年前
loading...