GitHub Chinese Community

hadoop

donnemartin / data-science-ipython-notebooks

#Computer Science# Python data science study notes: deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and Linux commands

Python, machine learning, deep learning, data science, big-data, Amazon Web Services, Tensorflow, theano, caffe, scikit-learn, kaggle, Apache Spark, mapreduce, hadoop, matplotlib, pandas, NumPy, SciPy, Keras
Python 28.42 k
1 year ago
itdevbooks / pdf

Programming e-books and books, including C, C#, Docker, Elasticsearch, Git, Hadoop, HeadFirst, Java, Javascript, jvm, Kafka, Linux, Maven, MongoDB, MyBatis, MySQL, Netty, Nginx, Python, RabbitMQ, Redis, Scala, Solr, Spark, Spring, SpringBoot, SpringC...

elasticsearch, Spring Boot, springcloud, Java, hadoop, Docker, MySQL, Netty, Linux, rabbitmq
19.18 k
3 years ago
spotify / luigi

Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.

Python, luigi, orchestration-framework, scheduling, hadoop
Python 18.41 k
2 months ago
Tencent / APIJSON

🏆 Real-time, zero-code, full-featured, highly secure ORM library 🚀 Backend APIs and docs with zero code; the frontend (client) customizes the data and structure of the returned JSON 🏆 Real-Time coding-free, powerful and secure ORM 🚀 providing APIs and Docs without coding by Backend, and the returned JSON of API can...

postgresql-database, tidb, sql-server, Oracle Database, clickhouse, PostgreSQL, hive, hadoop, tdengine, low-code, lowcode, no-code, Serverless, baas, CRUD, milvus, elasticsearch, influxdb, databricks, snowflake
Java 18.09 k
1 day ago
heibaiying / BigData-Notes

A beginner's guide to big data ⭐

hadoop, hdfs, Yarn, mapreduce, hive, Apache Spark, storm, hbase, Scala, kafka, zookeeper, flume, azkaban, sqoop, phoenix, bigdata, big-data
Java 16.56 k
2 years ago
prestodb / presto

Presto is a high-performance distributed SQL query engine for big data.

Java, presto, hive, hadoop, big-data, SQL, data, lakehouse, Query (disambiguation)
Java 16.42 k
5 hours ago
apache / hadoop

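Hadoop is an open-source framework for distributed computing and storage that uses a network of many computers to solve data-intensive and compute-intensive problems. Built on the MapReduce computing model, it provides a software framework for the distributed storage and processing of big data.

hadoop
Java 15.19 k
2 days ago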
deeplearning4j / deeplearning4j

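Deeplearning4j is an open-source deep learning library written for Java and the JVM, a computing framework with broad support for deep learning algorithms.

Java, gpu, deep learning, neural-nets, deeplearning4j, dl4j, hadoop, Apache Spark, IntelliJ IDEA, artificial intelligence, Python, Scala, Clojure, linear-algebra, matrix-library
Java 14.06 k
8 days ago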
trinodb / trino

Trino is a distributed SQL query engine for big data (formerly PrestoSQL).

Java, presto, hive, hadoop, big-data, SQL, prestodb, database, distributed-systems, distributed-database, data science, datalake, jdbc, query-engine, trino, analytics, delta-lake, iceberg
Java 11.65 k
11 hours ago
wangzhiwubigdata / God-Of-BigData

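Focused on big data learning and interview preparation; the road to big data mastery starts here. Flink/Spark/Hadoop/Hbase/Hive...

flink, Apache Spark, hadoop, hdfs, hive, hbase, kafka, zookeeper, bigdata, flume, azkaban
10.21 k
2 years ago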
linkedin / school-of-sre

At LinkedIn, we are using this curriculum for onboarding our entry-level talents into the SRE role.

SRE, Linux, Network, Git, Python, MySQL, NoSQL, hadoop, system-design, security
HTML 8.01 k
1 year ago
h2oai / h2o-3

#Computer Science# H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Me...

h2o, machine learning, data science, deep learning, big-data, ensemble-learning, gbm, random-forest, naive-bayes, pca, Open Source, distributed, Java, Python, R, hadoop, Apache Spark, gpu, automl
Jupyter Notebook 7.24 k
1 day ago
Alluxio / alluxio

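As a data orchestration layer, Alluxio brings speed and agility to big data and AI workloads while reducing costs, enabling users to migrate to newer storage solutions such as object storage.

alluxio, memory-speed, hadoop, Apache Spark, presto, Tensorflow, data analysis, data-orchestration, virtual-distributed-filesystem
Java 7.05 k
3 months ago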
HariSekhon / DevOps-Bash-tools

1000+ DevOps Bash Scripts - AWS, GCP, Kubernetes, Docker, CI/CD, APIs, SQL, PostgreSQL, MySQL, Hive, Impala, Kafka, Hadoop, Jenkins, GitHub, GitLab, BitBucket, Azure DevOps, TeamCity, Spotify, MP3, LD...

Bash, Docker, Perl, cloudera, hadoop, kafka, PostgreSQL, MySQL, DevOps, Jenkins, Amazon Web Services, Google Cloud, API, Kubernetes, GitHub, Git, Linux, continuous integration, Terraform, Hacktoberfest
Shell 7.03 k
22 days ago
apache / hive

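Hive is a data warehouse tool built on Hadoop that maps structured data files to database tables and provides simple SQL query capabilities.

Java, hive, database, SQL, apache, big-data, hadoop
Java 5.75 k
2 days ago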
apache / ignite

Apache Ignite

distributed-sql-database, Internet of things, osgi, network-client, ignite, data-management-platform, big-data, cloud, database, network-server, hadoop, SQL, cache, in-memory-database
Java 4.96 k
7 hours ago
apache / calcite

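Apache Calcite is a dynamic data management framework. It provides many features of a typical database management system, such as SQL parsing, SQL validation, SQL query optimization, SQL generation, and connecting to and querying data, but it deliberately omits some key functions: for example, Calcite does not store metadata or the underlying data, and it does not fully include the algorithms for processing data.

geospatial, calcite, Java, big-data, hadoop, SQL
Java 4.9 k
5 days ago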
tomwhite / hadoop-book

Example source code accompanying O'Reilly's "Hadoop: The Definitive Guide" by Tom White

hadoopbook
Makefile 3.51 k
5 years ago
WeBankFinTech / DataSphereStudio

DataSphereStudio is a one-stop data application development & management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualizati...

workflow, governance, azkaban, davinci, linkis, Apache Spark, hive, hadoop, visualis, zeppelin, hue, tableau, griffin, kettle, airflow, flink, dolphinscheduler, atlas
Java 3.19 k
4 months ago
apache / nutch

#Web Crawler# Apache Nutch is an extensible and scalable web crawler

Java, nutch, web-crawler, crawling, hadoop, apache
Java 3.05 k
9 days ago