#

google-cloud-dataproc

https://static.github-zh.com/github_avatars/kubeflow?size=40

Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.

Go 3.02 k
7 天前
https://static.github-zh.com/github_avatars/GoogleCloudPlatform?size=40

[DEPRECATED] Kubernetes operator for managing the lifecycle of Apache Flink and Beam applications.

Go 658
3 年前
https://static.github-zh.com/github_avatars/GoogleCloudDataproc?size=40

Run in all nodes of your cluster before the cluster starts - lets you customize your cluster

Shell 598
4 天前
https://static.github-zh.com/github_avatars/GoogleCloudDataproc?size=40

BigQuery data source for Apache Spark: Read data from BigQuery into DataFrames, write DataFrames into BigQuery tables.

Java 408
4 天前
https://static.github-zh.com/github_avatars/GoogleCloudDataproc?size=40

Libraries and tools for interoperability between Hadoop-related open-source software and Google Cloud Platform.

Java 285
3 天前
https://static.github-zh.com/github_avatars/GoogleCloudDataproc?size=40
Jupyter Notebook 204
3 个月前
https://static.github-zh.com/github_avatars/GoogleCloudDataproc?size=40

Tools for creating Dataproc custom images

Python 34
3 个月前
https://static.github-zh.com/github_avatars/kumgaurav?size=40

A sample demo to check latest spark, big query connector and scala 2.12

Scala 1
4 年前
https://static.github-zh.com/github_avatars/VagnerBellacosa?size=40

Sua missão será criar um ecossistema de Big Data usando o Google Cloud Platform (GCP). Para isso, o expert te ensinará a configurar o Google Cloud Dataproc, um Hadoop totalmente gerenciado, usando seu...

Python 0
4 年前
https://static.github-zh.com/github_avatars/Eu-Bitwise?size=40

Streaming JSON data to Spark or Google Cloud Dataproc.

Python 0
2 年前
https://static.github-zh.com/github_avatars/jonathanAmancioSales?size=40

Projeto do Curso "Criando um Ecossistema Hadoop Totalmente Gerenciado com Google Cloud Dataproc" do Bootcamp Data Engineer da Digital Innovation One

Shell 0
4 年前
https://static.github-zh.com/github_avatars/varmor?size=40

This project explores the core concepts of distributed data processing using the MapReduce programming model , implemented with Python via Hadoop Streaming , and deployed on a multi-node Google Cloud ...

Python 0
2 个月前
Website
Wikipedia