GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

apachespark

Website
Wikipedia
https://static.github-zh.com/github_avatars/DataExpert-io?size=40
DataExpert-io / data-engineer-handbook

数据工程师学习资源清单

apachesparkAwesome ListsbigdatadatadataengineeringSQL
Jupyter Notebook 29.37 k
6 天前
https://static.github-zh.com/github_avatars/apache?size=40
apache / hudi

Upserts, Deletes And Incremental Processing on Big Data.

hudiapachehudidatalakebigdataapachesparkincremental-processingstream-processingdata-integrationapacheflink
Java 5.84 k
1 天前
https://static.github-zh.com/github_avatars/holdenk?size=40
holdenk / sparkProjectTemplate.g8

Template for Spark Projects

apachesparkApache Spark
Scala 102
1 年前
https://static.github-zh.com/github_avatars/martandsingh?size=40
martandsingh / ApacheSpark

This repository will help you to learn about databricks concept with the help of examples. It will include all the important topics which we need in our real life experience as a data engineer. We wil...

apachespark数据分析data-engineering数据库databricksdatalakedeltalakeetl-pipelinehadoophiveApache Sparkspark-sqlspark-streamingtimetraveletlpysparkSQL
Python 98
1 年前
https://static.github-zh.com/github_avatars/funkyminds?size=40
funkyminds / cleanframes

type-class based data cleansing library for Apache Spark SQL

Apache SparksparksqlScalabigdataapachespark
Scala 78
6 年前
https://static.github-zh.com/github_avatars/josephmachado?size=40
josephmachado / docker_for_data_engineers

Code for blog at: https://www.startdataengineering.com/post/docker-for-de/

apachesparkDockerDocker Composepyspark
C 37
1 年前
https://static.github-zh.com/github_avatars/propelledanalytics?size=40
propelledanalytics / SparkSQL.jl

SparkSQL.jl enables Julia programs to work with Apache Spark data using just SQL.

Apache SparkJulia 语言apachespark
Julia 25
1 年前
https://static.github-zh.com/github_avatars/tspannhw?size=40
tspannhw / FLiPStackWeekly

FLaNK AI Weekly covering Apache NiFi, Apache Flink, Apache Kafka, Apache Spark, Apache Iceberg, Apache Ozone, Apache Pulsar, and more...

apacheflinkapachesparkclouderalakehousestreaming
21
5 天前
https://static.github-zh.com/github_avatars/aravinthsci?size=40
aravinthsci / Spark_Delta_Lake

Delta Lake Examples

Apache Sparkapachesparkdelta-lakedeltalakedatalake
Jupyter Notebook 12
5 年前
https://static.github-zh.com/github_avatars/SmartDataAnalytics?size=40
SmartDataAnalytics / MA-INF-4223-DBDA-Lab

#计算机科学#Repository for Lab “Distributed Big Data Analytics” (MA-INF 4223), University of Bonn

teachingapachesparkbigdatasemantics机器学习RDF (Resource Description Framework)university
Jupyter Notebook 10
3 年前
https://static.github-zh.com/github_avatars/SandeepAswathnarayana?size=40
SandeepAswathnarayana / professional-certificate-programs

This repository contains all the projects and labs I worked on while pursuing professional certificate programs, specializations, and bootcamp. [Areas: Deep Learning, Machine Learning, Applied Data Sc...

深度学习机器学习datasciencerecurrent-neural-networksPythonPyTorchTensorflowpandasNumPymatplotlibSciPyscikit-learnrecommender-systemrestricted-boltzmann-machineseabornautoencoderimage-classificationapachespark
Jupyter Notebook 9
5 年前
https://static.github-zh.com/github_avatars/datumbrain?size=40
datumbrain / gossub

Trigger spark-submit in Golang. A Go implementation of famous SparkLauncher.java.

Apache SparkapachesparkGo
Go 7
5 年前
https://static.github-zh.com/github_avatars/sfrechette?size=40
sfrechette / spark-jdbc-mssql

Connect to SQL Server using Apache Spark

sql-serverjdbc-driverApache SparkScalaapachespark
Scala 7
9 年前
https://static.github-zh.com/github_avatars/CarolinaNicasio?size=40
CarolinaNicasio / APACHESPARK-PYSPARK-2023

PySpark es una biblioteca de procesamiento de datos distribuidos en Python que permite procesar grandes volúmenes de datos en clústeres utilizando el framework Apache Spark, ofreciendo un alto rendim...

apacheapachespark数据科学dataframeActionspysparkPythonApache Spark
7
2 年前
https://static.github-zh.com/github_avatars/lensesio?size=40
lensesio / lenses-jdbc-spark

Apache Spark with Kafka via JDBC !!!

kafkaapachesparkjdbc-driver
Java 6
7 年前
https://static.github-zh.com/github_avatars/funkyminds?size=40
funkyminds / cleanframes-examples

Examples usages for cleanframes library

Apache SparksparksqlbigdataScalaapachespark
Scala 5
6 年前
https://static.github-zh.com/github_avatars/sahith?size=40
sahith / Link-Prediction-for-Citation-Networks-using-Apache-Spark

Link Prediction is about predicting the future connections in a graph. In this project, Link Prediction is about predicting whether two authors will be collaborating for their future paper or not give...

ScalaAmazon Web Servicesemrapachesparkdataframess3bigdatabig-databig-data-analyticsdatabricks
Scala 5
6 年前
https://static.github-zh.com/github_avatars/ashkrit?size=40
ashkrit / sparkmicroservices

Microservices for Spark application

apachespark微服务
Java 5
2 年前
https://static.github-zh.com/github_avatars/divithraju?size=40
divithraju / divith-raju-Immigration-Data-Engineering

A Capstone Project that covers several aspects of Data Engineering (Data Exploration, Cleaning, Modeling, Pipelining, Processing)

apachesparkbigdatadatacleaningdataengineeringdatalakedatasetpandasSQL
Jupyter Notebook 3
2 年前
https://static.github-zh.com/github_avatars/AbdelmajidLh?size=40
AbdelmajidLh / spark-functionality-repo

Ce dépôt GitHub contient un document détaillé sur les bases du langage Scala.

apacheapachesparkdatabrickspysparkPythonScalaApache Spark
3
1 年前
loading...