GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

spark-sql

Website
Wikipedia
https://static.github-zh.com/github_avatars/getredash?size=40
getredash / redash

Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.

redashPython可视化analyticsbiredshiftBigQueryathenaMySQLPostgreSQLdashboardJavaScriptbusiness-intelligencedatabricksApache Sparkspark-sqlHacktoberfest
Python 27.41 k
11 天前
https://static.github-zh.com/github_avatars/apache?size=40
apache / kyuubi

Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.

Apache SparkhiveSQLthriftjdbcspark-sqldata-lakehadoopKubernetesHacktoberfest
Scala 2.2 k
2 天前
https://static.github-zh.com/github_avatars/dotnet?size=40
dotnet / spark

#计算机科学#.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.

Apache SparkC#.NETanalyticsbigdataspark-streamingspark-sql机器学习F#dotnet-standardstreamingAzurehdinsightdatabricksemrMicrosoft
C# 2.07 k
1 个月前
https://static.github-zh.com/github_avatars/almond-sh?size=40
almond-sh / almond

A Scala kernel for Jupyter

Jupyter NotebookScalaRepl.itjupyter-kernelsApache Sparkspark-sql
Scala 1.62 k
23 天前
https://static.github-zh.com/github_avatars/apache?size=40
apache / incubator-gluten

Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.

clickhousesimdspark-sqlvectorizationveloxarrow
Scala 1.37 k
2 天前
https://static.github-zh.com/github_avatars/databricks?size=40
databricks / LearningSparkV2

This is the github repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]

Apache Sparkspark-sqlspark-mllibmlflowdelta-lake
Scala 1.3 k
5 个月前
https://static.github-zh.com/github_avatars/oeljeklaus-you?size=40
oeljeklaus-you / UserActionAnalyzePlatform

电商用户行为分析大数据平台

Apache SparkJavahadoopsparkjavaspark-sql
Java 1.04 k
3 年前
https://static.github-zh.com/github_avatars/qubole?size=40
qubole / sparklens

Qubole Sparklens tool for performance tuning Apache Spark

Apache SparkScalaSimulationschedulerschedulingperformanceperformance-analysisperformance-metricsperformance-tuningperformance-visualizationspark-sqlsparkjavaspark-mllibspark-mlcluster
Scala 578
1 年前
https://static.github-zh.com/github_avatars/kevinschaich?size=40
kevinschaich / pyspark-cheatsheet

🐍 Quick reference guide to common patterns & functions in PySpark.

pysparkcheatsheetcheatcheatsheetsreferencereferences文档数据科学dataApache Sparkspark-sqlguideguidesquickstart
555
2 年前
https://static.github-zh.com/github_avatars/japila-books?size=40
japila-books / spark-sql-internals

The Internals of Spark SQL

Apache Sparkspark-sqlinternalsmkdocs-materialbook
467
5 个月前
https://static.github-zh.com/github_avatars/zsvoboda?size=40
zsvoboda / ngods-stocks

New Generation Opensource Data Stack Demo

cubedagsterdatahubdbticebergmetabasePythonApache Sparkspark-sqltrino
Jupyter Notebook 432
2 年前
https://static.github-zh.com/github_avatars/microsoft?size=40
microsoft / data-accelerator

Data Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsigh...

Apache Sparkspark-streamingspark-sqlsparksqlstreaming-datastreamingservicefabricNode.jsDockerhdinsightcosmosdbReactAzureiothubbig-dataInternet of thingskafkakafka-streams
C# 302
3 个月前
https://static.github-zh.com/github_avatars/cuebook?size=40
cuebook / cuelake

Use SQL to build ELT pipelines on a data lakehouse.

apache-icebergdeltalakehousedatalakedata-lakeeltetldata-engineeringdata-integrationdata-ingestionApache Sparkspark-sqldata-transferpipelinesdata-pipelinezeppelin-notebookSQL
JavaScript 287
3 年前
https://static.github-zh.com/github_avatars/jaceklaskowski?size=40
jaceklaskowski / spark-workshop

Apache Spark™ and Scala Workshops

workshopApache Sparkspark-sqlspark-mllib
HTML 264
1 年前
https://static.github-zh.com/github_avatars/Qbeast-io?size=40
Qbeast-io / qbeast-spark

Qbeast-spark: DataSource enabling multi-dimensional indexing and efficient data sampling. Big Data, free from the unnecessary!

Apache SparkScalabig-datasamplingdatasourcespark-sql
Scala 228
5 个月前
https://static.github-zh.com/github_avatars/Chabane?size=40
Chabane / bigdata-playground

#计算机科学#A complete example of a big data application using : Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLib, Apache Flink, Scala, Python, Apache Kafka, Apache Hbase, Apache Parquet, Apache Avro, Apach...

Dockerspark-sqlScalakafkahbaseparquetavroNode.jsAngularGraphQLMongoDB机器学习big-datahadoopApache Sparkapache-flinkspark-streamingtwitter-apiPythonkops
TypeScript 209
6 年前
https://static.github-zh.com/github_avatars/bluishglc?size=40
bluishglc / bdp

A prototype project of big data platform, the source codes of the book Big Data Platform Architecture and Prototype

bigdataprototypequickstartApache Sparkspark-streamingspark-sqlDemoRediskafkasqoopsparksql
Java 199
5 年前
https://static.github-zh.com/github_avatars/polomarcus?size=40
polomarcus / Spark-Structured-Streaming-Examples

Spark Structured Streaming / Kafka / Cassandra / Elastic

Apache SparkkafkaApache Cassandraspark-sql
Scala 183
2 年前
https://static.github-zh.com/github_avatars/mc2-project?size=40
mc2-project / opaque-sql

#计算机科学#An encrypted data analytics platform

安全隐私机器学习analyticsApache Sparkspark-sqlenclave
Scala 182
2 年前
https://static.github-zh.com/github_avatars/xiaogp?size=40
xiaogp / recsys_spark

Spark SQL 实现 ItemCF,UserCF,Swing,推荐系统,推荐算法,协同过滤

spark-sqlcollaborative-filteringrecommender-system
Scala 141
5 年前
loading...