GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

deltalake

Website
Wikipedia
https://static.github-zh.com/github_avatars/paradedb?size=40
paradedb / pg_analytics

DuckDB-powered data lake analytics from Postgres

analyticsarrowcolumnardatafusionlakehouseparquetPostgreSQLduckdbolapbig-data数据库datalakedeltalakeicebergobject-storageSQLlakehouse-platform
Rust 522
4 个月前
https://static.github-zh.com/github_avatars/databrickslabs?size=40
databrickslabs / dbldatagen

Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used to generate large simulated / synthetic data sets for test, POC...

pysparkPythondata-generationfakerApache Sparkspark-streamingdeltalakedatabrickssynthetic-data
Python 414
1 个月前
https://static.github-zh.com/github_avatars/delta-io?size=40
delta-io / kafka-delta-ingest

A highly efficient daemon for streaming data from Kafka into Delta Lake

deltalakedeltaRustkafka
Rust 410
3 个月前
https://static.github-zh.com/github_avatars/MrPowers?size=40
MrPowers / mack

Delta Lake helper methods in PySpark

deltalakepyspark
Python 325
1 年前
https://static.github-zh.com/github_avatars/japila-books?size=40
japila-books / delta-lake-internals

The Internals of Delta Lake

deltalakebookinternalsdelta-lakebooksdatalake
184
7 个月前
https://static.github-zh.com/github_avatars/smart-data-lake?size=40
smart-data-lake / smart-data-lake

Smart Automation Tool for building modern Data Lakes and Data Pipelines

data-lakeScalaApache Sparkhadoophivedeltalaketransform-datadata-pipelines
Scala 124
2 天前
https://static.github-zh.com/github_avatars/flintml?size=40
flintml / flintml

#计算机科学#One-click ML infrastructure for teams that just want to get sh*t done.

deltalakeJupyter Notebook机器学习mlopspolars数据科学
Python 123
1 个月前
https://static.github-zh.com/github_avatars/uname-n?size=40
uname-n / deltabase

a lightweight, comprehensive solution for managing delta tables built on polars and deltalake

数据库deltalakepolarsSQL
Python 122
7 个月前
https://static.github-zh.com/github_avatars/izhangzhihao?size=40
izhangzhihao / Real-time-Data-Warehouse

Real-time Data Warehouse with Apache Flink & Apache Kafka & Apache Hudi

flinkdata-warehousedata-warehousingflink-sqldebeziumkafkaelasticsearchdelta-lakecdcchange-data-capturehudiicebergSQLdatalakedeltadeltalakeApache Sparkspark-sql
Dockerfile 117
2 年前
https://static.github-zh.com/github_avatars/WeBankFinTech?size=40
WeBankFinTech / Streamis

Streaming application development and management system, based on Linkis and DSS, planning to provide the workflow-like graphical drag-and-drop development capability.

flinklinkisstreaminghudiicebergdatalakekafkadeltalake
Java 107
3 个月前
https://static.github-zh.com/github_avatars/anneglienke?size=40
anneglienke / 101_upsert-delta

This repository exemplifies a simple ELT process using delta to perform upsert and remove data files that aren't in the latest state of the transaction log for the table.

deltadelta-lakedeltalake
Python 101
3 年前
https://static.github-zh.com/github_avatars/martandsingh?size=40
martandsingh / ApacheSpark

This repository will help you to learn about databricks concept with the help of examples. It will include all the important topics which we need in our real life experience as a data engineer. We wil...

apachespark数据分析data-engineering数据库databricksdatalakedeltalakeetl-pipelinehadoophiveApache Sparkspark-sqlspark-streamingtimetraveletlpysparkSQL
Python 100
1 年前
https://static.github-zh.com/github_avatars/dacort?size=40
dacort / faker-cli

Command-line interface to quickly generate fake CSV and JSON data

Amazon Web ServicesCSVJSONdeltalakeparquet
Python 73
1 年前
https://static.github-zh.com/github_avatars/bhavink?size=40
bhavink / databricks

Databricks Platform - Architecture, Security, Automation and much more!!

databricksdeltalake安全
Jupyter Notebook 51
24 天前
https://static.github-zh.com/github_avatars/buoyant-data?size=40
buoyant-data / oxbow

Collection of AWS Lambdas for creating and managing Delta tables

deltalakeparquetdatalakelambdaRust
Rust 41
16 天前
https://static.github-zh.com/github_avatars/sankamuk?size=40
sankamuk / PysparkCheatsheet

PySpark Cheatsheet

Apache SparkPythondeltalake
Python 36
3 年前
https://static.github-zh.com/github_avatars/DataTech-Solutions?size=40
DataTech-Solutions / Threat-Detection-and-Visualization

#计算机科学#Threat Detection and Visualization

APIdatalakedefenderdeltalakePostmanpowerbisccmsiemSQLactive-directory机器学习
TSQL 32
2 年前
https://static.github-zh.com/github_avatars/mrjsj?size=40
mrjsj / delta-lake-explorer

Azuredeltalakeduckdbsql-client
Python 29
1 年前
https://static.github-zh.com/github_avatars/mrjsj?size=40
mrjsj / msfabricutils

Spark-free Python utilities for Microsoft Fabric focused on Data Engineering using Polars and delta-rs

dataframedeltalakeduckdbPythonFabricMCpolarsdata-engineering
Python 27
2 个月前
https://static.github-zh.com/github_avatars/newfront?size=40
newfront / hitchhikers_guide_to_deltalake_streaming

Don't Panic. This guide will help you when it feels like the end of the world.

apacheApache Sparkdeltalake
Jupyter Notebook 26
2 个月前
loading...