# datalakehouse

https://static.github-zh.com/github_avatars/linkedin?size=40
Java 370
2 days ago
https://static.github-zh.com/github_avatars/lakevision-project?size=40
Python 43
15 hours ago
https://static.github-zh.com/github_avatars/ociexplained?size=40
Jupyter Notebook 7
2 years ago
https://static.github-zh.com/github_avatars/prefeitura-rio?size=40

dbt project for the Data Lake of the Municipal Health Department (Secretaria Municipal de Saúde)

Shell 5
8 hours ago
https://static.github-zh.com/github_avatars/aswinjose89?size=40

Connecting PrestoDB with external data sources such as MongoDB, Elasticsearch, MySQL, and Hadoop to manipulate big data

2
2 years ago
https://static.github-zh.com/github_avatars/gabriel-solon-padilha?size=40

My eleventh project, in which I build a data lakehouse using distributed computing on Databricks

HTML 1
3 years ago
https://static.github-zh.com/github_avatars/dwickyferi?size=40

This repository provides a modular and easy-to-extend ETL pipeline that streams data from a PostgreSQL database into a StarRocks data warehouse using RisingWave as the real-time streaming computation ...

1
5 months ago
https://static.github-zh.com/github_avatars/BsoBird?size=40

A prototype for implementing data lake catalog management based only on arbitrary file systems

Java 1
2 months ago
https://static.github-zh.com/github_avatars/subbota19?size=40

This project serves as a personal lab for developing and honing skills in distributed data processing and data lake architecture.

Java 1
2 months ago
https://static.github-zh.com/github_avatars/Alex-Nettekoven?size=40

Real-Time Healthcare Data Lakehouse for Predictive Analytics (Synthea, Faker, Kafka, Spark, Delta Lake, BigQuery)

Jupyter Notebook 0
22 days ago
https://static.github-zh.com/github_avatars/rayyan-merchant?size=40

A scalable and optimized data warehouse solution designed for efficient data integration, transformation, and analytics. This project demonstrates ETL workflows, dimensional modeling, and query perfor...

0
3 months ago
https://static.github-zh.com/github_avatars/sainathd07?size=40
PLpgSQL 0
6 months ago
https://static.github-zh.com/github_avatars/dalvarez83?size=40

This repo runs a quick demo of how to spin up an Apache Iceberg application.

0
6 months ago
https://static.github-zh.com/github_avatars/burakugurr?size=40

We will create a sample lakehouse using Docker, execute an ETL process with Spark, and then access the data in the Iceberg table format from the Nessie Catalog.

Jupyter Notebook 0
1 year ago