GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

datalakehouse

Website
Wikipedia
https://static.github-zh.com/github_avatars/linkedin?size=40
linkedin / openhouse

Open Control Plane for Tables in Data Lakehouse

big-datacatalogdatalakedatalakehousedeclarativeicebergmanagementtables
Java 354
3 天前
https://static.github-zh.com/github_avatars/ociexplained?size=40
ociexplained / how-to-use-OCI

"바로 쓰는 오라클 클라우드 - Build and Delploy Modern Apps with Oracle Cloud"의 전체 소스코드 저장소입니다.

business-intelligenceCI/CDcloud数据科学FlaskistioJenkinsKubernetes微服务MySQLociopensearchOracle 数据库Pythonservice-meshdatalakehouseServerlessautoscalingloadbalancing
Jupyter Notebook 5
2 年前
https://static.github-zh.com/github_avatars/prefeitura-rio?size=40
prefeitura-rio / queries-rj-sms

Projeto dbt do Data Lake da Secretaria Municipal de Saúde

datalakedatalakehousedbthealthhealthcare
PowerShell 4
5 天前
https://static.github-zh.com/github_avatars/aswinjose89?size=40
aswinjose89 / docker-presto-integration

Connecting prestodb with external databases like mongodb, elasticsearch, mysql, hadoob etc to manipulate big data

bigdatadatalakehouseMongoDBprestodb
2
2 年前
https://static.github-zh.com/github_avatars/gabriel-solon-padilha?size=40
gabriel-solon-padilha / criando_um_datalakehouse_databricks

Meu décimo primeiro projeto em que crio um datalakehouse usando computação distribuído no databricks

databricksApache SparkdatalakehousepysparkSQLhadoopparquet
HTML 1
3 年前
https://static.github-zh.com/github_avatars/dwickyferi?size=40
dwickyferi / etl-postgres-to-starrocks-via-risingwave

This repository provides a modular and easy-to-extend ETL pipeline that streams data from a PostgreSQL database into a StarRocks data warehouse using RisingWave as the real-time streaming computation ...

datadatalakedatalakehousedatawarehouseetletl-pipelinePostgreSQLsynchronization
1
1 个月前
https://static.github-zh.com/github_avatars/BsoBird?size=40
BsoBird / filesystem-catalog-original

A prototype for implementing datalake catalog management based on arbitrary file systems

catalogfilesystemhadoopicebergOpen Sources3datalakedatalakehouse
Java 1
7 天前
https://static.github-zh.com/github_avatars/abhinabsarkar?size=40
abhinabsarkar / az-data-services

Azure data services

Azuredatabricksdatalakehousedelta-lake
0
1 年前
https://static.github-zh.com/github_avatars/gillsantos?size=40
gillsantos / streamflake

StreamFlake: Real-Time CDC Pipeline with Kafka and Snowflake

cdcdataengineeringdatalakehousedebeziumkafkaKubernetes微服务minikubePostgreSQLrealtimesnowflake
0
1 年前
https://static.github-zh.com/github_avatars/burakugurr?size=40
burakugurr / data-lakehouse-with-cyber-security-data

We will create a sample lakehouse using Docker, execute an ETL process with Spark, and then access the data in the Iceberg table format from the Nessie Catalog.

datalakehouseiceberglakehouseApache Spark
Jupyter Notebook 0
9 个月前
https://static.github-zh.com/github_avatars/dalvarez83?size=40
dalvarez83 / iceberg-tutorial

This repo is to run a quick demo for how to spin up an Apache Iceberg application.

apache-icebergdatalakehouse
0
3 个月前
https://static.github-zh.com/github_avatars/sainathd07?size=40
sainathd07 / sql-data-warehouse

Building a modern data warehouse with PostgreSQL, including ETL processes, data modeling, and analytics.

datacleaningdataengineeringdatalakedatalakehousedatasciencedatawarehouseetletl-jobetl-pipelinePostgreSQLpostgresql-databaseSQLsql-query
PLpgSQL 0
3 个月前