dbt · GitHub Topics

DataTalksClub / data-engineering-zoomcamp

Free data engineering video course, 9 weeks long

data-engineering kafka Apache Spark dbt Docker kestra

Jupyter Notebook 32.48 k

1 month ago

zsvoboda / ngods-stocks

New Generation Opensource Data Stack Demo

cube dagster datahub dbt iceberg metabase Python Apache Spark spark-sql trino

Jupyter Notebook 441

3 years ago

GokuMohandas / data-engineering

#Computer Science# Construct a modern data stack and orchestrate the workflows to create high-quality data for analytics and ML applications.

data-engineering etl machine learning mlops orchestration airflow data-warehouse dbt

Jupyter Notebook 223

3 years ago

buremba / universql

The bridge to effortless multi-engine data applications, currently supports Snowflake ❄️ and DuckDB 🦆

dbt duckdb snowflake databricks SQL proxy-server

Jupyter Notebook 179

4 months ago

lelouvincx / goodreads-elt-pipeline

This project implements an ELT (Extract - Load - Transform) data pipeline with the goodreads dataset, using dagster (orchestration), spark (calculation) and dbt (transformation)

dagster goodreads minio pipeline Python Apache Spark dbt elt

Jupyter Notebook 38

2 years ago

Balajirvp / DE-Zoomcamp

Code/Notes for the Data Engineering Zoomcamp by DataTalksClub

BigQuery dataengineering dbt Docker Google Cloud prefect streaming pyspark Python Apache Spark SQL Terraform datalake datawarehouse

Jupyter Notebook 31

2 years ago

ozkary / data-engineering-mta-turnstile

Data Engineering - Metropolitan Transportation Authority (MTA) Subway Data Analysis

data-lake Docker analysis data-engineering Python SQL static-analysis Visual Studio Code BigQuery data-modeling data-orchestration data-pipeline data-warehouse dbt Jupyter Notebook prefect Terraform

Jupyter Notebook 29

9 months ago

dain55788 / ELT-Data-Pipeline

ELT data pipeline implementation in a data warehousing environment

apache-airflow data-engineering dbt great-expectations PostgreSQL powerbi Apache Spark minio trino

Jupyter Notebook 26

4 months ago

Nathnael12 / Datawarehouse

Fully dockerized data warehouse (DWH) using Airflow, dbt, and PostgreSQL, with a dashboard built in Redash

airflow dataengineering datawarehouse dbt PostgreSQL redash

Jupyter Notebook 24

3 years ago

longNguyen010203 / Youtube-Recommend-Master-ETL-Pipeline

A data engineering project that implements an ETL data pipeline using Dagster, Apache Spark, Streamlit, MinIO, Metabase, dbt, Polars, and Docker. Data comes from Kaggle and the YouTube API.

dagster etl-pipeline minio Apache Spark dbt Docker Docker Compose Dockerfile MySQL PostgreSQL data-engineering data-engineering-pipeline pyspark processing Streamlit polars YouTube youtube-api metabase

Jupyter Notebook 23

9 months ago

jaanli / american-community-survey

American Community Survey data on people and households

big-data big-data-analytics census community data-engineering dbt JavaScript observable survey TypeScript

Jupyter Notebook 19

9 months ago

Quocc1 / OpenStack

An end-to-end open-source data stack for crawling and visualizing real estate data, facilitating insights into market trends.

dagster dbt iceberg metabase minio notebook PostgreSQL Apache Spark trino Docker

Jupyter Notebook 14

1 year ago

himewel / covid19retail

COVID-19 and Iowa liquor sales analysis in BigQuery using dbt, Airflow, Marquez, Google Cloud, and other modern data stack tools

airbyte airflow dbt superset

Jupyter Notebook 14

3 years ago

koksang / social-media-analysis

Social media analysis: a scalable, flexibly deployable solution that analyzes social media content

BigQuery Python ray kafka apache-airflow social-media X (Twitter) data-engineering data-engineering-pipeline dbt Google Cloud etl

Jupyter Notebook 9

2 years ago

jbassie / ETL-PROJECT

A simple data pipeline built with dbt, pandas, and PostgreSQL, using dbt as the transformation tool and Postgres as the warehouse

beautifulsoup4 dbt pandas postgersql Python

Jupyter Notebook 5

3 years ago

ProgrammingOperative / elt_with_scalable_data_warehouse

A project that creates a scalable and robust ELT data pipeline using PostgreSQL and dbt, with orchestration in Airflow and data visualization in Redash

dbt PostgreSQL airflow redash

Jupyter Notebook 2

2 years ago

MekWiset / Medallion_DataLakehouse

Building a Data Lakehouse using the Medallion architecture.

Azure dbt databricks

Jupyter Notebook 2

1 year ago

DegaregeN / Telegram-

Building a data warehouse by scraping data from Telegram

data science datawarehouse dbt FastAPI Telegram telepathy yolo

Jupyter Notebook 1

8 months ago

addin12 / DF9-bank-marketing-data-pipelines

Data Fellowship 9 Final Project - Bank Marketing Campaign Data Pipeline

airflow BigQuery cloud-storage data-engineering data-modelling data-pipeline data visualization dbt Google Cloud kafka Apache Spark tableau

Jupyter Notebook 1

2 years ago

longNguyen010203 / DATA-WAREHOUSE-ACCIDENT-US-2016-2023

Design and implement a data warehouse to manage automobile accident cases across all 49 states in the US, using a star schema and Snowflake for the data warehouse architecture.

apache-airflow Apache Spark data-ingestion data-processing data-quality-checks data-transformation data-warehouse dbt dimensions Docker Docker Compose Dockerfile FastAPI minio powerbi pyspark snowflake star-schema

Jupyter Notebook 1

8 months ago