免费数据工程师视频课程,共9周课时
#计算机科学#Construct a modern data stack and orchestration the workflows to create high quality data for analytics and ML applications.
The bridge to effortless multi-engine data applications, currently supports Snowflake ❄️ and DuckDB 🦆
This project implements an ELT (Extract - Load - Transform) data pipeline with the goodreads dataset, using dagster (orchestration), spark (calculation) and dbt (transformation)
Code/Notes for the Data Engineering Zoomcamp by DataTalksClub
Data Engineering - Metropolitan Transportation Authority (MTA) Subway Data Analysis
ELT Data Pipeline implementation in Data Warehousing environment
Fully dockerized Data Warehouse (DWH) using Airflow, dbt, PostgreSQL and dashboard using redash
A Data Engineering Project that implements an ETL data pipeline using Dagster, Apache Spark, Streamlit, MinIO, Metabase, Dbt, Polars, Docker. Data from kaggle and youtube-api
American Community Survey data on people and households
An end-to-end open-source data stack for crawling and visualizing real estate data, facilitating insights into market trends.
Covid19 and Iowa Liquor Sales analysis at BigQuery using dbt, Airflow, Marquez, Google Cloud and other modern data stack tools
Social Media Analysis, scalable solution, flexible deployment that analyses social media contents
A simple data pipeline using dbt, pandas, postgresql to transform data using dbt as a transformation tool and postgres as the warehouse
A project that creates a scalable and robust ELT data pipeline leveraging PostgreSQL , DBT, Orchestration using airflow and data visualization with Redash
Building a Data Lakehouse using the Medallion architecture.
Data warehouse building by scrapping data from telegram
Data Fellowship 9 Final Project - Bank Marketing Campaign Data Pipeline
Design and implement a data warehouse to manage automobile accident cases across all 49 states in the US, using a star schema and Snowflake for the data warehouse architecture.