数据工程师学习资源清单
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team colla...
Compare tables within or across databases
Scalable and efficient data transformation framework - backwards compatible with dbt.
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
A free to use dbt package for creating and loading Data Vault 2.0 compliant Data Warehouses (powered by dbt, an open source data engineering tool, registered trademark of dbt Labs)
#计算机科学#This repository provides various demos/examples of using Snowpark for Python.
An open source development framework to help you build data workflows and modern data architecture on AWS.
The developer framework for building analytical backends on top of Clickhouse, Redpanda and other high-performance analytical infrastructure
#面试#Roadmap for Data Engineering
Code and data for the Modern Polars book
#数据仓库#Data Engineering Pilipinas is a community for data engineers, data analysts, data scientists, developers, AI / ML engineers, and users of closed and open source data tools and methods / techniques in ...
end-to-end data engineering project to get insights from PyPi using python, duckdb, MotherDuck & Evidence
Все, о чем меня когда-либо спрашивали на собеседованиях, и другие полезные знания в кратком формате
A Data Platform built for AWS, powered by Kubernetes.
Index for online reading materials in order to learn Python and backend development/engineering concepts from scratch and develop a mastery sufficient for Senior/Principal Backend Engineers and Data E...
Simple stream processing pipeline
#计算机科学#Resources about data science, machine learning, deep learning, data engineering, and SQL.