Data Engineer Learning Resources
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team colla...
Compare tables within or across databases
Scalable and efficient data transformation framework - backwards compatible with dbt.
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
A free-to-use dbt package for creating and loading Data Vault 2.0-compliant data warehouses (powered by dbt, an open source data engineering tool and registered trademark of dbt Labs)
#Computer Science# This repository provides various demos and examples of using Snowpark for Python.
The developer framework for building analytical backends on top of ClickHouse, Redpanda and other high-performance analytical infrastructure
An open source development framework to help you build data workflows and modern data architecture on AWS.
#Interview# Roadmap for Data Engineering
Code and data for the Modern Polars book
#Data Warehouse# Data Engineering Pilipinas is a community for data engineers, data analysts, data scientists, developers, AI / ML engineers, and users of closed and open source data tools and methods / techniques in ...
End-to-end data engineering project to get insights from PyPI using Python, DuckDB, MotherDuck, and Evidence
Everything I have ever been asked in interviews, plus other useful knowledge, in a brief format
A Data Platform built for AWS, powered by Kubernetes.
#Computer Science# Notebooks for tutorials from Marktechpost
Index for online reading materials in order to learn Python and backend development/engineering concepts from scratch and develop a mastery sufficient for Senior/Principal Backend Engineers and Data E...
Simple stream processing pipeline