Optimus is an easy-to-use, reliable, and performant workflow orchestrator for data transformation, data modeling, pipelines, and data quality management.
dbt + Metabase integration
BENERATOR is a leading software solution to generate, obfuscate, pseudonymize and migrate data for development, testing, and training purposes with a model-driven approach.
Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow
#Computer Science#📈 🐍 Multidimensional synthetic data generation with Copula and fPCA models in Python
#Computer Science#Hypergol is a Data Science/Machine Learning productivity toolkit to accelerate any project into production with autogenerated code, standardised structure for data and ML and parallel processing out-...
Cool DE Projects
This repository is a working ETL framework that uses user data from the Spotify API, with ➲Python for Extraction and Transformation ➲SQL for Data Loading and Staging ➲Airflow for Data Orchestration a...
🎥 Email marketing campaign analysis
#Time-Series Database#COVID-19 Surveillance Data Modelling and Management Pipeline in Piedmont.
Extensible Object Model Data Abstraction
This repo covers the processes of designing a database by performing logical, conceptual and physical data modelling processes, creating the designed database using DML and DDL on various database ser...
⚙️ ETL pipeline on AWS using S3 and Redshift
Repository with files that I worked on during the DBS211 (Introduction to Database Systems) course.
Formula 1 race data engineering project which utilises Azure services and Databricks to ingest and analyse the data.
Social blogging community built with React, Next.js, and Firebase.
Data model for the Participatory Knowledge Practices in Analogue and Digital Image Archives (PIA) project
A CIDOC-CRM-based Application Profile, consisting of a set of entities and properties for representing the digitisation process of cultural heritage objects in a machine-readable format.
This repo contains an HR analytics project that analyzes which factors impact employee attrition, using a dataset for Atlas Labs Company.
Data Visualization for Atliq Hardware sales