GitHub Chinese Community
#etl-job

AlexIoannides / pyspark-example-project

Implementing best practices for PySpark ETL jobs and applications.

pyspark, etl-job, Python, data-engineering, Apache Spark, data science, etl, etl-pipeline
Python 1.93k
2 years ago
san089 / goodreads_etl_pipeline

An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.

etl-pipeline, etl-framework, Apache Spark, apache-airflow, airflow, redshift, emr-cluster, livy, s3, data-lake, scheduler, data-migration, data-engineering, data-engineering-pipeline, Python, etl-job
Python 1.39k
5 years ago
paillave / Etl.Net

Mass data processing with a complete ETL for .NET developers

etl, .NET, dotnet-standard, business-intelligence, extract, transform, load, CSV, csv-parser, csv-reader, entity-framework, etl-job, sftp
C# 752
1 month ago
jbogard / bulk-writer

Provides guidance for fast ETL jobs, an IDataReader implementation for SqlBulkCopy (or the MySql or Oracle equivalents) that wraps an IEnumerable, and libraries for mapping entities to table columns.

etl, SQL, sqlbulkcopy, pipeline, etl-job
C# 241
1 year ago
DataWithBaraa / sql-data-warehouse-project

A comprehensive guide to building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics.

data analysis, data-analytics, data-cleaning, data-engineering, data science, data-warehouse, data-warehousing, datalake, datascience, datawarehouse, etl, etl-job, etl-pipeline, SQL, sql-query, sql-server
TSQL 198
2 months ago
visiologyofficial / vixtract

etl, etl-pipeline, etl-framework, etl-job
HTML 45
8 months ago
cloudposse / terraform-aws-glue

Terraform modules for provisioning and managing AWS Glue resources

Amazon Web Services, etl, etl-job, glue, workflow
HCL 31
5 days ago
felipefrizzo / terraform-aws-kinesis-firehose

This code creates a Kinesis Firehose in AWS to send CloudWatch log data to S3.

Terraform, terraform-aws, terraform-provider, parquet, big-data, etl-job, analytics
HCL 26
4 years ago
ktnsh24 / DataModelling

This repo guides you step by step through creating a star schema dimensional model.

etl-job, SQL, MySQL
Python 25
4 years ago
nsphung / pyspark-template

A Python PySpark Project with Poetry

poetry, pyspark, Project, Python, template, black, pytest, isort, Jupyter Notebook, Apache Spark, spark-sql, data-engineering, data science, etl-job, etl, etl-pipeline
Jupyter Notebook 23
9 months ago
michaelbironneau / analyst

A declarative, SQL-like DSL for data integration tasks.

SQL, data-integration, etl, etl-job
Go 14
7 years ago
kishlayjeet / Twitter-Data-Pipeline-using-Airflow-and-AWS-S3

An end-to-end Twitter Data Pipeline that extracts data from Twitter and loads it into AWS S3.

airflow, apache-airflow, boto3, data-engineering, data-engineering-pipeline, data-pipeline, etl, etl-job, etl-pipeline, Python, s3, scheduler, X (Twitter), twitter-api
Python 12
2 years ago
yennanliu / AirflowJob

#Computer Science# Airflow POC demo: 1) env setup 2) Airflow DAG 3) Spark/ML pipeline | #DE

etl-job, airflow, Apache Spark, data-engineering, machine learning, Instagram, etl, Docker, Python, travis, data science, infrastructure, environment, elt, etl-pipeline, dag, Shell
Python 12
2 years ago
ankiano / etl

Extract transform load CLI tool for extracting small and middle data volume from sources (databases, csv files, xls files, gspreadsheets) to target (databases, csv files, xls files, gspreadsheets) in ...

etl, data-engineering, data-pipeline, etl-job, extractor, database, Google Sheets, data-lake, elt, business-intelligence
Python 11
18 days ago
Joshua-omolewa / Retailstore_ETL_pipeline_project

Built a Data Pipeline for a Retail store using AWS services that collects data from its transactional database (OLTP) in Snowflake and transforms the raw data (ETL process) using Apache Spark to meet ...

airflow, Docker, etl-job, etl-pipeline, Python, snowflake, Apache Spark
Python 9
2 years ago
TheCocoTeam / source-watcher-core

This is a PHP project which combines ETL with different strategies to extract data from multiple databases, files, and services, transform it and load it into multiple destinations.

etl, etl-framework, etl-pipeline, etl-job, CSV, transformation
PHP 9
2 months ago
2298-Software / Mambo

A simple in-memory, configuration-driven data processing pipeline for Apache Spark.

Apache Spark, etl-framework, etl-job, turbine, stream, hadoop, pipeline
Scala 5
2 years ago
amantewary / Sentiment-Analysis-of-Tweets-Using-ETL-process-and-Elastic-Search

Sentiment Analysis of Tweets Using ETL process and Elastic Search

sentiment-analysis, etl-job, elasticsearch, Azure
Python 4
7 years ago
achugr / flink-comms-processing

Comms processing (ETL) with Apache Flink.

flink, flink-examples, etl, etl-pipeline, etl-job
Java 4
5 years ago
ShihWen / tpe-mrt-traffic-etl

A data pipeline from source to data warehouse using Taipei Metro Hourly Traffic data

airflow, Python, data-engineering, data-engineering-pipeline, data-warehouse, etl-job, etl-pipeline, redshift, s3
Jupyter Notebook 3
2 years ago