GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

etl-job

Website
Wikipedia
https://static.github-zh.com/github_avatars/AlexIoannides?size=40
AlexIoannides / pyspark-example-project

Implementing best practices for PySpark ETL jobs and applications.

pysparketl-jobPythondata-engineeringApache Spark数据科学etletl-pipeline
Python 1.97 k
3 年前
san089/goodreads_etl_pipeline
https://static.github-zh.com/github_avatars/san089?size=40
san089 / goodreads_etl_pipeline

An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.

etl-pipelineetl-frameworkApache Sparkapache-airflowairflowredshiftemr-clusterlivys3data-lakeschedulerdata-migrationdata-engineeringdata-engineering-pipelinePythonetl-job
Python 1.4 k
5 年前
https://static.github-zh.com/github_avatars/paillave?size=40
paillave / Etl.Net

Mass processing data with a complete ETL for .net developers

etl.NETdotnet-standardbusiness-intelligenceextracttransformloadCSVcsv-parsercsv-readerentity-frameworketl-jobsftp
C# 759
4 天前
https://static.github-zh.com/github_avatars/DataWithBaraa?size=40
DataWithBaraa / sql-data-warehouse-project

A comprehensive guide to building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics.

数据分析data-analyticsdata-cleaningdata-engineering数据科学data-warehousedata-warehousingdatalakedatasciencedatawarehouseetletl-jobetl-pipelineSQLsql-querysql-server
TSQL 250
3 个月前
https://static.github-zh.com/github_avatars/jbogard?size=40
jbogard / bulk-writer

Provides guidance for fast ETL jobs, an IDataReader implementation for SqlBulkCopy (or the MySql or Oracle equivalents) that wraps an IEnumerable, and libraries for mapping entites to table columns.

etlSQLsqlbulkcopypipelineetl-job
C# 243
1 年前
https://static.github-zh.com/github_avatars/visiologyofficial?size=40
visiologyofficial / vixtract

etletl-pipelineetl-frameworketl-job
HTML 45
9 个月前
https://static.github-zh.com/github_avatars/cloudposse?size=40
cloudposse / terraform-aws-glue

Terraform modules for provisioning and managing AWS Glue resources

Amazon Web Servicesetletl-jobglueworkflow
HCL 33
1 个月前
https://static.github-zh.com/github_avatars/felipefrizzo?size=40
felipefrizzo / terraform-aws-kinesis-firehose

This code creates a Kinesis Firehose in AWS to send CloudWatch log data to S3.

Terraformterraform-awsterraform-providerparquetbig-dataetl-jobanalytics
HCL 26
4 年前
https://static.github-zh.com/github_avatars/ktnsh24?size=40
ktnsh24 / DataModelling

This repo will guide you step-by-step method to create star schema dimensional model.

etl-jobSQLMySQL
Python 25
4 年前
https://static.github-zh.com/github_avatars/nsphung?size=40
nsphung / pyspark-template

A Python PySpark Projet with Poetry

poetrypysparkProjectPythontemplateblackpytestisortJupyter NotebookApache Sparkspark-sqldata-engineering数据科学etl-jobetletl-pipeline
Jupyter Notebook 23
17 天前
https://static.github-zh.com/github_avatars/michaelbironneau?size=40
michaelbironneau / analyst

A declarative, SQL-like DSL for data integration tasks.

SQLdata-integrationetletl-job
Go 14
7 年前
https://static.github-zh.com/github_avatars/kishlayjeet?size=40
kishlayjeet / Twitter-Data-Pipeline-using-Airflow-and-AWS-S3

An end-to-end Twitter Data Pipeline that extracts data from Twitter and loads it into AWS S3.

airflowapache-airflowboto3data-engineeringdata-engineering-pipelinedata-pipelineetletl-jobetl-pipelinePythons3schedulertweepyX (Twitter)twitter-api
Python 13
2 年前
https://static.github-zh.com/github_avatars/yennanliu?size=40
yennanliu / AirflowJob

#计算机科学#Airflow POC demo : 1) env set up 2) airflow DAG 3) Spark/ML pipeline | #DE

etl-jobairflowApache Sparkdata-engineering机器学习InstagrametlDockerPythontravis数据科学infrastructureenvironmenteltetl-pipelinedagShell
Python 12
3 年前
https://static.github-zh.com/github_avatars/ankiano?size=40
ankiano / etl

Extract transform load CLI tool for extracting small and middle data volume from sources (databases, csv files, xls files, gspreadsheets) to target (databases, csv files, xls files, gspreadsheets) in ...

etldata-engineeringdata-pipelineetl-jobextractor数据库Google Sheetsdata-lakeeltbusiness-intelligence
Python 11
2 个月前
https://static.github-zh.com/github_avatars/Joshua-omolewa?size=40
Joshua-omolewa / Retailstore_ETL_pipeline_project

Built a Data Pipeline for a Retail store using AWS services that collects data from its transactional database (OLTP) in Snowflake and transforms the raw data (ETL process) using Apache spark to meet ...

airflowDockeretl-jobetl-pipelinePythonsnowflakeApache Spark
Python 9
2 年前
https://static.github-zh.com/github_avatars/TheCocoTeam?size=40
TheCocoTeam / source-watcher-core

This is a PHP project which combines ETL with different strategies to extract data from multiple databases, files, and services, transform it and load it into multiple destinations.

etletl-frameworketl-pipelineetl-jobCSVtransformation
PHP 9
4 个月前
https://static.github-zh.com/github_avatars/2298-Software?size=40
2298-Software / Mambo

A simple in-memory, configuration driven, data processing pipeline for Apache Spark.

Apache Sparketl-frameworketl-jobturbinestreamhadooppipeline
Scala 5
3 年前
https://static.github-zh.com/github_avatars/amantewary?size=40
amantewary / Sentiment-Analysis-of-Tweets-Using-ETL-process-and-Elastic-Search

Sentiment Analysis of Tweets Using ETL process and Elastic Search

sentiment-analysisetl-jobelasticsearchAzure
Python 4
7 年前
https://static.github-zh.com/github_avatars/achugr?size=40
achugr / flink-comms-processing

Comms processing (ETL) with Apache Flink.

flinkflink-examplesetletl-pipelineetl-job
Java 4
5 年前
https://static.github-zh.com/github_avatars/amrelauoty?size=40
amrelauoty / Telecom-ETL-SSIS

Telecom ETL is a SSIS package that ingest it's data from CSVs to DB

csv-importetletl-job
TSQL 4
3 年前
loading...