GitHub Chinese Community

©2025 GitHub Chinese Community

#etl-pipelines

yobix-ai / extractous

#Natural language processing# Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.

extraction, pdf, tika, unstructured, unstructured-data, data-pipelines, docx, etl, etl-pipelines, large language models, machine learning, natural language processing, OCR, pdf-parser, rag, Rust
Rust 1.14k
6 months ago

patterns-app / patterns-devkit

Data pipelines from re-usable components

data science, data analysis, pipelines, etl, etl-pipeline, etl-framework, functional-reactive-programming, data-engineering, SQL, immutability, data-pipeline, data-pipelines, etl-pipelines
Python 108
2 years ago

Burla-Cloud / burla

Scale Python over 10,000 CPUs with one line of code.

batch-processing, Python, data-pipelines, etl-pipelines
TypeScript 73
9 days ago

level-vc / useful

The open-source Useful SDK. One Python decorator in the Useful library allows for full observability of Python functions within an ETL.

etl, etl-pipelines, telemetry
Python 20
1 year ago

Chek0rrdn / DataEngineer_ETL

#Web crawler# A project structure for doing and sharing data engineering work.

Python, cookiecutter, cookiecutter-template, data-extraction, data-engineering, etl, etl-pipeline, etl-pipelines, scraper
Python 8
3 years ago

abrahamkoloboe27 / Airflow-Pipeline-Dashboard-Compagnie-Aerienne

Link to the application

airflow, Docker, Docker Compose, MongoDB, PostgreSQL, Streamlit, data-engineering, etl-pipeline, makefile, duckdb, atlas, Dockerfile, etl, etl-pipelines, mongodb-atlas, orchestration, Python
Python 5
6 months ago

angelxd84130 / Airflow-ETL

Build ETL pipelines on Airflow to load data from BigQuery and store it in MySQL

airflow, apache-airflow, BigQuery, etl, etl-pipeline, MySQL, etl-pipelines
Python 1
3 years ago

prneidhardt / Apache-Data-Pipeline

Sparkify project

Amazon Web Services, etl-pipelines, Python
Jupyter Notebook 1
7 months ago

EmmanuelEzenwere / DataSift

DataSift automatically applies a data pre-processing pipeline to data science projects.

data-engineering, data-preprocessing, data science, etl-pipelines
Python 1
1 year ago

ChristianRCanlas / ChristianRCanlas.github.io

e-Portfolio showcasing my personal projects.

data analysis, data visualization, data-warehousing, etl-pipelines, sql-server, predictive-analytics, Python, tableau, time-series-forecasting, arima
Python 1
5 months ago

SayamAlt / Formula-1-Data-Ingestion-Transformation---ETL-Pipeline

This project demonstrates a complete ETL pipeline for Formula 1 racing data using Azure Databricks, Delta Lake, and Azure Data Factory. It covers data ingestion, transformation with PySpark and Spark ...

data-engineering, data-ingestion, data-transformation, delta-lake, etl-pipelines, microsoft-azure, spark-mllib, spark-sql, spark-streaming, workflow-orchestration
Python 0
7 months ago

pranaypkadu / networksecurity

End-to-end MLOps project with ETL pipelines: building a network security system

aws-ec2, aws-s3, Docker, etl-pipelines, Actions, mlflow, mlops, mongodb-atlas, network-security, NumPy, pandas, Python, PyTorch, scikit-learn, Tensorflow, Visual Studio Code, FastAPI
Python 0
5 months ago

IMAbril / RENIS

Project in progress

data-cleaning, data-governance, data-modeling, data-profiling, data-validation, data-wrangling, etl-pipelines, portfolio-project
Jupyter Notebook 0
4 months ago

ragztigadi / BigData-ETL-Pipelines-Ecommerce

Big Data ETL pipeline for Brazilian e-commerce data. Implements data ingestion, transformation, and storage using Apache Spark, Hadoop, and SQL. Designed for scalable data processing and analytics.

Azure DevOps, MongoDB, MySQL, Python, powerbi, etl-pipelines, SQL
HTML 0
2 months ago

Xuconnika / baboon

#Blog# For scribes of Thoth in the shell: your codebrain’s sacred scroll.

bonobo, data-layer, etl-pipelines, Java, JSON, notes, pyrogram, Python, Script, terminal-based, tools, YAML
Dockerfile 0
1 month ago

Guilherme-B / baboon

JSON-driven ETL pipeline framework prototype

bonobo, etl-pipelines, dag, JSON
Python 0
5 years ago

omar-elmaria / airflow_local

This repo contains the DAGs that run on my local Airflow environment. I use the local environment to test my DAGs before deploying them to virtual machines via Kubernetes

airflow, automation, dags, etl-pipelines, orchestration, Python
Python 0
3 years ago

siddarthaThentu / Disaster-Response-Pipeline

#Computer science# A deployed machine learning model that automatically classifies incoming disaster messages into 36 related categories. Project developed as a part of Udacity's Data Science Nan...

data-analytics, Python, ml-pipelines, etl-pipelines, plotly, Bootstrap, machine learning, Flask, hyperparameter-optimization, feature-engineering
Python 0
4 years ago

juniors90 / PymaciesArg

An extension that registers all pharmacies in Argentina.

datascience, etl-framework, etl-job, etl-pipeline, etl-pipelines, pharmacies, pypi-package, Python
Python 0
3 years ago

extralo / loom

Weaving together different threads (services like image/audio converse, ETL services, etc.) to enable the World Wide Flow

etl-framework, etl-pipelines
JavaScript 0
1 year ago