#

data-pipelines

https://static.github-zh.com/github_avatars/apache?size=40

一个分布式易扩展的可视化DAG工作流任务调度系统。致力于解决数据处理流程中错综复杂的依赖关系,使调度系统在数据处理流程中开箱即用

Java 13.83 k
5 小时前
https://static.github-zh.com/github_avatars/Unstructured-IO?size=40

#自然语言处理#Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to...

HTML 12.66 k
6 天前
StructuredLabs/preswald
https://static.github-zh.com/github_avatars/StructuredLabs?size=40

#大语言模型#Preswald is a WASM packager for Python-based interactive data apps: bundle full complex data workflows, particularly visualizations, into single files, runnable completely in-browser, using Pyodide, D...

Python 4.31 k
2 个月前
https://static.github-zh.com/github_avatars/meltano?size=40

Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to writing, maintaining, and scaling your own API integrations.

Python 2.2 k
3 天前
elementary-data/elementary
https://static.github-zh.com/github_avatars/elementary-data?size=40

The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.

HTML 2.15 k
2 天前
data-engineering-community/data-engineering-wiki
https://static.github-zh.com/github_avatars/data-engineering-community?size=40

The best place to learn data engineering. Built and maintained by the data engineering community.

CSS 1.78 k
1 天前
https://static.github-zh.com/github_avatars/combust?size=40
Scala 1.52 k
10 个月前
https://static.github-zh.com/github_avatars/fmind?size=40

#计算机科学#Kickstart your MLOps initiative with a flexible, robust, and productive Python package.

Jupyter Notebook 1.34 k
1 天前
https://static.github-zh.com/github_avatars/OpenDCAI?size=40
Python 1.29 k
1 天前
loading...
Website
Wikipedia