GitHub Chinese Community

# data-transformation

mahmoud / glom

☄️ Python's nested data operator (and CLI), for all your declarative restructuring needs. Got data? Glom it! ☄️

declarative, data, recursion, Python, utilities, command-line interface, data-transformation, apis, dictionaries
Python, 2.02k stars
5 months ago

hi-primus / optimus

#Computer Science# 🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark

Apache Spark, pyspark, data-wrangling, bigdata, data science, data-cleansing, data-transformation, machine learning, data-profiling, data-extraction, data-exploration, data analysis, data-preparation, cudf, dask, data-cleaning
Python, 1.51k stars
6 months ago

2ndQuadrant / pglogical

Logical Replication extension for PostgreSQL 17, 16, 15, 14, 13, 12, 11, 10, 9.6, 9.5, 9.4 (Postgres), providing much faster replication than Slony, Bucardo or Londiste, as well as cross-version upgra...

PostgreSQL, replication, subscription, Publish-subscribe pattern, data-transformation, etl, cdc, zero-downtime
C, 1.12k stars
25 days ago

bruin-data / bruin

Build data pipelines with SQL and Python, ingest data from different sources, add quality checks, and build end-to-end flows.

analytics, BigQuery, data-modeling, data-pipelines, Python, snowflake, SQL, data analysis, data-transformation, data-ingestion, data-platform
Go, 947 stars
5 days ago

mattt / TransformerKit

A block-based API for NSValueTransformer, with a growing collection of useful examples.

Objective-C, Swift, data-transformation
Objective-C, 842 stars
4 years ago

raystack / optimus

Optimus is an easy-to-use, reliable, and performant workflow orchestrator for data transformation, data modeling, pipelines, and data quality management.

airflow, etl, workflows, automation, Go, BigQuery, data-warehouse, analytics, data-modelling, analytics-engineering, data-transformation, data-pipelines, elt, business-intelligence, DataOps
Go, 749 stars
1 year ago

SebKrantz / collapse

#Time-Series Database# Advanced and Fast Data Transformation in R

R, rstats, statistics, data science, scientific-computing, time-series, panel-data, econometrics, high-performance, data-transformation, data-processing, data-manipulation, data analysis
C, 682 stars
4 days ago

microsoft / prose

Microsoft Program Synthesis using Examples SDK is a framework of technologies for the automatic generation of programs from input-output examples. This repo includes samples and sample data for the Mi...

synthesis, SDK, prose, Microsoft, .NET, C#, data-transformation, data-wrangling, program-synthesis, Example
C#, 643 stars
12 days ago

ScriptFUSION / Porter

💄 Durable and asynchronous data imports for consuming data at scale and publishing testable SDKs.

porter, data-import, framework, data-transformation, abstraction, scalability, asynchronous, Library, fibers
PHP, 612 stars
4 months ago

dbohdan / sqawk

Like awk, but with SQL and table joins

awk, SQL, data-wrangling, command-line interface, CSV, tsv, data-transformation, converter, JSON
Tcl, 315 stars
7 months ago

weAIDB / awesome-data-llm

#Large Language Models# Official Repository of "LLM × DATA" Survey Paper

large language models, data-filtering, data-transformation, vlm
312 stars
6 days ago

jupyter-naas / naas

Low-code Python library to safely use notebooks in production: schedule workflows, generate assets, trigger webhooks, send notifications, build pipelines, manage secrets (Cloud-only)

engine, notebooks, Open Source, data science, dataintegration, pipeline, etl, data-transformation, Jupyter Notebook, jupyterlab, binder, artificial intelligence
Python, 285 stars
4 months ago

feichao93 / temme

📄 Concise selector to extract JSON from HTML.

css-selector, temme-selector, JSON, HTML, data-transformation
TypeScript, 273 stars
1 year ago

fastverse / fastverse

#Time-Series Database# An Extensible Suite of High-Performance and Low-Dependency Packages for Statistical Computing and Data Manipulation in R

high-performance, statistical-computing, data-manipulation, data-transformation, rstats, R, C, C++, time-series, panel-data, data science
R, 272 stars
1 month ago

mahmoudparsian / data-algorithms-with-spark

#Algorithm Practice# O'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian

Apache Spark, pyspark, data, algorithms, transformations, machine learning, design-patterns, Python, design, monoid, mapreduce, data-transformation, dataframes, bigdata
Python, 216 stars
2 years ago

simongray / clojure-dsl-resources

#Natural Language Processing# A curated list of Clojure resources for dealing with domain-specific languages.

dsl, Parsing, data-transformation, domain-specific-language, natural language processing
182 stars
1 year ago

SETL-Framework / setl

#Computer Science# A simple Spark-powered ETL framework that just works 🍺

Apache Spark, etl, framework, Scala, pipeline, data-transformation, data science, data-engineering, data analysis, modularization, dataset, big-data, etl-pipeline, machine learning
Scala, 181 stars
1 month ago

markus-wa / cq

Clojure Query: A Command-line Data Processor for JSON, YAML, EDN, XML and more

Clojure, YAML, JSON, CSV, command-line interface, msgpack, Hacktoberfest, XML, data-processing, transformation, data-transformation
Clojure, 178 stars
9 months ago

strengejacke / sjmisc

Data transformation and utility functions for R

data-transformation, R, data-wrangling, recoding
R, 160 stars
1 year ago

mahmoudparsian / big-data-mapreduce-course

#Algorithm Practice# Big Data Modeling, MapReduce, Spark, PySpark @ Santa Clara University

mapreduce, pyspark, data-transformation, algorithms, Apache Spark, big-data, data analysis, data-engineering, glossary, monoid
HTML, 158 stars
6 months ago