GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

data-wrangling

Website
Wikipedia
https://static.github-zh.com/github_avatars/OpenRefine?size=40
OpenRefine / OpenRefine

#数据仓库#OpenRefine(原名Google Refine) 是一个强大的数据清洗和转换工具

datacleansing数据分析JavaOpen Datawikidatajournalism数据科学datajournalismdatacleaningdataminingreconciliationdata-wrangling
Java 11.38 k
4 天前
https://static.github-zh.com/github_avatars/TomWright?size=40
TomWright / dasel

Select, put and delete data from JSON, TOML, YAML, XML and CSV files with a single tool. Supports conversion between formats and can be used as a Go package.

JSONYAMLconfigurationselector数据结构Parseryaml-processorjson-processingdevops-toolsGo命令行界面tomlQuery (disambiguation)updateXMLdata-processingdata-wrangling
Go 7.48 k
3 个月前
khanhnamle1994/cracking-the-data-science-interview
https://static.github-zh.com/github_avatars/khanhnamle1994?size=40
khanhnamle1994 / cracking-the-data-science-interview

#计算机科学#A Collection of Cheatsheets, Books, Questions, and Portfolio For DS/ML Interview Prep

数据科学机器学习深度学习统计Pythonconceptsdata-wrangling
Jupyter Notebook 4.09 k
10 个月前
https://static.github-zh.com/github_avatars/tirthajyoti?size=40
tirthajyoti / Data-science-best-resources

#计算机科学#Carefully curated resource links for data science in one place

数据科学统计机器学习深度学习神经网络Pythonscikit-learnanalyticsAmazon Web Servicesdata-wrangling可视化人工智能RSQL数据库APIreinforcement-learningonline-coursecheatsheetLinux
3.07 k
10 个月前
dathere/qsv
https://static.github-zh.com/github_avatars/dathere?size=40
dathere / qsv

#数据仓库#Blazing-fast Data-Wrangling toolkit

CSVdata-wrangling命令行界面Open Datadata-engineeringckanexcelluauparquetPostgreSQLSQLitepolarsSQLgeocodetimeseriesdcatmetadata统计samplinglibreoffice
Rust 2.88 k
1 天前
iterative/datachain
https://static.github-zh.com/github_avatars/iterative?size=40
iterative / datachain

#大语言模型#ETL, Analytics, Versioning for Unstructured Data

人工智能cvdata-wrangling大语言模型llm-evalmultimodaldata-analyticsembeddingsmlops机器学习
Python 2.58 k
3 天前
https://static.github-zh.com/github_avatars/brimdata?size=40
brimdata / zui

Zui is a powerful desktop application for exploring and working with data. The official front-end to the Zed lake.

datazedCSVdata-analyticsdata-vizdata-wranglingelectron-apptype-systemzui
TypeScript 1.85 k
6 天前
https://static.github-zh.com/github_avatars/ContextLab?size=40
ContextLab / hypertools

#时序数据库#A Python toolbox for gaining geometric insights into high-dimensional data

数据可视化Pythontopic-modelingdata-wrangling可视化time-series
Python 1.85 k
2 个月前
https://static.github-zh.com/github_avatars/hi-primus?size=40
hi-primus / optimus

#计算机科学#🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark

Apache Sparkpysparkdata-wranglingbigdata数据科学data-cleansingdata-transformation机器学习data-profilingdata-extractiondata-exploration数据分析data-preparationcudfdaskdata-cleaning
Python 1.51 k
6 个月前
https://static.github-zh.com/github_avatars/skrub-data?size=40
skrub-data / skrub

#计算机科学#Machine learning with dataframes

机器学习数据科学data-cleaningdatadata-preparationdata-preprocessing数据分析data-wranglingdataframedataframes
Python 1.41 k
4 天前
https://static.github-zh.com/github_avatars/data-forge?size=40
data-forge / data-forge-ts

The JavaScript data transformation and analysis toolkit inspired by Pandas and LINQ.

data-wranglingdata-forgedata数据分析JavaScriptNode.jslinqpandas可视化数据可视化data-managementdata-manipulationdata-cleaningdata-cleansingCSVJSON
TypeScript 1.36 k
2 个月前
https://static.github-zh.com/github_avatars/moderndive?size=40
moderndive / ModernDive_book

Statistical Inference via Data Science: A ModernDive into R and the Tidyverse

数据科学tidyversestatistical-inferenceRggplot2hypothesis-testingregressionregression-models数据可视化data-wranglingrstudiorstats
HTML 778
1 个月前
https://static.github-zh.com/github_avatars/microsoft?size=40
microsoft / prose

Microsoft Program Synthesis using Examples SDK is a framework of technologies for the automatic generation of programs from input-output examples. This repo includes samples and sample data for the Mi...

synthesisSDKproseMicrosoft.NETC#data-transformationdata-wranglingprogram-synthesisExample
C# 643
12 天前
https://static.github-zh.com/github_avatars/stefmolin?size=40
stefmolin / Hands-On-Data-Analysis-with-Pandas-2nd-edition

#计算机科学#Materials for following along with Hands-On Data Analysis with Pandas – Second Edition

数据分析数据科学data-wrangling机器学习pandasdata-manipulation
Jupyter Notebook 637
1 个月前
https://static.github-zh.com/github_avatars/stefmolin?size=40
stefmolin / Hands-On-Data-Analysis-with-Pandas

#计算机科学#Materials for following along with Hands-On Data Analysis with Pandas.

数据分析数据科学data-wrangling机器学习pandasPythonmatplotlibdatascience
Jupyter Notebook 416
5 个月前
https://static.github-zh.com/github_avatars/Desbordante?size=40
Desbordante / desbordante-core

Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. It also allows to run data cleaning scenarios using these algor...

data-analyticsdata-cleaningdata-cleansingdata-engineeringdata-explorationdata-miningdata-profiling数据科学data-wranglingdata-preprocessingfeature-selectionfeature-engineeringfeature-extractionSpreadsheettabular-dataanomaly-detectionexploratory-data-analysisknowledge-discovery
C++ 405
11 天前
https://static.github-zh.com/github_avatars/stefmolin?size=40
stefmolin / pandas-workshop

An introductory workshop on pandas with notebooks and exercises for following along. Slides contain all solutions.

pandaspandas-tutorialpython-data-sciencepython-data-analysispython-dataframesdataframesdata-wrangling数据分析数据可视化Python
Jupyter Notebook 395
7 天前
https://static.github-zh.com/github_avatars/datacarpentry?size=40
datacarpentry / R-ecology-lesson

Data Analysis and Visualization in R for Ecologists

lessonRdata-wranglingdata-visualisation数据可视化englishecologystable
R 322
4 天前
https://static.github-zh.com/github_avatars/georgevbsantiago?size=40
georgevbsantiago / qsacnpj

Pacote que trata e organiza os dados do Cadastro Nacional da Pessoa Jurídica (CNPJ)

data-wranglingR
R 320
4 年前
https://static.github-zh.com/github_avatars/dbohdan?size=40
dbohdan / sqawk

Like awk, but with SQL and table joins

awkSQLdata-wrangling命令行界面CSVtsvdata-transformationconverterJSON
Tcl 315
7 个月前
loading...