GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

data-cleansing

Website
Wikipedia
https://static.github-zh.com/github_avatars/hi-primus?size=40
hi-primus / optimus

#计算机科学#🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark

Apache Sparkpysparkdata-wranglingbigdata数据科学data-cleansingdata-transformation机器学习data-profilingdata-extractiondata-exploration数据分析data-preparationcudfdaskdata-cleaning
Python 1.51 k
6 个月前
https://static.github-zh.com/github_avatars/data-forge?size=40
data-forge / data-forge-ts

The JavaScript data transformation and analysis toolkit inspired by Pandas and LINQ.

data-wranglingdata-forgedata数据分析JavaScriptNode.jslinqpandas可视化数据可视化data-managementdata-manipulationdata-cleaningdata-cleansingCSVJSON
TypeScript 1.36 k
2 个月前
https://static.github-zh.com/github_avatars/Desbordante?size=40
Desbordante / desbordante-core

Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. It also allows to run data cleaning scenarios using these algor...

data-analyticsdata-cleaningdata-cleansingdata-engineeringdata-explorationdata-miningdata-profiling数据科学data-wranglingdata-preprocessingfeature-selectionfeature-engineeringfeature-extractionSpreadsheettabular-dataanomaly-detectionexploratory-data-analysisknowledge-discovery
C++ 405
11 天前
https://static.github-zh.com/github_avatars/BDFD-Learning-Ground?size=40
BDFD-Learning-Ground / Cousera_Google-Data-Analytics-Professional-Certificate

Quizzes & Assignment Solutions for Google Data Analytics Professional Certificate on Coursera. Also included a few resources on side that I found helpful.

data-cleansingSQLdecision-making数据可视化Python数据科学数据分析excelquizCourseraRGoogledatadata-analytics
239
3 年前
https://static.github-zh.com/github_avatars/ajaymache?size=40
ajaymache / data-analysis-using-python

Exploratory data analysis 📊using python 🐍of used car 🚘 database taken from ⓚ𝖆𝖌𝖌𝖑𝖊

数据科学数据分析数据可视化data-cleaningdata-cleansingdata-wranglingdata-analyticsedaexploratory-data-analysiskaggle-competition
Jupyter Notebook 225
6 年前
https://static.github-zh.com/github_avatars/probcomp?size=40
probcomp / PClean

A domain-specific probabilistic programming language for scalable Bayesian data cleaning

probabilistic-programmingdata-cleaningdata-cleansingbayesian-inference
Julia 224
1 年前
https://static.github-zh.com/github_avatars/data-integrations?size=40
data-integrations / wrangler

Wrangler Transform: A DMD system for transforming Big Data

wrangledata-transformationdata-transform数据科学transform-datamanipulate-datacdapbig-datacdap-plugintransformProjectpreparationdata-cleansingdata-prepParsingavro
Java 105
6 天前
https://static.github-zh.com/github_avatars/ojasphansekar?size=40
ojasphansekar / Zillow-Home-Value-Prediction

#计算机科学#XGBoost, LightGBM, LSTM, Linear Regression, Exploratory Data Analysis

Python机器学习exploratory-data-analysisdata-preprocessingdata-cleansing
Jupyter Notebook 45
5 年前
https://static.github-zh.com/github_avatars/iweld?size=40
iweld / data_cleaning

An SQL data cleaning project

SQLdata-analyticsdata-cleansingexcel
31
3 年前
https://static.github-zh.com/github_avatars/kbasu2016?size=40
kbasu2016 / Autism-Detection-in-Adults

This is a binary classification problem related with Autistic Spectrum Disorder (ASD) screening in Adult individual. Given some attributes of a person, my model can predict whether the person would ha...

supervised-learningnaive-bayes-classifierdecision-tree-classifierrandom-forestsupport-vector-machinek-nearest-neighboursdata-wranglingdata-cleansing
Jupyter Notebook 26
7 年前
https://static.github-zh.com/github_avatars/bakdata?size=40
bakdata / dedupe

Java DSL for (online) deduplication

duplicate-detectiondata-cleaningdata-cleansingEntity resolution
Java 20
5 个月前
https://static.github-zh.com/github_avatars/AP-State-Skill-Development-Corporation?size=40
AP-State-Skill-Development-Corporation / Data-Science-Using-Python-Internship-EB1

#计算机科学#This repo created for sharing the required/discussed files during Online Internship training program on Data Science Using Python in May-2021

数据科学机器学习数据分析Pythondata-visualisationdata-cleansing
Jupyter Notebook 14
4 年前
https://static.github-zh.com/github_avatars/TimKong21?size=40
TimKong21 / PwC-Switzerland-Power-BI-in-Data-Analytics-Virtual-Case-Experience

Comprehensive Power BI dashboards showcasing insights on Call Centre Trends, Customer Retention, and Diversity & Inclusion to drive business impact.

powerbi数据可视化interactive-dashboardsbusiness-intelligencecustomer-retentiondata-analyticsdata-cleansing
14
1 年前
https://static.github-zh.com/github_avatars/AlexLamson?size=40
AlexLamson / DataWrangler

Make quick and dirty data mining made easier in Sublime Text

sublime-text-plugindata-cleaningdata-cleansingdata-wrangling
Python 11
4 年前
https://static.github-zh.com/github_avatars/brunocampos01?size=40
brunocampos01 / porto-seguro-safe-driver-prediction

#计算机科学#Predict if a driver will file an insurance claim next year. (Kaggle Competition)

challengekaggledata-engineering机器学习Python数据科学random-forestxgboostkaggle-competitiondata-cleansingdataset
Python 11
3 年前
https://static.github-zh.com/github_avatars/prachitqwer?size=40
prachitqwer / Power-BI---Product-Rationalization

Product Rationalization of Pro Bikes Inc using Power BI

dashboarddata-analyticsdata-cleansingdata-modelingdata-transformation数据可视化financepowerbiSQLsql-server
10
3 年前
https://static.github-zh.com/github_avatars/mtimjones?size=40
mtimjones / dataprocessing

Data cleanse, clustering with Vector Quantization and Adaptive Resonance Theory

数据科学data-cleansing
C 10
8 年前
https://static.github-zh.com/github_avatars/data-forge?size=40
data-forge / data-forge-fs

This library contains the file system extensions to Data-Forge that allow it to directly read and write CSV and JSON files in Node.js

data-wranglingdata-forgedata数据分析JavaScriptNode.jslinqpandas可视化数据可视化data-managementdata-manipulationdata-cleaningdata-cleansingCSVJSON
TypeScript 10
3 年前
https://static.github-zh.com/github_avatars/LieseB-1746743?size=40
LieseB-1746743 / data-cleaning

Data cleaning tool.

data-profilingdata-cleaningdata-cleansing
JavaScript 9
4 年前
https://static.github-zh.com/github_avatars/HypertextAssassin0273?size=40
HypertextAssassin0273 / Excel_Data_Organizer_and_Cleaner-DS_Project

Data Structures project in C++11 language, uses custom Vector & String structures with Move Semantics (Rule of Five)

Open SourceC++open-source-project数据结构vectorstringObject-oriented programming (OOP)data-cleaningdata-cleansingdata-wrangling
C++ 9
2 年前
loading...