GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

massive-datasets

Website
Wikipedia
https://static.github-zh.com/github_avatars/polardb?size=40
polardb / polardbx-sql

GalaxySQL 是 PolarDB-X 的计算节点。PolarDB-X 是一款面向超高并发、海量存储、复杂查询场景设计的云原生分布式数据库系统。其采用 Shared-nothing 与存储计算分离架构,支持水平扩展、分布式事务、混合负载等能力,具备企业级、云原生、高可用、高度兼容 MySQL 系统及生态等特点。

horizontal-scalingdistributed-transactionshtapenterprise-classcloud-nativehigh-availabilityMySQLhigh-concurrencymassive-datasetsrelational-database
Java 1.61 k
10 天前
https://static.github-zh.com/github_avatars/helmholtz-analytics?size=40
helmholtz-analytics / heat

#计算机科学#Distributed tensors and Machine Learning framework with GPU and MPI acceleration in Python

gputensorsdistributed机器学习mpiNumPyPythonPyTorchdata-analyticsdata-processing数据科学hpcmassive-datasetsparallelism
Python 222
5 天前
https://static.github-zh.com/github_avatars/polardb?size=40
polardb / polardbx

PolarDB-X is a cloud native distributed SQL Database designed for high concurrency, massive storage, complex querying scenarios.

MySQLdistributed-transactionscloud-nativehigh-availabilityrelational-databaseshigh-concurrencymassive-datasetshtaphorizontal-scalingenterprise-class
Makefile 81
7 个月前
https://static.github-zh.com/github_avatars/joshuaboud?size=40
joshuaboud / gen-dataset

Command line tool to quickly generate a lot of files in a lot of directories

Linuxdatasetbenchmarkingmassive-datasetscli-toolmultithreadingevaluation
C++ 6
3 年前
https://static.github-zh.com/github_avatars/rajeshidumalla?size=40
rajeshidumalla / Bloom-Filter

#计算机科学#Building a Bloom Filter on English dictionary words

bloom-filtermassive-datasetsPython数据科学机器学习数据分析
Jupyter Notebook 4
4 年前
https://static.github-zh.com/github_avatars/FedericoBruzzone?size=40
FedericoBruzzone / anti-money-laundering

#计算机科学#The project is based on the analysis of the "IBM Transactions for Anti Money Laundering" dataset published on Kaggle. The task is to implement a model which predicts whether or not a transaction is il...

机器学习massive-datasetspyspark
Jupyter Notebook 3
10 个月前
https://static.github-zh.com/github_avatars/FedericoBruzzone?size=40
FedericoBruzzone / algorithms-for-massive-datasets

#算法刷题#This repository contains a LaTeX file that generates a PDF document comprising comprehensive notes for the course "Algorithms for Massive Datasets"

算法深度学习massive-datasetsrecommender-system
TeX 2
10 个月前
https://static.github-zh.com/github_avatars/gmalik9?size=40
gmalik9 / floating_point_data_compressor

gipa -- compression/decompression tool to package compress and encode massive archive files with floating-point data

compressioncompressorautoencoderfloating-pointmassive-datasets数据可视化data-compressionrepresentationrepresentation-learning
Python 2
8 年前
https://static.github-zh.com/github_avatars/rajeshidumalla?size=40
rajeshidumalla / PageRank

#计算机科学#Building PageRank algorithm on Web Graph around Stanford.edu using NetworkX python library

pagerank-algorithm机器学习massive-datasets数据分析数据科学PythonApache SparkpandasNumPy
Jupyter Notebook 2
4 年前
https://static.github-zh.com/github_avatars/StefanoBalbo?size=40
StefanoBalbo / Geocoding

Automated massive geolocator of addresses with parallel processing.

DockergeocodinggeolocationgeopandasgeospatialJupyter Notebookjupyterlabmassive-datasetsmassively-parallelnominatimosmPythonspatial-analysisssh-server
Jupyter Notebook 1
3 个月前
https://static.github-zh.com/github_avatars/datakaveri?size=40
datakaveri / k-anonymisation-SKALD

Scalable, chunk-wise K-anonymization tool based on the Optimal Lattice Anonymization (OLA) algorithm. It is designed to handle large datasets by processing them in manageable chunks, ensuring data pri...

chunkingencodingmassive-datasets
Python 1
13 天前
https://static.github-zh.com/github_avatars/Alex4gtx?size=40
Alex4gtx / Massive-Data-Handler

Permite abrir e manipular arquivos massivos de texto/dados cujo seria impossivel abrir em um computador, por exemplo um arquivo de texto de +20gb, permite manipular o arquivo pegando apenas as linhas ...

big-datadictionariespython-scriptmassive-datasets
Python 1
3 年前
https://static.github-zh.com/github_avatars/diem-ai?size=40
diem-ai / google-bigquery

Series of SQL exercise working with databases, using Google BigQuery to scale to massive datasets taught by educators in Kaggle.com

BigQuerySQLPythonmassive-datasetsanalyticskaggle
Jupyter Notebook 1
6 年前
https://static.github-zh.com/github_avatars/rajeshidumalla?size=40
rajeshidumalla / node2vec

#计算机科学#Building node2vec algorithm

node2vec机器学习数据科学数据分析massive-datasetsPythonNumPypandasmatplotlib
Jupyter Notebook 1
4 年前
https://static.github-zh.com/github_avatars/arhcoder?size=40
arhcoder / Netflix-Recommendation

📺 Content Recommendation System for the Netflix Prize Challenge with Collaborative Filtering.

collaborative-filteringJupyter Notebookmassive-datasetsNetflixPythonrecommendation-enginerecommendation-systemrecommender-system
Jupyter Notebook 1
1 年前
https://static.github-zh.com/github_avatars/manuparra?size=40
manuparra / hadoop-statistics

Calculate statistical measures of one column in big data Datasets with these simply Hadoop Application

maxminJavabigdatamassive-datasetshadoop
Java 1
8 年前
https://static.github-zh.com/github_avatars/rajeshidumalla?size=40
rajeshidumalla / Wordcount-in-Spark

word count in Spark

Apache SparkPythonmassive-datasetspython-librarypandas
Jupyter Notebook 0
4 年前
https://static.github-zh.com/github_avatars/KolwaBrad?size=40
KolwaBrad / massivedataset

Training the MASSIVE dataset by Amazon(english-US, German-DE and Swahili-KE)

massive-datasetsPython
Python 0
2 年前
https://static.github-zh.com/github_avatars/simkarwin?size=40
simkarwin / mimo_keras

TF-Package: Multiple-Input Multiple-Output Keras Data-Generator for massive and complex datasets

massive-datasets
Python 0
2 年前
https://static.github-zh.com/github_avatars/dhruv3?size=40
dhruv3 / MRbasedFriendRecommender

Map Reduce program to suggest new friends based on count of mutual friends

mapreduceJavadataminingmassive-datasets
Java 0
7 年前
loading...