#Computer Science# A Flexible and Powerful Parameter Server for large-scale machine learning
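A Spark-based movie recommendation system, comprising a web crawler project, a website, a back-end administration system, and a Spark recommendation system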
#Computer Science# .NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
Wormhole is a SPaaS (Stream Processing as a Service) Platform
C# and F# language binding and extensions to Apache Spark
An open source framework for building data analytic applications.
Real Time Analytics and Data Pipelines based on Spark Streaming
Scala examples for learning to use Spark
Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used to generate large simulated / synthetic data sets for test, POC...
Data Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsigh...
Databricks framework to validate Data Quality of PySpark DataFrames
Big Data Processing Framework - Unified Data API or SQL on Any Storage
Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs
Spark, Spark Streaming and Spark SQL unit testing strategies
#Computer Science# A complete example of a big data application using: Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLlib, Apache Flink, Scala, Python, Apache Kafka, Apache HBase, Apache Parquet, Apache Avro, Apach...
Self-contained examples of Apache Spark streaming integrated with Apache Kafka.