-
apache/predictionio 0.9.6
PredictionIO, a machine learning server for developers and ML engineers.
Scala versions: 2.10 -
microsoft/synapseml 1.0.8
Simple and Distributed Machine Learning
Scala versions: 2.12 -
h2oai/sparkling-water 2.4.13
Sparkling Water provides H2O functionality inside a Spark cluster
Scala versions: 2.11 -
delta-io/delta-sharing 1.2.2
An open protocol for secure data sharing
Scala versions: 2.13 2.12 -
touk/nussknacker 1.18.0
Low-code tool for automating actions on real time data | Stream processing for the users.
Scala versions: 2.13 2.12 -
yotpoltd/metorikku 0.0.156
A simplified, lightweight ETL Framework based on Apache Spark
Scala versions: 2.12 2.11 -
hydrospheredata/mist 0.6.4
Serverless proxy for a Spark cluster
Scala versions: 2.11 2.10 -
qbeast-io/qbeast-spark 0.7.0
Qbeast-spark: DataSource enabling multi-dimensional indexing and efficient data sampling. Big Data, free from the unnecessary!
Scala versions: 2.12 -
setl-framework/setl 1.0.0-SNAPSHOT
A simple Spark-powered ETL framework that just works 🍺
Scala versions: 2.12 2.11 -
sparkling-graph/sparkling-graph 0.0.7
SparklingGraph provides an easy-to-use set of features for processing large-scale graphs using Spark and GraphX.
Scala versions: 2.11 2.10 -
clustering4ever/clustering4ever 0.11.0
C4E, a JVM friendly library written in Scala for both local and distributed (Spark) Clustering.
Scala versions: 2.11