-
microsoft/synapseml 1.0.8
Simple and Distributed Machine Learning
Scala versions: 2.12 -
feathr-ai/feathr 1.0.0
Feathr – A scalable, unified data and AI engineering platform for enterprise
Scala versions: 2.12 -
haifengl/smile 4.0.0
Statistical Machine Intelligence & Learning Engine
Scala versions: 3.x 2.13 -
swoop-inc/spark-alchemy 1.2.1
Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive
Scala versions: 2.12 -
setl-framework/setl 1.0.0-SNAPSHOT
A simple Spark-powered ETL framework that just works 🍺
Scala versions: 2.12 2.11 -
picnicml/doddle-model 0.0.1-beta5
:cake: doddle-model: machine learning in Scala.
Scala versions: 2.13 2.12 2.11 -
streamnative/pulsar-spark 2.4.5
Spark Connector to read and write with Pulsar
Scala versions: 2.11 -
zenecture/neuroflow 1.8.2
Artificial Neural Networks for Scala
Scala versions: 2.12 -
pityka/nspl 0.10.0
scala plotting (charting, graphing) library
Scala versions: 3.x 2.13Scala.js versions: 1.x -
galliaproject/gallia-core 0.6.1
A schema-aware Scala library for data transformation
Scala versions: 3.x 2.13 2.12 -
facultyai/scala-plotly-client 0.1
Visualise your data from Scala using Plotly
Scala versions: 2.10 -
pityka/saddle 3.5.0
SADDLE: Scala Data Library
Scala versions: 3.x 2.13Scala.js versions: 1.x -
dragonfly-ai/cliviz 0.102
A Scala.js library for Command Line Interface Visualizations.
Scala versions: 3.xScala.js versions: 1.xScala Native versions: 0.4 -
recommenders-team/recommenders 0.6.6
Best Practices on Recommendation Systems
Scala versions: 2.12 -
h2oai/h2o-3 3.30.0.3
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Scala versions: 2.11 -
whylabs/whylogs 0.7.0
An open-source data logging library for machine learning models and data pipelines. 📚 Provides visibility into data quality & model performance over time. 🛡️ Supports privacy-preserving data collection, ensuring safety & robustness. 📈
Scala versions: 2.12 -
catboost/catboost 1.2.7
A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.
Scala versions: 2.13 2.12