A Scala API for Apache Beam and Google Cloud Dataflow.
A Scala feature transformation library for data science and machine learning
A tool for data sampling, data generation, and data diffing
A collection of Magnolia add-on modules
Google BigQuery support for Spark, SQL, and DataFrames
Scala Aggregators used for ML Model metrics monitoring
A lightweight workflow definition library
Provides compile-time derivation of conversions between Scala case classes and Tensorflow Example protocol buffers
Community-supported add-ons for Scio
Runs JVM closures in Docker containers on Kubernetes
DBeam exports SQL tables into Avro files using JDBC and Apache Beam