21 results
byzer-org/byzer-lang 2.1.0
Byzer (formerly MLSQL): A low-code open-source programming language for data pipelines, analytics and AI.
Scala versions: 2.12 2.11
apache/hudi 0.15.0
Upserts, Deletes And Incremental Processing on Big Data.
Scala versions: 2.13 2.12 2.11
mjakubowski84/parquet4s 2.20.0
Read and write Parquet in Scala. Use Scala classes as schema. No need to start a cluster.
Scala versions: 3.x 2.13 2.12
azure/azure-event-hubs-spark 2.1.5
Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs
Scala versions: 2.11
clustering4ever/clustering4ever 0.11.0
C4E, a JVM friendly library written in Scala for both local and distributed (Spark) Clustering.
Scala versions: 2.11
microsoft/mobius 2.0.200
C# and F# language binding and extensions to Apache Spark
Scala versions: 2.11
gigahexhq/jetprobe 0.1.0
🚀 Validation DSL for data pipelines
Scala versions: 2.12 2.11
grouzen/zio-apache-parquet 0.1.5
Scala ZIO-powered Apache Parquet library
Scala versions: 3.x 2.13
grouzen/zio-apache-arrow 0.1.2
Scala ZIO-powered Apache Arrow library
Scala versions: 3.x 2.13 2.12
zuinnote/hadoopoffice
HadoopOffice - Analyze Office documents using the Hadoop ecosystem (Spark/Flink/Hive)