Low level integration of Spark and Kafka
Secondary sort and streaming reduce for Apache Spark
Joins for skewed datasets in Spark
Use Cascading Taps and Scalding DSL with Spark
Kerberos based authentication for akka-http using SPNEGO
A tiny library that aims to make Spark SQL Dataset more developer friendly by bringing back the operators we all love to use on key-value RDDs
RDD based implementation of word2phrase algorithm for Apache Spark