Arc is an opinionated framework for defining data pipelines which are predictable, repeatable and manageable.
Arc-Jupyter is an interactive Jupyter Notebooks Extenstion for building Arc data pipelines via Jupyter Notebooks.
Provides the CassandraExtract, CassandraExecute, and CassandraLoad stages
Provides the XMLExtract and XMLLoad stages
arc-dataquality-udf-plugin defines a set of data quality/validation user defined functions.
Provides KafkaExtract, KafkaLoad and KafkaCommitExecute stages
Provides ElasticsearchExtract and ElasticsearchLoad stages
Provides the SASExtract stage
Creates a list of formatted dates to easily calculate delta processing periods.
Provides the MongoDBExtract and MongoDBLoad stages
Provides GeoSpark UDFs functionality to Arc.
Plugin to support extract and load using the spark-bigquery-connector
Provides the DeltaLakeExtract and DeltaLakeLoad stages
Provides the DebeziumTransform stage
Provides the CypherTransform and GraphTransform stages