tripl-ai / arc-dataquality-udf-plugin   1.8.0

MIT License Website GitHub

arc-dataquality-udf-plugin defines a set of data quality/validation user defined functions.

Scala versions: 2.12

arc-dataquality-udf-plugin defines a set of data quality/validation user defined functions.

The intention of this plugin is to provide a standard suite of data quality functions that can be used by many people and companies. Feel free to raise pull requests against this repository with reusable functions that may be beneficial to others.

User Defined Functions

  • is_valid_phonenumber(numberToParse, defaultRegion) returns a boolean of whether the phone number appears valid for that region e.g. is_valid_phonenumber('61499000000', 'AU').
  • format_phonenumber(numberToParse, defaultRegion) returns an ISO E164 formatted phone number string e.g. format_phonenumber('61499000000', 'AU') becomes +61499000000.
  • is_valid_abn(abn) returns a boolean of whether the Australian Business Number (ABN) passes the inbuilt checksum function.
  • is_valid_acn(acn) returns a boolean of whether the Australian Company Number (ACN) passes the inbuilt checksum function.

License

Arc is released under the MIT License.

is_valid_phonenumber and format_phonenumber use https://github.com/google/libphonenumber released under the Apache 2.0 license.