databricks-labs-dqx

Python library for data quality checks and monitoring

pipmacoslinuxwindows
Try with needOr install directly
Source

About

Data Quality eXtended (DQX) is a Python library for data quality checks and data quality monitoring

Commands

dqx

Examples

validate data quality of a table$ dqx validate --table my_table --rules rules.yaml
run data quality checks on a dataset$ dqx check --source data.csv --expectations expectations.json
monitor data quality metrics over time$ dqx monitor --catalog my_catalog --schema my_schema
generate data quality report$ dqx report --input results.json --output report.html
detect anomalies in data quality metrics$ dqx anomaly-detect --metrics metrics.csv --threshold 0.95