Version: 1.5.7

Accelerating test coverage

GX Cloud provides several conveniences to help you more quickly achieve test coverage for your data. This page provides an overview of the following options:

Automating Anomaly Detection rules.
Generating personalized AI-recommended rules.
Generating code for custom SQL Expectations.

Anomaly Detection

When you add a new Data Asset, GX Cloud by default generates Expectations that detect anomalies for the following data quality issues. You can also enable Anomaly Detection for these issues when you add Expectations to an existing Data Asset.

Schema
Volume
Completeness
Uniqueness (coming soon)

Schema

To detect schema anomalies, we automatically generate an "expect table columns to match set" rule that uses the Data Asset’s initial columns as the set to match. If the number or names of columns in the Data Asset change, this Expectation will fail.
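As a rough illustration of what this rule checks, the following plain-Python sketch compares a current column list against an initial one. The function name and sample columns are hypothetical, and this is not GX Cloud's implementation; it only mirrors the pass/fail behavior described above. It also assumes column order is irrelevant, which the description does not state.

```python
def columns_match_set(current_columns, initial_columns):
    """Return True only when both column lists contain exactly the same names."""
    return set(current_columns) == set(initial_columns)


# Hypothetical columns captured when the Data Asset was first added.
initial = ["id", "email", "created_at"]

unchanged = columns_match_set(["id", "email", "created_at"], initial)
renamed = columns_match_set(["id", "email_address", "created_at"], initial)
added = columns_match_set(["id", "email", "created_at", "phone"], initial)
```

In this sketch, only the unchanged column list passes; a renamed or added column causes the check to fail.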

Volume

To detect anomalies in row count, we automatically generate a rule to expect the table row count to fall within a forecasted range. The rule tests that the row count in the current validation run doesn’t deviate significantly from historical patterns. For example, this Expectation will fail if volume suddenly increases after being stable, or if it stagnates in a season when it typically grows.
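The idea of a forecasted range can be sketched in plain Python. The forecasting method GX Cloud uses is not described here, so this sketch substitutes a simple least-squares trend with a fixed percentage tolerance; the function names, the 5% tolerance, and the sample row counts are all assumptions made for illustration.

```python
def forecast_next(history):
    """Extrapolate the next value from a least-squares line through the history."""
    n = len(history)
    if n < 2:
        raise ValueError("need at least two historical row counts")
    x_mean = (n - 1) / 2
    y_mean = sum(history) / n
    sxx = sum((x - x_mean) ** 2 for x in range(n))
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in enumerate(history))
    slope = sxy / sxx
    return y_mean + slope * (n - x_mean)


def row_count_within_forecast(current, history, tolerance=0.05):
    """Check whether the current count lies within +/- tolerance of the forecast."""
    predicted = forecast_next(history)
    margin = abs(predicted) * tolerance
    return predicted - margin <= current <= predicted + margin


# Hypothetical row counts from earlier validation runs.
stable = [1000, 1000, 1000, 1000]
growing = [1000, 1100, 1200, 1300]

spike_after_stable = row_count_within_forecast(1500, stable)
stagnation_during_growth = row_count_within_forecast(1300, growing)
on_trend = row_count_within_forecast(1400, growing)
```

Under these assumptions, a sudden spike after stable volume and a flat count during a growth trend both fall outside the range, while a count that continues the trend falls inside it.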

Completeness

To detect completeness anomalies, we automatically generate rules for every column to expect column proportion of non-null values to be between thresholds that depend on the column's initial proportion of non-null values.

If a column initially has no null values, GX Cloud generates a rule to test that the column continues to have no null values.
If a column initially has all null values, GX Cloud generates a rule to test that the column continues to have all null values.
If a column starts with a mix of null and non-null values, GX Cloud generates a rule with dynamic parameters to test that the proportions stay close to the average of the last 5 Validation runs.

If the proportions change at all for a column that started with all null values or no null values, its generated completeness Expectation will fail. If the proportions change only slightly for a column that started with a mix of null and non-null values, its generated completeness Expectation will pass; if the change is drastic, the Expectation will fail.
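The three cases above can be sketched in plain Python. This is a minimal sketch, not GX Cloud's implementation: the function names and sample proportions are hypothetical, and the tolerance of 0.1 around the 5-run average is an assumed value, since the actual threshold width is not stated here.

```python
def completeness_bounds(initial_proportion, recent_proportions, tolerance=0.1):
    """Return (min, max) bounds for a column's proportion of non-null values."""
    if initial_proportion == 1.0:
        # No nulls initially: the column must keep having no nulls.
        return (1.0, 1.0)
    if initial_proportion == 0.0:
        # All nulls initially: the column must keep having all nulls.
        return (0.0, 0.0)
    # Mixed column: center the bounds on the average of the last 5 runs.
    window = recent_proportions[-5:] or [initial_proportion]
    average = sum(window) / len(window)
    return (max(0.0, average - tolerance), min(1.0, average + tolerance))


def completeness_passes(observed, initial_proportion, recent_proportions):
    """Check an observed non-null proportion against the generated bounds."""
    low, high = completeness_bounds(initial_proportion, recent_proportions)
    return low <= observed <= high


# Hypothetical non-null proportions from earlier validation runs.
history = [0.5, 0.6, 0.5, 0.6, 0.5, 0.6]

full_column_gains_nulls = completeness_passes(0.99, 1.0, [])
empty_column_gains_values = completeness_passes(0.01, 0.0, [])
mixed_small_change = completeness_passes(0.58, 0.5, history)
mixed_drastic_change = completeness_passes(0.2, 0.5, history)
```

Here any change fails for the columns that started fully populated or fully null, while the mixed column tolerates a small shift and fails on a drastic one.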

ExpectAI Beta

ExpectAI is an analytical AI tool that powers several features in GX Cloud.

Personalized AI-recommended rules

When you generate Expectations, ExpectAI performs deep analysis on a given Data Asset to set Expectations based on patterns in the data. These AI-recommended data quality rules are sometimes based on anomalies detected in the data, so they may fail on the first validation to bring your attention to potential problems.

Code for custom SQL Expectations

To simplify working with custom SQL Expectations, you can use ExpectAI to generate a SQL query based on a natural language prompt you provide and a data profile GX Cloud automatically provides.
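To give a sense of the workflow, the sketch below pairs a hypothetical natural language prompt with the kind of query it might yield, then runs that query against an in-memory SQLite table. It assumes two things not stated in this section: that the query refers to the Data Asset through a `{batch}` placeholder, and that a custom SQL Expectation fails when the query returns one or more rows. The table, columns, prompt, and helper function are invented for illustration and are not output from ExpectAI.

```python
import sqlite3

# Hypothetical prompt: "Flag orders whose ship date is before the order date."
# Hypothetical generated query; {batch} stands in for the Data Asset's table.
QUERY_TEMPLATE = "SELECT * FROM {batch} WHERE ship_date < order_date"


def unexpected_rows(conn, table_name, query_template=QUERY_TEMPLATE):
    """Run the query against a table and return the rows it flags."""
    # table_name is a trusted identifier here, not user input.
    query = query_template.format(batch=table_name)
    return conn.execute(query).fetchall()


conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (id INTEGER, order_date TEXT, ship_date TEXT)")
conn.executemany(
    "INSERT INTO orders VALUES (?, ?, ?)",
    [
        (1, "2024-01-05", "2024-01-07"),  # shipped after ordering
        (2, "2024-01-10", "2024-01-08"),  # shipped before ordering
    ],
)

rows = unexpected_rows(conn, "orders")
expectation_passes = len(rows) == 0
```

In this sketch the second order is returned by the query, so the Expectation would be treated as failing under the assumption above.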
