Version: 1.9.3

Format results

You can control the level of detail GX Cloud returns in your Validation Results to improve the clarity and efficiency of your data quality workflows. By configuring the result_format setting with the GX Cloud API, you can receive only the information you need, whether that’s a high-level pass/fail indicator for exploration, specific failing values for troubleshooting, or full failed rows for data cleansing.

This setting controls the results you receive in both the GX Cloud UI and the GX Cloud API, as detailed below. However, result_format must be configured through the GX Cloud API.

Prerequisites

Configure and apply a Result Format

Follow the steps below to select a base level of verbosity, optionally configure additional settings available to your selection, and then apply the Result Format configuration to a Checkpoint or Validation Definition.

  1. Create a dictionary and set the verbosity of your Validation Results as the value of the "result_format" key. In order from least to greatest detail, the valid values for the "result_format" key are:

    • "BOOLEAN_ONLY"
    • "BASIC"
    • "SUMMARY"
    • "COMPLETE"

    The default for Validation Results generated by GX-managed Checkpoints is "COMPLETE". The default for Validation Results generated by Validation Definitions and API-managed Checkpoints is "SUMMARY".

    The following example shows the "BASIC" Result Format and the information returned at that level of verbosity; configurations for the other levels are shown after it.

    When result_format is set to "BASIC", the Validation Result of each Expectation includes a result dictionary with information that provides a basic explanation of why the Expectation failed or succeeded. This format is intended for quick feedback and works well in Jupyter Notebooks.

    You can check the result field reference table to see what information is provided in the result dictionary.

    To create a "BASIC" Result Format configuration, use the following code:

    Python
    basic_result_format_dict = {"result_format": "BASIC"}
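
    The remaining verbosity levels follow the same pattern; only the value of the "result_format" key changes. For example (the variable names are illustrative):

    Python
    # Verbosity-only configurations for the other Result Format levels
    boolean_only_result_format_dict = {"result_format": "BOOLEAN_ONLY"}
    summary_result_format_dict = {"result_format": "SUMMARY"}
    complete_result_format_dict = {"result_format": "COMPLETE"}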
  2. Optional. Specify configurations for additional settings available to the base result_format.

    Once you have defined the base configuration in your result_format key, you can further tailor the format of your Validation Results by defining additional key/value pairs in your Result Format dictionary.

    Reference the table below for valid keys and how they influence the format of generated Validation Results:

    Dictionary key             | Purpose
    "partial_unexpected_count" | Sets the number of results to include in "partial_unexpected_list" (default is 20). Set the value to zero to suppress the unexpected counts.
    "include_unexpected_rows"  | When True, the GX Cloud API returns up to 200 entire rows that violate the Expectation (default is False). Applies to Column Map Expectations only, such as ExpectColumnValuesToBeInSet. Note that ExpectColumnValuesToBeOfType and ExpectColumnValuesToBeInTypeList return unexpected rows only for Pandas Data Sources.
  3. Apply the Result Format to a Checkpoint or Validation Definition.

    You can define a persistent Result Format configuration on a Checkpoint. The Result Format will be applied every time the Checkpoint is run. For more information on retrieving or creating a Checkpoint, see Run a Validation.

    Saved Result Format
    import great_expectations as gx

    context = gx.get_context(mode="cloud")

    # Define the Result Format
    result_format_dict = {
        "result_format": "COMPLETE",
        "unexpected_index_column_names": ["my_identifying_column"],
        "partial_unexpected_count": 25,
        "include_unexpected_rows": True,
    }

    # Retrieve the Checkpoint
    checkpoint = context.checkpoints.get("my_checkpoint")

    # Update the Checkpoint's configuration
    checkpoint.result_format = result_format_dict
    checkpoint.save()

    # Run the Checkpoint
    # If you are working with a SQL or filesystem Data Asset, omit the batch_parameters.
    # Here, test_df is the DataFrame (for example, a pandas DataFrame) you want to validate.
    batch_parameters = {"dataframe": test_df}
    checkpoint.run(batch_parameters=batch_parameters)

    Alternatively, you can pass a result_format configuration at runtime to the .run(...) method of a Validation Definition. This result_format configuration does not persist with the Validation Definition; it applies only to the current execution of the .run(...) method. For more information on creating a Validation Definition, see Run a Validation.

    Runtime Result Format
    import great_expectations as gx

    context = gx.get_context(mode="cloud")

    # Define the Result Format
    result_format_dict = {
        "result_format": "COMPLETE",
        "unexpected_index_column_names": ["my_identifying_column"],
        "partial_unexpected_count": 25,
        "include_unexpected_rows": True,
    }

    # Retrieve the Validation Definition
    validation_definition = context.validation_definitions.get("my_validation_definition")

    # Run the Validation Definition with a Result Format configuration
    # If you are working with a SQL or filesystem Data Asset, omit the batch_parameters.
    # Here, test_df is the DataFrame (for example, a pandas DataFrame) you want to validate.
    batch_parameters = {"dataframe": test_df}
    validation_results = validation_definition.run(
        result_format=result_format_dict, batch_parameters=batch_parameters
    )

    # Review the Validation Results
    print(validation_results)
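
    The fields available in each Expectation's result dictionary depend on the configured verbosity. As a minimal sketch, assuming the Validation Results object returned by .run(...) exposes a success flag and a results list (as in recent GX Core releases), you can inspect each Expectation's result dictionary programmatically:

    Python
    # Inspect each Expectation's result dictionary from the Validation Results above
    for expectation_result in validation_results.results:
        print(expectation_result.success)  # pass/fail for this Expectation
        print(expectation_result.result)   # fields described in the reference tables below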

Reference tables

The following table lists the fields that can be found in the result dictionary of a Validation Result, and the information each field provides.

Field within result           | Value
element_count                 | The total number of values in the column.
missing_count                 | The number of missing values in the column.
missing_percent               | The percent of rows that are missing a value for the column.
unexpected_count              | The total count of unexpected values in the column.
unexpected_percent            | The overall percent of unexpected values in the column.
unexpected_percent_nonmissing | The percent of unexpected values in the column, excluding rows that have no value for that column.
observed_value                | The aggregate statistic computed for the column. Applies only to Expectations that pertain to the aggregate value of a column, rather than the individual values in each row.
partial_unexpected_list       | A partial list of values that violate the Expectation (up to 20 values by default).
partial_unexpected_index_list | A partial list of the indices of the unexpected values in the column, as defined by the columns in unexpected_index_column_names (up to 20 indices by default).
partial_unexpected_counts     | A partial list of values and counts, showing the number of times each unexpected value occurs (up to 20 value/count pairs by default).
unexpected_index_list         | A list of the indices of the unexpected values in the column, as defined by the columns in unexpected_index_column_names. Applies only to Expectations that have a yes/no answer for each row.
unexpected_index_query        | A query that can be used to retrieve all unexpected values (SQL and Spark), or the full list of unexpected indices (Pandas). Applies only to Expectations that have a yes/no answer for each row.
unexpected_list               | A list of up to 200 values that violate the Expectation.
unexpected_rows               | Up to 200 complete rows that violate the Expectation. The format depends on the Data Source; for example, a SQL Data Source returns a list of tuples, while a Spark Data Source returns a DataFrame. Not available in the GX Cloud UI. Applies to Column Map Expectations only, such as ExpectColumnValuesToBeInSet. Note that ExpectColumnValuesToBeOfType and ExpectColumnValuesToBeInTypeList return unexpected_rows only for Pandas Data Sources.
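
As an illustration only, a result dictionary for a Column Map Expectation run at the "SUMMARY" level of verbosity might resemble the following. The field values are hypothetical, and the exact fields returned depend on the Expectation and the configured verbosity:

Python
# Hypothetical result dictionary for a Column Map Expectation at "SUMMARY" verbosity
example_result = {
    "element_count": 1000,
    "missing_count": 0,
    "missing_percent": 0.0,
    "unexpected_count": 3,
    "unexpected_percent": 0.3,
    "unexpected_percent_nonmissing": 0.3,
    "partial_unexpected_list": ["foo", "bar", "baz"],
}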