Version: 1.8.0

FileDataAsset

Signature

class great_expectations.datasource.fluent.data_asset.path.file_asset.FileDataAsset(
*,
name: <pydantic.v1.fields.DeferredType object at 0x7fb3f0f87320>,
type: <pydantic.v1.fields.DeferredType object at 0x7fb3f0f873e0>,
id: <pydantic.v1.fields.DeferredType object at 0x7fb3f0f874a0> = None,
order_by: <pydantic.v1.fields.DeferredType object at 0x7fb3f0f87560> = None,
batch_metadata: <pydantic.v1.fields.DeferredType object at 0x7fb3f0f876b0> = None,
batch_definitions: <pydantic.v1.fields.DeferredType object at 0x7fb3f0f87860> = None,
connect_options: <pydantic.v1.fields.DeferredType object at 0x7fb3f0f87920> = None,
**extra_data: typing.Any
)

Base class for PathDataAssets which batch by applying a regex to file names.

Methods

add_batch_definition_daily

Signature

add_batch_definition_daily(
name: str,
regex: Union[re.Pattern, str],
sort_ascending: bool = True
) → BatchDefinition

Add a BatchDefinition which defines daily batches by file name.

Parameters

Name            Description
name            BatchDefinition name.
regex           Regular expression used to define batches by file name. Must contain the groups year, month, and day.
sort_ascending  Determines the order in which batches are returned. Defaults to True (ascending).

Raises

Type                             Description
RegexMissingRequiredGroupsError  regex is missing one or more of the groups year, month, and day.
RegexUnknownGroupsError          regex contains groups other than year, month, and day.
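As an illustration of the regex requirement above, the following sketch builds a pattern with the named groups year, month, and day and checks which file names it accepts. The file names, the pattern, and the BatchDefinition name "daily_sales" are hypothetical, and the `add_daily` helper assumes an existing FileDataAsset is passed in. How Great Expectations applies the pattern internally may differ from the `fullmatch` call used here.

```python
import re

# Hypothetical file-name pattern. The named groups year, month, and day
# are the ones add_batch_definition_daily requires.
DAILY_REGEX = re.compile(
    r"sales_(?P<year>\d{4})-(?P<month>\d{2})-(?P<day>\d{2})\.csv"
)

# Hypothetical file names, for illustration only.
file_names = ["sales_2024-01-15.csv", "sales_2024-01-16.csv", "README.txt"]

# Collect the captured groups for every file name the pattern accepts.
matched = {}
for file_name in file_names:
    m = DAILY_REGEX.fullmatch(file_name)
    if m:
        matched[file_name] = m.groupdict()

print(matched)


def add_daily(asset):
    """Register the pattern on an existing FileDataAsset (not created here)."""
    return asset.add_batch_definition_daily(
        name="daily_sales",  # hypothetical BatchDefinition name
        regex=DAILY_REGEX,
        sort_ascending=True,
    )
```

Checking the pattern against a few sample file names before registering it is a cheap way to catch a missing or misspelled group name.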

add_batch_definition_monthly

Signature

add_batch_definition_monthly(
name: str,
regex: Union[re.Pattern, str],
sort_ascending: bool = True
) → BatchDefinition

Add a BatchDefinition which defines monthly batches by file name.

Parameters

Name            Description
name            BatchDefinition name.
regex           Regular expression used to define batches by file name. Must contain the groups year and month.
sort_ascending  Determines the order in which batches are returned. Defaults to True (ascending).

Raises

Type                             Description
RegexMissingRequiredGroupsError  regex is missing one or both of the groups year and month.
RegexUnknownGroupsError          regex contains groups other than year and month.
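To make the effect of sort_ascending concrete, here is a plain-Python sketch that orders file names by their captured year and month. It is an illustration of the ordering idea only, not the library's implementation. The pattern, the file names, and the helper `order_by_month` are hypothetical.

```python
import re

# Hypothetical pattern with the two groups a monthly definition requires.
MONTHLY_REGEX = re.compile(r"report_(?P<year>\d{4})_(?P<month>\d{2})\.csv")

# Hypothetical file names, deliberately out of order.
file_names = ["report_2024_02.csv", "report_2023_12.csv", "report_2024_01.csv"]


def order_by_month(names, sort_ascending=True):
    """Order matching names by (year, month); non-matching names are dropped."""
    keyed = []
    for name in names:
        m = MONTHLY_REGEX.fullmatch(name)
        if m:
            keyed.append(((int(m["year"]), int(m["month"])), name))
    keyed.sort(reverse=not sort_ascending)
    return [name for _, name in keyed]


ascending = order_by_month(file_names)
descending = order_by_month(file_names, sort_ascending=False)
print(ascending)
print(descending)
```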

add_batch_definition_path

Signature

add_batch_definition_path(
name: str,
path: PathStr
) → BatchDefinition

Add a BatchDefinition which matches a single Path.

Parameters

Name  Description
name  BatchDefinition name.
path  File path relative to the Asset.

Raises

Type                Description
PathNotFoundError   path cannot be resolved.
AmbiguousPathError  path matches more than one file.

add_batch_definition_yearly

Signature

add_batch_definition_yearly(
name: str,
regex: Union[re.Pattern, str],
sort_ascending: bool = True
) → BatchDefinition

Add a BatchDefinition which defines yearly batches by file name.

Parameters

Name            Description
name            BatchDefinition name.
regex           Regular expression used to define batches by file name. Must contain a single group, year.
sort_ascending  Determines the order in which batches are returned. Defaults to True (ascending).

Raises

Type                             Description
RegexMissingRequiredGroupsError  regex is missing the group year.
RegexUnknownGroupsError          regex contains groups other than year.
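The two error conditions above can be checked before calling the method. The sketch below compares a pattern's named groups with the required set using the standard library's `re.Pattern.groupindex`. The helper `check_groups` and the sample patterns are hypothetical, and the check mirrors the documented conditions rather than the library's own validation. Note that `groupindex` lists named groups only.

```python
import re


def check_groups(regex, required):
    """Return (missing, unknown) named groups for a str or compiled pattern."""
    pattern = re.compile(regex)  # accepts a str or an already compiled pattern
    found = set(pattern.groupindex)
    return required - found, found - required


# Hypothetical pattern with an extra group: year is present, region is not expected.
missing, unknown = check_groups(
    r"data_(?P<year>\d{4})_(?P<region>\w+)\.csv", {"year"}
)
print(missing, unknown)

# Hypothetical pattern with no named groups at all: year is missing.
missing_plain, unknown_plain = check_groups(r"data_\d{4}\.csv", {"year"})
print(missing_plain, unknown_plain)
```

The same helper works for the monthly and daily variants by passing {"year", "month"} or {"year", "month", "day"} as the required set.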

delete_batch_definition

Signature

delete_batch_definition(
name: str
) → None

Delete a batch definition.

Parameters

Name        Description
name (str)  Name of the BatchDefinition to delete.

get_batch_definition

Signature

get_batch_definition(
name: str
) → great_expectations.core.batch_definition.BatchDefinition

Get a batch definition.

Parameters

Name        Description
name (str)  Name of the BatchDefinition to get.

Raises

Type      Description
KeyError  If the BatchDefinition does not exist.