Version: 1.3.2

Integrate GX in a data pipeline

This tutorial provides working, hands-on examples of how to use GX to add data validation to a data pipeline. It is hosted on GitHub.

The tutorial provides a functioning deployment of GX, Airflow, JupyterLab, and Postgres that you can run locally using Docker Compose. Educational content is provided as JupyterLab notebooks that you can run and modify interactively to explore how GX integrates with a data pipeline.

The tutorial environment uses Airflow as the orchestrator and Postgres as the Data Source, but the examples can be adapted to any other supported Data Source and orchestrator.
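As a rough illustration of what "data validation within a data pipeline" means, the sketch below places a validation gate between an extract step and a load step, so that bad data halts the run before it reaches downstream consumers. It is plain Python, not the GX or Airflow API: `CheckResult`, `check_not_null`, `check_between`, `validate`, and `run_pipeline`, along with the `customer_id` and `amount` columns, are hypothetical names chosen for this example. In the tutorial, GX performs the checks and Airflow orchestrates the steps.

```python
from dataclasses import dataclass


@dataclass
class CheckResult:
    """Outcome of one data quality check (hypothetical structure, not a GX class)."""
    name: str
    success: bool
    failed_rows: int


def check_not_null(rows, column):
    """Count rows where `column` is missing or None."""
    failed = sum(1 for row in rows if row.get(column) is None)
    return CheckResult(f"{column} is not null", failed == 0, failed)


def check_between(rows, column, low, high):
    """Count rows where `column` is present but outside [low, high]."""
    failed = sum(
        1 for row in rows
        if row.get(column) is not None and not (low <= row[column] <= high)
    )
    return CheckResult(f"{column} between {low} and {high}", failed == 0, failed)


def validate(rows):
    """Run every check and return the results; the caller decides what to do."""
    return [
        check_not_null(rows, "customer_id"),
        check_between(rows, "amount", 0, 10_000),
    ]


def run_pipeline(rows):
    """Validate extracted rows, then load them; halt before loading on any failure."""
    results = validate(rows)
    failures = [r for r in results if not r.success]
    if failures:
        summary = "; ".join(f"{r.name} ({r.failed_rows} rows)" for r in failures)
        raise ValueError(f"Validation failed: {summary}")
    return len(rows)  # stand-in for loading the rows downstream


# Illustrative data only.
good_rows = [{"customer_id": 1, "amount": 25.0}, {"customer_id": 2, "amount": 310.5}]
bad_rows = [{"customer_id": None, "amount": 25.0}, {"customer_id": 3, "amount": -4.0}]
```

Calling `run_pipeline(good_rows)` completes the stand-in load step, while `run_pipeline(bad_rows)` raises a `ValueError` that names the failed checks. Raising an exception is one simple way to make an orchestrator mark the task as failed; other designs might quarantine bad rows or send an alert instead.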

Access this tutorial on GitHub

Click the link below to access the tutorial-gx-in-the-data-pipeline repo.

tutorial-gx-in-the-data-pipeline