A Fivetran provider for Apache Airflow
This package provides an operator, sensor, and hook that integrates Fivetran into Apache Airflow. FivetranOperator
allows you to start Fivetran jobs from Airflow and FivetranSensor
allows you to monitor a Fivetran sync job for completion before running downstream processes.
Fivetran automates your data pipeline, and Airflow automates your data processing.
Prerequisites: An environment running apache-airflow
.
pip install airflow-provider-fivetran
In the Airflow user interface, configure a Connection for Fivetran. Most of the Connection config fields will be left blank. Configure the following fields:
Conn Id
: fivetran_default
Conn Type
: Fivetran
Fivetran API Key
: Your Fivetran API KeyFivetran API Secret
: Your Fivetran API SecretFind the Fivetran API Key and Secret in the Fivetran Account Settings, under the API Config section. See our documentation for more information on Fivetran API Authentication.
The sensor and operator assume the Conn Id
is set to fivetran_default
, however if you are managing multipe Fivetran accounts, you can set this to anything you like. See the DAG in examples to see how to specify a custom Conn Id
.
FivetranOperator
starts a Fivetran sync job. Note that when a Fivetran sync job is controlled via an Operator, it is no longer run on the schedule as managed by Fivetran. In other words, it is now scheduled only from Airflow.
FivetranOperator
requires that you specify the connector_id
of the sync job to start. You can find connector_id
in the Settings page of the connector you configured in the Fivetran dashboard.
Import into your DAG via:
from fivetran_provider.operators.fivetran import FivetranOperator
FivetranSensor
monitors a Fivetran sync job for completion. Monitoring with FivetranSensor
allows you to trigger downstream processes only when the Fivetran sync jobs have completed, ensuring data consistency. You can use multiple instances of FivetranSensor
to monitor multiple Fivetran connectors.
Note, it is possible to monitor a sync that is scheduled and managed from Fivetran; in other words, you can use FivetranSensor
without using FivetranOperator
. If used in this way, your DAG will wait until the sync job starts on its Fivetran-controlled schedule and then completes.
FivetranSensor
requires that you specify the connector_id
of the sync job to start. You can find connector_id
in the Settings page of the connector you configured in the Fivetran dashboard.
Import into your DAG via:
from fivetran_provider.sensors.fivetran import FivetranSensor
See the examples directory for an example DAG.
Please submit issues and pull requests in our official repo: https://github.com/fivetran/airflow-provider-fivetran
We are happy to hear from you. Please email any feedback to the authors at devrel@fivetran.com.
Special thanks to Pete DeJoy, Plinio Guzman, and David Koenitzer of Astronomer.io for their contributions and support in getting this provider off the ground.