
Temporal Fusion Transformer: Time Series Forecasting with Deep Learning — A Complete Tutorial | by Nikos Kafritsas | Nov 2022


Create accurate and interpretable predictions

Created with DALL·E [1]

According to [2], Temporal Fusion Transformer outperforms all prominent Deep Learning models for time series forecasting.

Including a featured Gradient Boosting Tree model for tabular time series data.

But what is Temporal Fusion Transformer (TFT) [3], and why is it so interesting?

In this article, we briefly explain the novelties of Temporal Fusion Transformer and build an end-to-end project on Energy Demand Forecasting. Specifically, we will cover:

  • How to prepare our data for the TFT format.
  • How to build, train, and evaluate the TFT model.
  • How to get predictions on validation data and out-of-sample predictions.
  • How to calculate feature importances, seasonality patterns, and extreme-event robustness using the model's built-in interpretable attention mechanism.

Let’s dive in!

For an in-depth analysis of the Temporal Fusion Transformer architecture, check my previous article.

Temporal Fusion Transformer (TFT) is a Transformer-based model that leverages self-attention to capture the complex temporal dynamics of multiple time series.

TFT supports:

  • Multiple time series: We can train a TFT model on thousands of univariate or multivariate time series.
  • Multi-Horizon Forecasting: The model outputs multi-step predictions of one or more target variables — including prediction intervals.
  • Heterogeneous features: TFT supports many types of features, including time-variant and static exogenous variables.
  • Interpretable predictions: Predictions can be interpreted in terms of variable importance and seasonality.

One of those traits is unique to Temporal Fusion Transformer. We will cover this in the next section.

Among notable DL time-series models (e.g., DeepAR [4]), TFT stands out because it supports various types of features. These are:

  • Time-varying known
  • Time-varying unknown
  • Time-invariant real
  • Time-invariant categorical

For example, imagine we have a sales forecasting case:

Let's say we have to predict the sales of 3 products. The num_sales is the target variable. The CPI index or the number of visitors are time-varying unknown features because they are only known up to prediction time. However, holidays and special days are time-varying known events.

The product id is a time-invariant (static) categorical feature. Other features that are numerical and not time-dependent, such as yearly_revenue, can be categorized as time-invariant real.

Before moving to our project, we will first show a mini-tutorial on how to convert your data to the extended time-series format.

Note: All images and figures in this article are created by the author.

For this tutorial, we use the TemporalFusionTransformer model from the PyTorch Forecasting library and PyTorch Lightning:

pip install torch pytorch-lightning pytorch_forecasting

The whole process involves 3 things:

  1. Create a pandas dataframe with our time-series data.
  2. Wrap our dataframe into a TimeSeriesDataSet instance.
  3. Pass our TimeSeriesDataSet instance to TemporalFusionTransformer.

The TimeSeriesDataSet is very useful because it helps us specify whether features are time-varying or static. Plus, it's the only format that TemporalFusionTransformer accepts.

Let's create a minimal training dataset to show how TimeSeriesDataSet works:
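
The original snippet is not embedded in this scrape; a minimal sketch that builds a dataframe like the one in Figure 1 could look like this (the column names value and group are illustrative):

import numpy as np
import pandas as pd

# 3 toy time series, each identified by its "group" value
sample_data = pd.DataFrame(
    {
        "time_idx": np.tile(np.arange(5), 3),    # 0..4 within each series
        "group": np.repeat(["0", "1", "2"], 5),  # series identifier
        "value": np.random.rand(15),             # the target variable
    }
)
print(sample_data)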

We should format our data in the following way: each colored box represents a different time series, represented by its group value.

Figure 1: The sample_data pandas dataframe

The most important column of our dataframe is the time_idx — it determines the sequence of samples. If there are no missing observations, the values should increase by +1 for each time series.

Next, we wrap our dataframe into a TimeSeriesDataSet instance:
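
The embedded gist is missing here; a minimal sketch, consistent with the 3-step lookback and 2-step horizon described below:

from pytorch_forecasting import TimeSeriesDataSet

# look back 3 time steps to predict the next 2
dataset = TimeSeriesDataSet(
    sample_data,
    group_ids=["group"],
    target="value",
    time_idx="time_idx",
    max_encoder_length=3,
    max_prediction_length=2,
    time_varying_unknown_reals=["value"],
)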

All arguments are self-explanatory: max_encoder_length defines the lookback period, and max_prediction_length specifies how many datapoints will be predicted. In our case, we look back 3 time steps in the past to output 2 predictions.

The TimeSeriesDataSet instance now serves as a dataloader. Let's print a batch and check how our data will be passed to TFT:
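
A sketch of that inspection (the exact batch you see will vary, since batches are shuffled):

# wrap the dataset in a PyTorch dataloader and grab one batch
dataloader = dataset.to_dataloader(batch_size=1)

x, y = next(iter(dataloader))
print(x["encoder_target"])  # the lookback values fed to the encoder
print(x["groups"])          # which series this sample comes from
print(x["decoder_target"])  # the values the model must predict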

This batch contains the training values [0,1] from the first time series (group 0) and the testing values [2,3,4]. If you rerun this code, you will get different values because the data are shuffled by default.

Our project will use the ElectricityLoadDiagrams20112014 [5] dataset from UCI. The notebook for this example can be downloaded from here:

This dataset contains the power usage (in KW) of 370 consumers/clients with a 15-minute frequency. The data span 4 years (2011–2014).

Some consumers were created after 2011, so their power usage is initially zero.

We do data preprocessing according to [3]:

  • Aggregate our target variable power_usage by hour.
  • Find the earliest date for every time series where power is non-zero.
  • Create new features: month, day, hour and day_of_week.
  • Select all days between 2014–01–01 and 2014–09–07.

Let’s begin:

Download Data

!wget https://archive.ics.uci.edu/ml/machine-learning-databases/00321/LD2011_2014.txt.zip
!unzip LD2011_2014.txt.zip

Data Preprocessing
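
The loading code is not shown in this scrape; assuming the standard layout of the UCI file (semicolon-separated, comma as the decimal mark), a minimal sketch:

import pandas as pd

# read the raw file: one datetime index, one column per consumer
data = pd.read_csv(
    "LD2011_2014.txt",
    index_col=0,
    sep=";",
    decimal=",",
    parse_dates=True,
)
data.head()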

Each column represents a consumer. Most initial power_usage values are 0.

Next, we aggregate to hourly data. Because of the model's size and complexity, we train our model on 5 consumers only (those with non-zero values).

Now, we prepare our dataset for the TimeSeriesDataSet format. Notice that each column represents a different time series. Hence, we 'melt' our dataframe, so that all time series are stacked vertically instead of horizontally. In the process, we create our new features.
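
A sketch of this preprocessing, under two assumptions: the hourly aggregation uses the mean, and the 5 consumers are the ones plotted later (MT_002, MT_004, MT_005, MT_006, MT_008):

# the 5 consumers used throughout this tutorial
consumers = ["MT_002", "MT_004", "MT_005", "MT_006", "MT_008"]

# aggregate the 15-minute readings to hourly values
hourly = data[consumers].resample("1h").mean()
hourly.index.name = "date"

# 'melt': stack the series vertically, one row per (consumer, timestamp)
time_df = hourly.reset_index().melt(
    id_vars=["date"], var_name="consumer_id", value_name="power_usage"
)

# calendar features
time_df["month"] = time_df["date"].dt.month
time_df["day"] = time_df["date"].dt.day
time_df["hour"] = time_df["date"].dt.hour
time_df["day_of_week"] = time_df["date"].dt.dayofweek

# integer hourly index, counted from the very first timestamp in 2011
time_df["hours_from_start"] = (
    (time_df["date"] - time_df["date"].min()).dt.total_seconds() // 3600
).astype(int)

# keep only 2014-01-01 .. 2014-09-07, per the preprocessing steps above
time_df = time_df[(time_df["date"] >= "2014-01-01") & (time_df["date"] < "2014-09-08")]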

The final preprocessed dataframe is called time_df. Let's print its contents:

The time_df is now in the proper format for the TimeSeriesDataSet. As you may have guessed by now, since the granularity is hourly, the hours_from_start variable will be the time index.

Exploratory Data Analysis

The choice of 5 consumers/time series is not random. The power usage of each time series has different properties, such as the mean value:

time_df[['consumer_id', 'power_usage']].groupby('consumer_id').mean()

Let's plot the first month of every time series:

Figure 2: The first month of all 5 time series/consumers.

There is no noticeable trend, but each time series has slightly different seasonality and amplitude. We can further experiment and check stationarity, signal decompositions, and so on, but in our case, we focus on the model-building aspect only.

Also, notice that other time-series forecasting methods like ARIMA must satisfy a few requirements (for instance, the time series must first become stationary). With TFT, we can leave our data as-is.

Create DataLoaders

In this step, we pass our time_df to the TimeSeriesDataSet format, which is immensely useful because:

  • It spares us from writing our own Dataloader.
  • We can specify how TFT will handle the dataset's features.
  • We can normalize our dataset with ease. In our case, normalization is mandatory because all time series differ in magnitude. Thus, we use the GroupNormalizer to normalize each time series individually.

Our model uses a lookback window of 1 week (7*24) to predict the power usage of the next 24 hours.

Also, notice that hours_from_start is both the time index and a time-varying feature. The power_usage is our target variable. For the sake of demonstration, our validation set is the last day:
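
The original snippet is not embedded here; a sketch that matches the choices described above (hyperparameter values are illustrative):

from pytorch_forecasting import TimeSeriesDataSet
from pytorch_forecasting.data import GroupNormalizer

max_prediction_length = 24      # predict the next day
max_encoder_length = 7 * 24     # lookback window of one week
training_cutoff = time_df["hours_from_start"].max() - max_prediction_length

training = TimeSeriesDataSet(
    time_df[lambda x: x.hours_from_start <= training_cutoff],
    time_idx="hours_from_start",
    target="power_usage",
    group_ids=["consumer_id"],
    min_encoder_length=max_encoder_length // 2,
    max_encoder_length=max_encoder_length,
    max_prediction_length=max_prediction_length,
    static_categoricals=["consumer_id"],
    time_varying_known_reals=["hours_from_start", "day", "day_of_week", "month", "hour"],
    time_varying_unknown_reals=["power_usage"],
    # normalize each consumer's series individually
    target_normalizer=GroupNormalizer(groups=["consumer_id"], transformation="softplus"),
    add_relative_time_idx=True,
    add_target_scales=True,
)

# validation set: the last day of the data
validation = TimeSeriesDataSet.from_dataset(training, time_df, predict=True, stop_randomization=True)

batch_size = 64
train_dataloader = training.to_dataloader(train=True, batch_size=batch_size)
val_dataloader = validation.to_dataloader(train=False, batch_size=batch_size * 10)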

Baseline Model

Next, the step that almost everyone forgets: a baseline model. Especially in time-series forecasting, you'll be surprised at how often a naive predictor outperforms even a fancier model!

As a naive baseline, we predict the power usage curve of the previous day:
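
PyTorch Forecasting ships a built-in Baseline model that repeats the last observed value over the horizon — not exactly "yesterday's curve", but a close stand-in for a naive benchmark:

import torch
from pytorch_forecasting import Baseline

# compare the naive forecast against the actuals with a mean absolute error
actuals = torch.cat([y[0] for x, y in iter(val_dataloader)])
baseline_predictions = Baseline().predict(val_dataloader)
print((actuals - baseline_predictions).abs().mean().item())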

Training the Temporal Fusion Transformer Model

We can train our TFT model using the familiar Trainer interface from PyTorch Lightning.

Notice the following things:

  • We use the EarlyStopping callback to monitor the validation loss.
  • We use Tensorboard to log our training and validation metrics.
  • Our model uses Quantile Loss — a special type of loss that helps us output the prediction intervals. For more on the Quantile Loss function, check this article.
  • We use 4 attention heads, like the original paper.

We are now ready to build and train our model:
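
A sketch of the build-and-train step; apart from the 4 attention heads noted above, the hyperparameter values are illustrative:

import pytorch_lightning as pl
from pytorch_lightning.callbacks import EarlyStopping, LearningRateMonitor
from pytorch_lightning.loggers import TensorBoardLogger
from pytorch_forecasting import TemporalFusionTransformer
from pytorch_forecasting.metrics import QuantileLoss

early_stop_callback = EarlyStopping(monitor="val_loss", min_delta=1e-4, patience=5, mode="min")
lr_logger = LearningRateMonitor()
logger = TensorBoardLogger("lightning_logs")

trainer = pl.Trainer(
    max_epochs=45,
    gradient_clip_val=0.1,
    callbacks=[lr_logger, early_stop_callback],
    logger=logger,
)

tft = TemporalFusionTransformer.from_dataset(
    training,
    learning_rate=0.03,
    hidden_size=16,
    attention_head_size=4,   # 4 attention heads, as in the original paper
    dropout=0.1,
    hidden_continuous_size=8,
    output_size=7,           # one output per quantile
    loss=QuantileLoss(),
    log_interval=10,
    reduce_on_plateau_patience=4,
)

trainer.fit(tft, train_dataloaders=train_dataloader, val_dataloaders=val_dataloader)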

That's it! After 6 epochs, EarlyStopping kicks in and halts training.

Load and Save the Best Model

Don't forget to save your model. Although we can pickle it, the safest option is to save the best epoch directly:

!zip -r model.zip lightning_logs/lightning_logs/version_1/*

To load the model again, unzip model.zip and execute the following — just remember the best model path:
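
A sketch — in the same session, the Trainer remembers the best checkpoint; after unzipping, point the path at the .ckpt file inside the archive instead:

from pytorch_forecasting import TemporalFusionTransformer

best_model_path = trainer.checkpoint_callback.best_model_path
best_tft = TemporalFusionTransformer.load_from_checkpoint(best_model_path)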

Check Tensorboard

Take a closer look at the training and validation curves with Tensorboard:
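
Assuming a notebook environment, the Tensorboard magic does the job:

%load_ext tensorboard
%tensorboard --logdir lightning_logs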

Model Evaluation

Get predictions on the validation set and calculate the average P50 (quantile median) loss:
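
A minimal sketch — predict() with default arguments returns the P50 point forecasts, which we compare against the actuals with a mean absolute error (a proxy for the P50 loss):

import torch

actuals = torch.cat([y[0] for x, y in iter(val_dataloader)])
predictions = best_tft.predict(val_dataloader)

# mean absolute error per time series (rows follow the dataloader order)
print((actuals - predictions).abs().mean(dim=1))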

The last 2 time series have a slightly higher loss because their relative magnitude is also high.

Plot Predictions on Validation Data

If we pass mode="raw" to the predict() method, we get more information, including predictions for all seven quantiles. We also have access to the attention values (more on that later).

Take a closer look at the raw_predictions variable:
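
A sketch of the raw prediction call and a peek at its contents:

# mode="raw" returns quantile predictions, attention weights,
# and the network's intermediate outputs
raw_predictions, x = best_tft.predict(val_dataloader, mode="raw", return_x=True)
print(raw_predictions["prediction"].shape)  # (n_series, 24 steps, 7 quantiles)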

We use plot_prediction() to create our plots. Of course, you can make your own custom plot — plot_prediction() has the added benefit of displaying the attention values.

Note: Our model predicts the next 24 datapoints in one go. This is not a rolling forecasting scenario where a model predicts a single value each time and 'stitches' all predictions together.

We create one plot for each consumer (5 in total).
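
A sketch of the plotting loop:

# one plot per consumer; idx indexes the series in the validation set
for idx in range(5):
    best_tft.plot_prediction(x, raw_predictions, idx=idx, add_loss_to_title=True)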

Figure 3: Predictions on validation data for MT_002
Figure 4: Predictions on validation data for MT_004
Figure 5: Predictions on validation data for MT_005
Figure 6: Predictions on validation data for MT_006
Figure 7: Predictions on validation data for MT_008

The results are quite impressive.

Our Temporal Fusion Transformer model was able to capture the behavior of all 5 time series, in terms of both seasonality and magnitude!

Also, notice that:

  • We didn't perform any hyperparameter tuning.
  • We didn't implement any fancy feature-engineering technique.

In a later section, we show how to improve our model with hyperparameter optimization.

Plot Predictions For A Specific Time Series

Previously, we plotted predictions on the validation data using the idx argument, which iterates over all time series in our dataset. We can be more specific and output predictions for a particular time series:
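
A sketch using the dataset's filter() method; time_idx_first_prediction marks where the forecast should start:

# keep only consumer MT_004, forecasting from time index 26512
raw_prediction, x = best_tft.predict(
    training.filter(
        lambda x: (x.consumer_id == "MT_004") & (x.time_idx_first_prediction == 26512)
    ),
    mode="raw",
    return_x=True,
)
best_tft.plot_prediction(x, raw_prediction, idx=0)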

Figure 7: Day-ahead prediction for MT_004 on the training set

In Figure 7, we plot the day-ahead prediction of consumer MT_004 for time index=26512.

Remember, our time-indexing column hours_from_start starts from 26304, and we can get predictions from 26388 onwards (because we set min_encoder_length=max_encoder_length // 2 earlier, and 26304 + 168//2 = 26388).

Out-of-Sample Forecasts

Let's create out-of-sample predictions, beyond the final datapoint of the validation data — which is 2014–09–07 23:00:00.

All we have to do is create a new dataframe that contains:

  • The N=max_encoder_length past dates, which act as the lookback window — the encoder data in TFT terminology.
  • The future dates of size max_prediction_length, for which we want to compute our predictions — the decoder data. See the sketch after this list.
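
A sketch of building that dataframe and predicting on it: the decoder rows recycle the last observed row, refresh the known calendar covariates, and keep power_usage only as a placeholder (it is unknown in the future):

import pandas as pd

# encoder data: the last max_encoder_length observed hours
encoder_data = time_df[
    lambda x: x.hours_from_start > x.hours_from_start.max() - max_encoder_length
]

# decoder data: repeat the last observed row into the future
last_data = time_df[lambda x: x.hours_from_start == x.hours_from_start.max()]
decoder_data = pd.concat(
    [
        last_data.assign(date=lambda y: y.date + pd.Timedelta(hours=i))
        for i in range(1, max_prediction_length + 1)
    ],
    ignore_index=True,
)

# refresh the time index and the known calendar features
decoder_data["hours_from_start"] = time_df["hours_from_start"].max() + (
    (decoder_data["date"] - last_data["date"].iloc[0]).dt.total_seconds() // 3600
).astype(int)
decoder_data["hour"] = decoder_data["date"].dt.hour
decoder_data["day"] = decoder_data["date"].dt.day
decoder_data["day_of_week"] = decoder_data["date"].dt.dayofweek
decoder_data["month"] = decoder_data["date"].dt.month

# stitch encoder + decoder together and predict
new_prediction_data = pd.concat([encoder_data, decoder_data], ignore_index=True)
raw_preds, x = best_tft.predict(new_prediction_data, mode="raw", return_x=True)
best_tft.plot_prediction(x, raw_preds, idx=0)  # idx=0: first consumer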

We can create predictions for all 5 of our time series, or just one. Figure 7 shows the out-of-sample predictions for consumer MT_002:

Figure 7: Day-ahead prediction for MT_002

Accurate forecasting is one thing, but explainability also matters a lot these days.

And it's even worse for Deep Learning models, which are considered black boxes. Methods such as LIME and SHAP can provide explainability (to some extent) but don't work well for time series. Plus, they are external post-hoc methods and are not tied to a particular model.

Temporal Fusion Transformer provides three types of interpretability:

  • Seasonality-wise: TFT leverages its novel Interpretable Multi-Head Attention mechanism to calculate the importance of past time steps.
  • Feature-wise: TFT leverages its Variable Selection Network module to calculate the importance of every feature.
  • Extreme-events robustness: We can investigate how time series behave during rare events.

If you want to learn in depth about the inner workings of Interpretable Multi-Head Attention and the Variable Selection Network, check my previous article.

Seasonality-wise Interpretability

TFT explores the attention weights to understand the temporal patterns across past time steps.

The gray lines in all previous plots represent the attention scores. Look at those plots again — do you notice anything? Figure 8 shows the findings of Figure 7 and also accounts for the attention scores:

Figure 8: Day-ahead prediction for MT_001 with seasonalities displayed

The attention scores reveal how impactful these time steps are when the model outputs its prediction. The small peaks reflect the daily seasonality, while the higher peak towards the end probably implies the weekly seasonality.

If we average the attention curves across all timesteps and time series (not just the 5 we used in this tutorial), we get the symmetric-looking shape in Figure 9 from the TFT paper:

Figure 9: Temporal patterns for the Electricity dataset (Source)

Question: What good is this? Can't we simply estimate seasonality patterns with methods such as ACF plots, time-signal decomposition, and so on?

Answer: True. However, studying the attention weights of TFT has extra advantages:

  1. We can confirm our model captures the apparent seasonal dynamics of our sequences.
  2. Our model may also reveal hidden patterns, because the attention weights of the current input windows consider all past inputs.
  3. The attention-weights plot is not the same as an autocorrelation plot: the autocorrelation plot refers to a particular sequence, while the attention weights here focus on the impact of each timestep by looking across all covariates and time series.

Feature-wise Interpretability

The Variable Selection Network component of TFT can easily estimate the feature importances:
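
With the raw predictions from earlier, interpret_output() aggregates the variable-selection weights and attention scores:

interpretation = best_tft.interpret_output(raw_predictions, reduction="sum")
best_tft.plot_interpretation(interpretation)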

Figure 10: Feature importances on validation data

In Figure 10, we notice the following:

  • The hour and day_of_week have strong scores, both as past observations and future covariates. The benchmark in the original paper reaches the same conclusion.
  • The power_usage is clearly the most impactful observed covariate.
  • The consumer_id is not very significant here because we use only 5 consumers. In the TFT paper, where the authors use all 370 consumers, this variable is more significant.

Note: If your grouping static variable is not important, it is very likely that your dataset can also be modeled equally well by a single-distribution model (like ARIMA).

Extreme Event Detection

Time series are notorious for being susceptible to sudden changes in their properties during rare events (also called shocks).

Even worse, those events are very elusive. Imagine your target variable becomes volatile for a brief period because a covariate silently changes behavior:

Is this some random noise or a hidden persistent pattern that escapes our model?

With TFT, we can analyze the robustness of each individual feature across its range of values. Unfortunately, the current dataset does not exhibit volatility or rare events — those are more likely to be found in financial or sales data. Still, we will show how to calculate them:
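
A sketch using the model's built-in comparison of (normalized) average predictions vs actuals, binned by each variable's values:

predictions, x = best_tft.predict(val_dataloader, return_x=True)
predictions_vs_actuals = best_tft.calculate_prediction_actual_by_variable(x, predictions)
best_tft.plot_prediction_actual_by_variable(predictions_vs_actuals)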

Some features do not have all their values present in the validation dataset, so we only show hour and consumer_id:

Figure 11: Predictions vs actuals (normalized means) for hour
Figure 12: Predictions vs actuals (normalized means) for consumer_id

In both figures, the results are encouraging. In Figure 12, we notice that consumer MT_004 slightly underperforms compared to the other consumers. We could verify this by normalizing the P50 loss of each consumer with their average power usage, which we calculated previously.

The gray bars denote the distribution of each variable. One thing I always do is find which values have a low frequency. Then, I check how the model performs in those areas. Hence, you can easily detect whether your model captures the behavior of rare events.

In general, you can use this TFT feature to probe your model for weaknesses and proceed to further investigation.

We can seamlessly use Temporal Fusion Transformer with Optuna to perform hyperparameter tuning:
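
PyTorch Forecasting wraps the Optuna study in a helper; the ranges below are illustrative:

from pytorch_forecasting.models.temporal_fusion_transformer.tuning import (
    optimize_hyperparameters,
)

study = optimize_hyperparameters(
    train_dataloader,
    val_dataloader,
    model_path="optuna_checkpoints",
    n_trials=50,
    max_epochs=30,
    gradient_clip_val_range=(0.01, 1.0),
    hidden_size_range=(8, 128),
    hidden_continuous_size_range=(8, 128),
    attention_head_size_range=(1, 4),
    learning_rate_range=(0.001, 0.1),
    dropout_range=(0.1, 0.3),
    use_learning_rate_finder=False,  # sample the learning rate from the range instead
)
print(study.best_trial.params)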

The catch is that, since TFT is a Transformer-based model, you need significant hardware resources!

Temporal Fusion Transformer is undoubtedly a milestone for the time-series community.

Not only does the model achieve SOTA results, but it also provides a framework for the interpretability of predictions. The model is also available in the Darts Python library, which is based on the PyTorch Forecasting library.

Finally, if you are curious to learn about the architecture of the Temporal Fusion Transformer in detail, check my companion article on the original paper.
