Enhanced Linear and Vision Transformer-Based Architectures for Time Series Forecasting

Musleh Alharthi; Ausif Mahmood

doi:10.3390/bdcc8050048

Big Data and Cognitive Computing (May 2024)

Affiliations

Musleh Alharthi: Department of Computer Science and Engineering, University of Bridgeport, Bridgeport, CT 06604, USA
Ausif Mahmood: Department of Computer Science and Engineering, University of Bridgeport, Bridgeport, CT 06604, USA

DOI: https://doi.org/10.3390/bdcc8050048
Journal volume & issue: Vol. 8, no. 5, p. 48

Abstract

Time series forecasting has been a challenging area in the field of Artificial Intelligence. Various approaches such as linear neural networks, recurrent linear neural networks, Convolutional Neural Networks, and recently transformers have been attempted for the time series forecasting domain. Although transformer-based architectures have been outstanding in the Natural Language Processing domain, especially in autoregressive language modeling, the initial attempts to use transformers in the time series arena have met with mixed success. A recent important work indicates that simple linear networks outperform transformer-based designs. We investigate this paradox in detail, comparing the linear neural network- and transformer-based designs and providing insights into why a certain approach may be better for a particular type of problem. We also improve upon the recently proposed simple linear neural network-based architecture by using dual pipelines with batch normalization and reversible instance normalization. Our enhanced architecture outperforms all existing architectures for time series forecasting on a majority of the popular benchmarks.
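
As a rough illustration of the reversible instance normalization mentioned in the abstract, the sketch below wraps a single linear layer with a normalize step and a matching denormalize step. It is a minimal sketch under stated assumptions, not the authors' architecture: the function names, the `eps` value, and the plain-Python linear layer are all hypothetical, and the learnable affine parameters, batch normalization, and dual pipelines are omitted.

```python
import math

def revin_normalize(window, eps=1e-5):
    """Normalize one input window by its own mean and standard deviation."""
    mean = sum(window) / len(window)
    var = sum((x - mean) ** 2 for x in window) / len(window)
    std = math.sqrt(var + eps)  # eps (assumed value) avoids division by zero
    return [(x - mean) / std for x in window], mean, std

def revin_denormalize(values, mean, std):
    """Reverse the normalization so forecasts return to the original scale."""
    return [v * std + mean for v in values]

def linear_forecast(window, weights, bias):
    """One linear layer: each forecast step is a weighted sum of the window."""
    return [sum(w * x for w, x in zip(row, window)) + b
            for row, b in zip(weights, bias)]

def forecast_with_revin(window, weights, bias):
    """Normalize the window, apply the linear layer, then denormalize."""
    normed, mean, std = revin_normalize(window)
    return revin_denormalize(linear_forecast(normed, weights, bias), mean, std)
```

Because the statistics are computed per window and restored afterwards, adding a constant offset to the input shifts the forecast by the same offset, which is one intuition for why this kind of normalization can help when the level of a series drifts over time.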

Published in Big Data and Cognitive Computing

ISSN: 2504-2289 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology
Website: http://www.mdpi.com/journal/BDCC
