IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing (Jan 2024)

A Novel Transformer Network With Shifted Window Cross-Attention for Spatiotemporal Weather Forecasting

  • Alabi Bojesomo,
  • Hasan AlMarzouqi,
  • Panos Liatsis

DOI
https://doi.org/10.1109/JSTARS.2023.3323729
Journal volume & issue
Vol. 17
pp. 45 – 55

Abstract


Earth observation is a growing research area that can capitalize on the power of artificial intelligence for short-term forecasting, i.e., nowcasting. In this work, we tackle the challenge of weather forecasting using a video transformer network. Vision transformer architectures have been explored in various applications, with the major constraints being the computational complexity of attention and data-hungry training. To address these issues, we propose the use of the Video Swin Transformer (VST), coupled with a dedicated augmentation scheme. Moreover, we employ gradual spatial reduction on the encoder side and cross-attention on the decoder side. The proposed approach is tested on the Weather4Cast2021 weather forecasting challenge data, which requires the prediction of future frames up to 8 h ahead (4 per hour) from an hourly weather product sequence. The dataset was normalized to the range 0–1 to facilitate the use of the evaluation metrics across different datasets. The model achieves a mean squared error (MSE) of 0.4750 when provided with training data and 0.4420 in the transfer learning setting, without using training data.
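
The normalization and scoring described in the abstract can be illustrated with a minimal sketch. This is not the authors' code: the helper names (`min_max_normalize`, `mse`), the value bounds, and the sample values are hypothetical, and the challenge's actual per-variable scaling is not given here. The sketch only shows how scaling to 0–1 and computing a mean squared error fit together, and how the stated horizon (8 h at 4 frames per hour) translates into a frame count.

```python
def min_max_normalize(values, lo, hi):
    """Linearly rescale values so that lo maps to 0 and hi maps to 1."""
    if hi <= lo:
        raise ValueError("hi must be greater than lo")
    return [(v - lo) / (hi - lo) for v in values]


def mse(pred, target):
    """Mean squared error between two equal-length sequences."""
    if len(pred) != len(target):
        raise ValueError("pred and target must have the same length")
    return sum((p - t) ** 2 for p, t in zip(pred, target)) / len(pred)


# Hypothetical raw values and bounds, chosen only for illustration.
LO, HI = 200.0, 300.0
target_raw = [250.0, 275.0, 300.0]
pred_raw = [250.0, 300.0, 300.0]

# Both sequences are scaled with the same bounds before scoring.
target = min_max_normalize(target_raw, LO, HI)
pred = min_max_normalize(pred_raw, LO, HI)
score = mse(pred, target)

# Forecast horizon stated in the abstract: 8 h ahead at 4 frames per hour.
HOURS_AHEAD = 8
FRAMES_PER_HOUR = 4
n_future_frames = HOURS_AHEAD * FRAMES_PER_HOUR

print(f"MSE on normalized values: {score:.6f}")
print(f"Frames to predict: {n_future_frames}")
```

Because both the prediction and the target are scaled with the same bounds, the resulting error is expressed on a common 0–1 scale, which is what makes scores comparable across variables with different physical units.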

Keywords