Multi-scale persistent spatiotemporal transformer for long-term urban traffic flow prediction

Jia-Jun Zhong; Yong Ma; Xin-Zheng Niu; Philippe Fournier-Viger; Bing Wang; Zu-kuan Wei

Journal of Electronic Science and Technology (Mar 2024)

Affiliations

Jia-Jun Zhong: School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu, 611731, China
Yong Ma: School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu, 611731, China
Xin-Zheng Niu: School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu, 611731, China; Corresponding author.
Philippe Fournier-Viger: College of Computer Science & Software Engineering, Shenzhen University, Shenzhen, 518060, China
Bing Wang: School of Computer Science, Southwest Petroleum University, Chengdu, 610500, China
Zu-kuan Wei: School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu, 611731, China

Journal volume & issue: Vol. 22, no. 1, p. 100244

Abstract

Long-term urban traffic flow prediction is an important task in intelligent transportation, as it can help optimize traffic management and improve travel efficiency. A crucial issue for prediction accuracy is how to model spatiotemporal dependency in urban traffic data. In recent years, many studies have adopted spatiotemporal neural networks to extract key information from traffic data. However, most models ignore the semantic spatial similarity between distant areas when mining spatial dependency. They also ignore the impact of already-predicted time steps on the next unpredicted time step when making long-term predictions. Moreover, these models lack a comprehensive data embedding process to represent complex spatiotemporal dependency. To address these issues, this paper proposes a multi-scale persistent spatiotemporal transformer (MSPSTT) model for accurate long-term traffic flow prediction in cities. MSPSTT adopts an encoder-decoder structure and incorporates temporal, periodic, and spatial features to fully embed urban traffic data. The model consists of a spatiotemporal encoder and a spatiotemporal decoder, which rely on temporal, geospatial, and semantic-space multi-head attention modules to dynamically extract temporal, geospatial, and semantic characteristics. The spatiotemporal decoder combines the context information provided by the encoder with the information from already-predicted time steps, and is updated iteratively to learn the correlation between time steps over a broader time range, which improves the model's accuracy for long-term prediction. Experiments on four public transportation datasets demonstrate that MSPSTT outperforms existing models by up to 9.5% on three common metrics.
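The idea of feeding already-predicted time steps back into the decoder can be illustrated with a toy sketch. The code below is not the MSPSTT model: it has no learned weights, no embeddings, and no spatial attention. It only shows, under those simplifying assumptions, how a single scaled dot-product attention step can be applied iteratively so that each prediction becomes part of the context for the next one. The function names `attention` and `rollout` are hypothetical and chosen for illustration.

```python
import math


def attention(query, keys, values):
    """Scaled dot-product attention for a single query vector."""
    d = len(query)
    scores = [sum(q * k for q, k in zip(query, key)) / math.sqrt(d)
              for key in keys]
    # Subtract the maximum score for a numerically stable softmax.
    top = max(scores)
    weights = [math.exp(s - top) for s in scores]
    total = sum(weights)
    weights = [w / total for w in weights]
    # Weighted sum of the value vectors, one output per feature.
    return [sum(w * v[i] for w, v in zip(weights, values))
            for i in range(len(values[0]))]


def rollout(history, horizon):
    """Predict `horizon` steps, appending each prediction to the context."""
    context = [list(step) for step in history]  # copy; input is not mutated
    predictions = []
    for _ in range(horizon):
        query = context[-1]  # the most recent (observed or predicted) step
        next_step = attention(query, context, context)
        predictions.append(next_step)
        context.append(next_step)  # the predicted step informs the next one
    return predictions


# Toy data (assumed): three past time steps, two features per step.
history = [[1.0, 2.0], [2.0, 3.0], [3.0, 4.0]]
predictions = rollout(history, horizon=4)
```

Because each output here is a convex combination of earlier steps, the toy predictions always stay within the range of the observed history; a trained model with learned projections would not have that restriction.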

Published in Journal of Electronic Science and Technology

ISSN: 1674-862X (Print); 2666-223X (Online)
Publisher: KeAi Communications Co., Ltd.
Country of publisher: China
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science; Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://www.keaipublishing.com/en/journals/journal-of-electronic-science-and-technology/
