IEEE Access (Jan 2020)

Multi-Modal Pedestrian Trajectory Prediction for Edge Agents Based on Spatial-Temporal Graph

  • Xiangyu Zou,
  • Bin Sun,
  • Duan Zhao,
  • Zongwei Zhu,
  • Jinjin Zhao,
  • Yongxin He

DOI
https://doi.org/10.1109/ACCESS.2020.2991435
Journal volume & issue
Vol. 8
pp. 83321–83332

Abstract

Edge agents, represented by socially-aware robots and autonomous vehicles, have gradually been integrated into human society. A safe navigation system for interactive scenes is of great importance to them. The key to such a system is the edge agent's ability to predict pedestrian trajectories in dynamic scenes so as to avoid collisions. However, predicting pedestrian trajectories in dynamic scenes is not an easy task, because it requires jointly considering the spatial-temporal structure of human-environment interaction, visual attention, and the multi-modal nature of human walking behavior. In this paper, a scalable spatial-temporal graph generative adversarial network architecture (STG-GAN) is introduced, which can comprehensively account for the influence of human-environment interaction and generate reasonable multi-modal trajectory predictions. First, we use LSTM nodes to flexibly transform the spatial-temporal graph of human-environment interactions into feed-forward differentiable feature encodings, and propose a novel global node to integrate scene context information. Then, we capture the relative importance of global interactions on pedestrian trajectories through scaled dot-product attention, and jointly train a recurrent sequence model within a generative adversarial network architecture, so as to generate plausible distributions over future pedestrian trajectories based on rich mixed features. Experiments on public datasets show that STG-GAN outperforms previous work in terms of accuracy, inference speed, and the plausibility of predicted trajectories.
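The scaled dot-product attention step described in the abstract — weighting the influence of each neighbor's encoded features on a pedestrian's trajectory — can be sketched as follows. This is a minimal illustration in NumPy, not the authors' implementation; the function name and the assumption that Q, K, V are per-pedestrian feature matrices produced by the LSTM nodes are hypothetical.

```python
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    """Hypothetical sketch: attend over per-pedestrian features.

    Q, K, V: arrays of shape (num_pedestrians, d), e.g. hidden states
    emitted by the LSTM nodes of the spatial-temporal graph.
    Returns the attended features and the attention weight matrix.
    """
    d = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d)                    # pairwise interaction scores
    scores -= scores.max(axis=-1, keepdims=True)     # softmax numerical stability
    weights = np.exp(scores)
    weights /= weights.sum(axis=-1, keepdims=True)   # each row sums to 1
    return weights @ V, weights
```

Each row of `weights` gives the relative importance of every other pedestrian (and, in the paper's design, the global scene node) to one pedestrian's future motion; the attended output is then fed into the recurrent generator.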

Keywords