AdaFI-FCN: an adaptive feature integration fully convolutional network for predicting driver’s visual attention

Bowen Shi; Weihua Dong; Zhicheng Zhan

doi:10.1080/10095020.2022.2147028

Geo-spatial Information Science (Jul 2024)

AdaFI-FCN: an adaptive feature integration fully convolutional network for predicting driver’s visual attention

Bowen Shi,
Weihua Dong,
Zhicheng Zhan

Affiliations

Bowen Shi: State Key Laboratory of Remote Sensing Science, Beijing Key Laboratory for Remote Sensing of Environment and Digital Cities, Research Centre of Geospatial Cognition and Visual Analytics, and Faculty of Geographical Science, Beijing Normal University, Beijing, China
Weihua Dong: State Key Laboratory of Remote Sensing Science, Beijing Key Laboratory for Remote Sensing of Environment and Digital Cities, Research Centre of Geospatial Cognition and Visual Analytics, and Faculty of Geographical Science, Beijing Normal University, Beijing, China
Zhicheng Zhan: State Key Laboratory of Remote Sensing Science, Beijing Key Laboratory for Remote Sensing of Environment and Digital Cities, Research Centre of Geospatial Cognition and Visual Analytics, and Faculty of Geographical Science, Beijing Normal University, Beijing, China

DOI: https://doi.org/10.1080/10095020.2022.2147028
Journal volume & issue: Vol. 27, no. 4
pp. 1309 – 1325

Abstract

Read online

Visual Attention Prediction (VAP) is widely applied in GIS research, such as navigation task identification and driver assistance systems. Previous studies commonly took color information to detect the visual saliency of natural scene images. However, these studies rarely considered adaptively feature integration to different geospatial scenes in specific tasks. To better predict visual attention while driving tasks, in this paper, we firstly propose an Adaptive Feature Integration Fully Convolutional Network (AdaFI-FCN) using Scene-Adaptive Weights (SAW) to integrate RGB-D, motion and semantic features. The quantitative comparison results on the DR(eye)VE dataset show that the proposed framework achieved the best accuracy and robustness performance compared with state-of-the-art models (AUC-Judd = 0.971, CC = 0.767, KL = 1.046, SIM = 0.579). In addition, the experimental results of the ablation study demonstrated the positive effect of the SAW method on the prediction robustness in response to scene changes. The proposed model has the potential to benefit adaptive VAP research in universal geospatial scenes, such as AR-aided navigation, indoor navigation, and street-view image reading.

Published in Geo-spatial Information Science

ISSN: 1009-5020 (Print); 1993-5153 (Online)
Publisher: Taylor & Francis Group
Country of publisher: United Kingdom
LCC subjects: Geography. Anthropology. Recreation: Mathematical geography. Cartography; Science: Astronomy: Geodesy
Website: https://www.tandfonline.com/journals/tgsi

About the journal

Abstract

Keywords