Applied Sciences (May 2019)

Enhanced Application of Principal Component Analysis in Machine Learning for Imputation of Missing Traffic Data

  • Yoon-Young Choi,
  • Heeseung Shon,
  • Young-Ji Byon,
  • Dong-Kyu Kim,
  • Seungmo Kang

DOI
https://doi.org/10.3390/app9102149
Journal volume & issue
Vol. 9, no. 10
p. 2149

Abstract

Missing value imputation is widely used to maintain the quality of traffic data. Although approaches based on spatiotemporal dependency can improve imputation performance for large and continuous missing patterns, additionally considering traffic states can lead to more reliable results. To improve imputation performance further, a section-based approach is also needed. This study proposes a novel approach that identifies the traffic states of the different spots that comprise a road section, collectively termed a section-based traffic state (SBTS), and determines spatiotemporal dependencies customized for each SBTS for missing value imputation. Principal component analysis (PCA) was employed, and the angles obtained from the first principal component were used to identify the SBTSs. This pre-processing was combined with a support vector machine to develop the imputation model. By segmenting the SBTSs using these angles and considering the spatiotemporal dependency of each state, the proposed approach outperformed other existing models.
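
As a rough illustration of the angle-based step described in the abstract, the sketch below computes the orientation of the first principal component for a set of two-dimensional observations. It is only a minimal sketch under stated assumptions: the abstract does not say which variables enter the PCA, so the paired readings, the function name first_pc_angle, and the sample values are all hypothetical. For two variables, the first principal axis of the covariance matrix has the closed form theta = 0.5 * atan2(2 * s_xy, s_xx - s_yy), which avoids the need for a linear algebra library.

```python
import math


def first_pc_angle(points):
    """Return the angle in degrees, within [0, 180), of the first principal
    component of a set of 2-D points.

    Hypothetical helper for illustration only; it is not the authors' code.
    """
    n = len(points)
    mean_x = sum(p[0] for p in points) / n
    mean_y = sum(p[1] for p in points) / n

    # Centered second moments. The common 1/(n-1) factor is omitted
    # because it does not change the direction of the principal axis.
    s_xx = sum((p[0] - mean_x) ** 2 for p in points)
    s_yy = sum((p[1] - mean_y) ** 2 for p in points)
    s_xy = sum((p[0] - mean_x) * (p[1] - mean_y) for p in points)

    # Orientation of the major axis of the 2x2 covariance matrix.
    theta = 0.5 * math.atan2(2.0 * s_xy, s_xx - s_yy)

    # A principal axis has no sign, so fold the angle into [0, 180).
    return math.degrees(theta) % 180.0


# Illustrative (made-up) paired readings from two spots in one road section.
sample = [(60.0, 58.0), (55.0, 54.0), (40.0, 41.0), (30.0, 29.0)]
print(round(first_pc_angle(sample), 2))
```

In a workflow like the one the abstract outlines, angles of this kind would then be grouped into states, and a separate imputation model would be fitted for each state. The grouping boundaries and model settings are not given in the abstract, so they are left out of this sketch.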

Keywords