Complexity (Jan 2021)

Missing Data Interpolation of Alzheimer’s Disease Based on Column-by-Column Mixed Mode

  • Shi-di Miao,
  • Si-qi Li,
  • Xu-yang Zheng,
  • Rui-tao Wang,
  • Jing Li,
  • Si-si Ding,
  • Jun-feng Ma

DOI
https://doi.org/10.1155/2021/3541516
Journal volume & issue
Vol. 2021

Abstract

Read online

Research on clinical data sets of Alzheimer’s disease can predict and develop early intervention treatment. Missing data is a common problem in medical research. Failure to deal with more missing data will reduce the efficiency of the test, resulting in information loss and result bias. To address these issues, this paper designs and implements the missing data interpolation method of mixed interpolation according to columns by combining the four methods of mean interpolation, regression interpolation, support vector machine (SVM) interpolation, and multiple interpolation. By comparing the effects of the mixed interpolation method with the above four interpolation methods and giving the comparison results, the experiment shows that the results of the mixed interpolation method under different data missing rates have better performance in terms of root mean square error (RMSE), mean absolute error (MSE), and error rate, which proves the effectiveness of the interpolation mechanism. The characteristics of different variables might lead to different interpolation strategy choices, and column-by-column mixed interpolation can dynamically select the best method according to the difference of features. To a certain extent, it selects the best method suitable for each feature and improves the interpolation effect of the data set as a whole, which is beneficial to the clinical study of Alzheimer’s disease. In addition, in the processing of missing data, a combination of deletion method and interpolation method is adopted with reference to expert knowledge. Compared with the direct interpolation method, the data set obtained by this method is more accurate.