Applied Sciences (Apr 2023)

On the Use of Population Data for Training Seasonal Local Models-Based Glucose Predictors: An <i>In Silico</i> Study

  • Antonio Aslan,
  • José-Luis Díez,
  • Alejandro José Laguna Sanz,
  • Jorge Bondia

DOI
https://doi.org/10.3390/app13095348
Journal volume & issue
Vol. 13, no. 9
p. 5348

Abstract

Read online

Most advanced technologies for the treatment of type 1 diabetes, such as sensor-pump integrated systems or the artificial pancreas, require accurate glucose predictions on a given future time-horizon as a basis for decision-making support systems. Seasonal stochastic models are data-driven algebraic models that use recent history data and periodic trends to accurately estimate time series data, such as glucose concentration in diabetes. These models have been proven to be a good option to provide accurate blood glucose predictions under free-living conditions. These models can cope with patient variability under variable-length time-stamped daily events in supervision and control applications. However, the seasonal-models-based framework usually needs of several months of data per patient to be fed into the system to adequately train a personalized glucose predictor for each patient. In this work, an in silico analysis of the accuracy of prediction is presented, considering the effect of training a glucose predictor with data from a cohort of patients (population) instead of data from a single patient (individual). Feasibility of population data as an input to the model is asserted, and the effect of the dataset size in the determination of the minimum amount of data for a valid training of the models is studied. Results show that glucose predictors trained with population data can provide predictions of similar magnitude as those trained with individualized data. Overall median root mean squared error (RMSE) (including 25% and 75% percentiles) for the predictor trained with population data are {6.96[4.87,8.67], 12.49[7.96,14.23], 19.52[10.62,23.37], 28.79[12.96,34.57], 32.3[16.20,41.59], 28.8[15.13,37.18]} mg/dL, for prediction horizons (PH) of {15,30,60,120,180,240} min, respectively, while the baseline of the individually trained RMSE results are {6.37[5.07,6.70], 11.27[8.35,12.65], 17.44[11.08,20.93], 22.72[14.29,28.19], 28.45[14.79,34.38], 25.58[13.10,36.60]} mg/dL, both training with 16 weeks of data. Results also show that the use of the population approach reduces the required training data by half, without losing any prediction capability.

Keywords