PLoS ONE (Jan 2022)
Processing incomplete questionnaire data into continuous digital biomarkers for addiction monitoring.
Abstract
PurposeeHealth systems allow efficient daily smartphone-based collection of self-reported data on mood, wellbeing, routines, and motivation; however, missing data is frequent. Within addictive disorders, missing data may reflect lack of motivation to stay sober. We hypothesize that qualitative questionnaire data contains valuable information, which after proper handling of missing data becomes more useful for practitioners.MethodsAnonymized data from daily questionnaires containing 11 questions was collected with an eHealth system for 751 patients with alcohol use disorder (AUD). Two digital continuous biomarkers were composed from 9 wellbeing questions (WeBe-i) and from two questions representing motivation/self-confidence to remain sober (MotSC-i). To investigate possible loss of information in the process of composing the digital biomarkers, performance of neural networks to predict exacerbation events (relapse) in alcohol use disorder was compared.ResultsLong short-term memory (LSTM) neural networks predicted a coming exacerbation event 1-3 days (AUC 0.68-0.70) and 5-7 days (AUC 0.65-0.68) in advance on unseen patients. The predictive capability of digital biomarkers and raw questionnaire data was equal, indicating no loss of information. The transformation into digital biomarkers enable a continuous graphical display of each patient's clinical course and a combined interpretation of qualitative and quantitative aspects of recovery on a time scale.ConclusionBy transforming questionnaire data with large proportion of missing data into continuous digital biomarkers, the information captured by questionnaires can be more easily used in clinical practice. Information, assessed by the capability to predict exacerbation events of AUD, is preserved when processing raw questionnaire data into digital biomarkers.