BioMedical Engineering OnLine (Feb 2025)
Cross-evaluation of wearable data for use in Parkinson’s disease research: a free-living observational study on Empatica E4, Fitbit Sense, and Oura
Abstract
Abstract Background Established assessment scales used for Parkinson’s disease (PD) have several limitations in tracking symptom progression and fluctuation. Both research and commercial-grade wearables show potential in improving these assessments. However, it is not known whether pervasive and affordable devices can deliver reliable data, suitable for designing open-source unobtrusive around-the-clock assessments. Our aim is to investigate the usefulness of the research-grade wristband Empatica E4, commercial-grade smartwatch Fitbit Sense, and the Oura ring, for PD research. Method The study included participants with PD (N = 15) and neurologically healthy controls (N = 16). Data were collected using established assessment scales (Movement Disorders Society Unified Parkinson’s Disease Rating Scale, Montreal Cognitive Assessment, REM Sleep Behavior Disorder Screening Questionnaire, Hoehn and Yahr Stage), self-reported diary (activities, symptoms, sleep, medication times), and 2-week digital data from the three devices collected simultaneously. The analyses comprised three steps: preparation (device characteristics assessment, data extraction and preprocessing), processing (data structuring and visualization, cross-correlation analysis, diary comparison, uptime calculation), and evaluation (usability, availability, statistical analyses). Results We found large variation in data characteristics and unsatisfactory cross-correlation. Due to output incongruences, only heart rate and movement could be assessed across devices. Empatica E4 and Fitbit Sense outperformed Oura in reflecting self-reported activities. Results show a weak output correlation and significant differences. The uptime was good, but Oura did not record heart rate and movement concomitantly. We also found variation in terms of access to raw data, sampling rate and level of device-native processing, ease of use, retrieval of data, and design. We graded the system usability of Fitbit Sense as good, Empatica E4 as poor, with Oura in the middle. Conclusions In this study we identified a set of characteristics necessary for PD research: ease of handling, cleaning, data retrieval, access to raw data, score calculation transparency, long battery life, sufficient storage, higher sampling frequencies, software and hardware reliability, transparency. The three analyzed devices are not interchangeable and, based on data features, none were deemed optimal for PD research, but they all have the potential to provide suitable specifications in future iterations.
Keywords