Evaluating Accuracy in Five Commercial Sleep-Tracking Devices Compared to Research-Grade Actigraphy and Polysomnography

Kyle A. Kainec; Jamie Caccavaro; Morgan Barnes; Chloe Hoff; Annika Berlin; Rebecca M. C. Spencer

doi:10.3390/s24020635

Sensors (Jan 2024)

Evaluating Accuracy in Five Commercial Sleep-Tracking Devices Compared to Research-Grade Actigraphy and Polysomnography

Kyle A. Kainec,
Jamie Caccavaro,
Morgan Barnes,
Chloe Hoff,
Annika Berlin,
Rebecca M. C. Spencer

Affiliations

Kyle A. Kainec: Neuroscience & Behavior Program, French Hall, University of Massachusetts Amherst, 230 Stockbridge Road, Amherst, MA 01003, USA
Jamie Caccavaro: Department of Psychological and Brain Sciences, Tobin Hall, University of Massachusetts Amherst, 135 Hicks Way, Amherst, MA 01003, USA
Morgan Barnes: Institute for Applied Life Sciences, Life Science Laboratories, University of Massachusetts Amherst, 240 Thatcher Road, Amherst, MA 01003, USA
Chloe Hoff: Institute for Applied Life Sciences, Life Science Laboratories, University of Massachusetts Amherst, 240 Thatcher Road, Amherst, MA 01003, USA
Annika Berlin: Institute for Applied Life Sciences, Life Science Laboratories, University of Massachusetts Amherst, 240 Thatcher Road, Amherst, MA 01003, USA
Rebecca M. C. Spencer: Neuroscience & Behavior Program, French Hall, University of Massachusetts Amherst, 230 Stockbridge Road, Amherst, MA 01003, USA

DOI: https://doi.org/10.3390/s24020635
Journal volume & issue: Vol. 24, no. 2
p. 635

Abstract

Read online

The development of consumer sleep-tracking technologies has outpaced the scientific evaluation of their accuracy. In this study, five consumer sleep-tracking devices, research-grade actigraphy, and polysomnography were used simultaneously to monitor the overnight sleep of fifty-three young adults in the lab for one night. Biases and limits of agreement were assessed to determine how sleep stage estimates for each device and research-grade actigraphy differed from polysomnography-derived measures. Every device, except the Garmin Vivosmart, was able to estimate total sleep time comparably to research-grade actigraphy. All devices overestimated nights with shorter wake times and underestimated nights with longer wake times. For light sleep, absolute bias was low for the Fitbit Inspire and Fitbit Versa. The Withings Mat and Garmin Vivosmart overestimated shorter light sleep and underestimated longer light sleep. The Oura Ring underestimated light sleep of any duration. For deep sleep, bias was low for the Withings Mat and Garmin Vivosmart while other devices overestimated shorter and underestimated longer times. For REM sleep, bias was low for all devices. Taken together, these results suggest that proportional bias patterns in consumer sleep-tracking technologies are prevalent and could have important implications for their overall accuracy.

Published in Sensors

ISSN: 1424-8220 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Chemical technology
Website: http://www.mdpi.com/journal/sensors

About the journal

Abstract

Keywords