Digital Biomarkers (Dec 2020)

Repeatability of Commonly Used Speech and Language Features for Clinical Applications

  • Gabriela M. Stegmann,
  • Shira Hahn,
  • Julie Liss,
  • Jeremy Shefner,
  • Seward B. Rutkove,
  • Kan Kawabata,
  • Samarth Bhandari,
  • Kerisa Shelton,
  • Cayla Jessica Duncan,
  • Visar Berisha

DOI
https://doi.org/10.1159/000511671
Journal volume & issue
Vol. 4, no. 3
pp. 109 – 122

Abstract

Read online

Introduction: Changes in speech have the potential to provide important information on the diagnosis and progression of various neurological diseases. Many researchers have relied on open-source speech features to develop algorithms for measuring speech changes in clinical populations as they are convenient and easy to use. However, the repeatability of open-source features in the context of neurological diseases has not been studied. Methods: We used a longitudinal sample of healthy controls, individuals with amyotrophic lateral sclerosis, and individuals with suspected frontotemporal dementia, and we evaluated the repeatability of acoustic and language features separately on these 3 data sets. Results: Repeatability was evaluated using intraclass correlation (ICC) and the within-subjects coefficient of variation (WSCV). In 3 sets of tasks, the median ICC were between 0.02 and 0.55, and the median WSCV were between 29 and 79%. Conclusion: Our results demonstrate that the repeatability of speech features extracted using open-source tool kits is low. Researchers should exercise caution when developing digital health models with open-source speech features. We provide a detailed summary of feature-by-feature repeatability results (ICC, WSCV, SE of measurement, limits of agreement for WSCV, and minimal detectable change) in the online supplementary material so that researchers may incorporate repeatability information into the models they develop.

Keywords