Performance of machine learning algorithms for dementia assessment: impacts of language tasks, recording media, and modalities

Mahboobeh (Mah) Parsapoor (Parsa); Muhammad Raisul Alam; Alex Mihailidis

doi:10.1186/s12911-023-02122-6

BMC Medical Informatics and Decision Making (Mar 2023)

Affiliations

Mahboobeh (Mah) Parsapoor (Parsa): Centre de Recherche Informatique de Montréal (CRIM)
Muhammad Raisul Alam: Department of Computer Science, University of Toronto
Alex Mihailidis: Department of Occupational Science and Occupational Therapy, University of Toronto

DOI: https://doi.org/10.1186/s12911-023-02122-6
Journal volume & issue: Vol. 23, no. 1
pp. 1 – 19

Abstract

Objectives: Automatic speech and language assessment methods (SLAMs) can help clinicians assess speech and language impairments associated with dementia in older adults. The basis of any automatic SLAM is a machine learning (ML) classifier trained on participants’ speech and language. However, language tasks, recording media, and modalities all affect the performance of ML classifiers. This research therefore evaluates the effects of these three factors on the performance of ML classifiers that can be used for dementia assessment.

Methodology: Our methodology includes the following steps: (1) collecting speech and language datasets from patients and healthy controls; (2) applying feature engineering methods, namely feature extraction methods to extract linguistic and acoustic features and feature selection methods to select the most informative features; (3) training different ML classifiers; and (4) evaluating the performance of the ML classifiers to investigate the impacts of language tasks, recording media, and modalities on dementia assessment.

Results: Our results show that (1) the ML classifiers trained with the picture description language task perform better than the classifiers trained with the story recall language task; (2) data obtained from phone-based recordings improve the performance of ML classifiers compared to data obtained from web-based recordings; and (3) the ML classifiers trained with acoustic features perform better than the classifiers trained with linguistic features.

Conclusion: This research demonstrates that we can improve the performance of automatic SLAMs as dementia assessment methods if we: (1) use the picture description task to obtain participants’ speech; (2) collect participants’ voices via phone-based recordings; and (3) train ML classifiers using only acoustic features. Our proposed methodology will help future researchers investigate the impacts of different factors on the performance of ML classifiers for assessing dementia.
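
The four methodology steps (collect data, select features, train a classifier, evaluate) can be illustrated with a minimal sketch. This is not the authors' implementation: the data are randomly generated rather than real speech features, the nearest-centroid classifier and the mean-difference feature ranking are simple stand-ins chosen so that the example needs only the Python standard library, and every function name and parameter value here is hypothetical. The two "conditions" stand in for any pair of settings one might compare (for example, two language tasks or two recording media).

```python
import random
import statistics


def make_synthetic(n_per_class, n_features, n_informative, shift, seed):
    """Toy data only: label 1 = patient, 0 = control. Not real speech features."""
    rng = random.Random(seed)
    X, y = [], []
    for label in (0, 1):
        for _ in range(n_per_class):
            # Only the first n_informative features differ between the classes.
            row = [rng.gauss(shift * label if j < n_informative else 0.0, 1.0)
                   for j in range(n_features)]
            X.append(row)
            y.append(label)
    return X, y


def select_features(X, y, k):
    """Keep the k features with the largest standardized class-mean difference."""
    scores = []
    for j in range(len(X[0])):
        a = [row[j] for row, lab in zip(X, y) if lab == 0]
        b = [row[j] for row, lab in zip(X, y) if lab == 1]
        spread = (statistics.pstdev(a) + statistics.pstdev(b)) / 2 or 1.0
        scores.append((abs(statistics.fmean(a) - statistics.fmean(b)) / spread, j))
    return [j for _, j in sorted(scores, reverse=True)[:k]]


def fit_centroids(X, y, cols):
    """Train a nearest-centroid classifier: one mean vector per class."""
    centroids = {}
    for lab in set(y):
        rows = [[row[j] for j in cols] for row, l in zip(X, y) if l == lab]
        centroids[lab] = [statistics.fmean(col) for col in zip(*rows)]
    return centroids


def predict(centroids, row, cols):
    """Assign the label of the closest centroid (squared Euclidean distance)."""
    x = [row[j] for j in cols]
    return min(centroids,
               key=lambda lab: sum((xi - ci) ** 2
                                   for xi, ci in zip(x, centroids[lab])))


def cross_val_accuracy(X, y, k_features=5, n_folds=5, seed=0):
    """k-fold accuracy; feature selection is redone inside each training fold."""
    idx = list(range(len(X)))
    random.Random(seed).shuffle(idx)
    folds = [idx[i::n_folds] for i in range(n_folds)]
    correct = 0
    for fold in folds:
        held_out = set(fold)
        train = [i for i in idx if i not in held_out]
        X_train = [X[i] for i in train]
        y_train = [y[i] for i in train]
        cols = select_features(X_train, y_train, k_features)
        centroids = fit_centroids(X_train, y_train, cols)
        correct += sum(predict(centroids, X[i], cols) == y[i] for i in fold)
    return correct / len(X)


if __name__ == "__main__":
    # Two hypothetical conditions with arbitrary class separations (shift).
    for name, shift in (("condition_a", 1.0), ("condition_b", 0.3)):
        X, y = make_synthetic(n_per_class=40, n_features=20,
                              n_informative=5, shift=shift, seed=1)
        print(name, round(cross_val_accuracy(X, y), 3))
```

Selecting features inside each training fold, rather than once on the full dataset, keeps the held-out samples from influencing which features are chosen. The printed accuracies reflect only the arbitrary `shift` values in this toy setup and say nothing about the study's findings.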

Published in BMC Medical Informatics and Decision Making

ISSN: 1472-6947 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics
Website: http://bmcmedinformdecismak.biomedcentral.com