Zero-shot personalization of speech foundation models for depressed mood monitoring

Maurice Gerczuk; Andreas Triantafyllopoulos; Shahin Amiriparian; Alexander Kathan; Jonathan Bauer; Matthias Berking; Björn W. Schuller

Patterns (Nov 2023)

Zero-shot personalization of speech foundation models for depressed mood monitoring

Maurice Gerczuk,
Andreas Triantafyllopoulos,
Shahin Amiriparian,
Alexander Kathan,
Jonathan Bauer,
Matthias Berking,
Björn W. Schuller

Affiliations

Maurice Gerczuk: Chair of Embedded Intelligence for Healthcare and Wellbeing, University of Augsburg, Augsburg, Germany; Corresponding author
Andreas Triantafyllopoulos: Chair of Embedded Intelligence for Healthcare and Wellbeing, University of Augsburg, Augsburg, Germany
Shahin Amiriparian: Chair of Embedded Intelligence for Healthcare and Wellbeing, University of Augsburg, Augsburg, Germany
Alexander Kathan: Chair of Embedded Intelligence for Healthcare and Wellbeing, University of Augsburg, Augsburg, Germany
Jonathan Bauer: Department of Clinical Psychology and Psychotherapy, Friedrich-Alexander-Universität, Erlangen-Nürnberg, Erlangen, Germany
Matthias Berking: Department of Clinical Psychology and Psychotherapy, Friedrich-Alexander-Universität, Erlangen-Nürnberg, Erlangen, Germany
Björn W. Schuller: Chair of Embedded Intelligence for Healthcare and Wellbeing, University of Augsburg, Augsburg, Germany; GLAM, Imperial College, London, UK

Journal volume & issue: Vol. 4, no. 11
p. 100873

Abstract

Read online

Summary: The monitoring of depressed mood plays an important role as a diagnostic tool in psychotherapy. An automated analysis of speech can provide a non-invasive measurement of a patient’s affective state. While speech has been shown to be a useful biomarker for depression, existing approaches mostly build population-level models that aim to predict each individual’s diagnosis as a (mostly) static property. Because of inter-individual differences in symptomatology and mood regulation behaviors, these approaches are ill-suited to detect smaller temporal variations in depressed mood. We address this issue by introducing a zero-shot personalization of large speech foundation models. Compared with other personalization strategies, our work does not require labeled speech samples for enrollment. Instead, the approach makes use of adapters conditioned on subject-specific metadata. On a longitudinal dataset, we show that the method improves performance compared with a set of suitable baselines. Finally, applying our personalization strategy improves individual-level fairness. The bigger picture: Depression, as one of the most prevalent mental health diseases, negatively impacts millions of lives. Diagnoses are achieved by the assessment of symptoms with standardized tests. However, recent studies indicate that continuously monitoring symptoms (e.g., with ecological momentary assessments [EMAs]) may provide relevant additional information for both diagnosis and treatment decisions. More recently, these manual methods have been complemented by passive sensing methods. Here, speech can serve as a valuable objective marker because it has been shown to be impacted by various pathologies, such as anxiety and mood disorders, and can be collected non-invasively and cheaply. Existing machine learning methods that aim to measure mood, however, often fail to accurately model intra-individual variations, assuming that data are sourced from homogeneous populations. We introduce and evaluate an effective zero-shot personalization of speech foundation models that utilizes diagnostic information about each patient to improve per-speaker depressive mood recognition over a 2-week EMA period.

DSML 2: Proof-of-concept: Data science output has been formulated, implemented, and tested for one domain/problem

Published in Patterns

ISSN: 2666-3899 (Online)
Publisher: Elsevier
Country of publisher: United States
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science: Computer software
Website: https://www.cell.com/patterns

About the journal

Abstract

Keywords