Toward explainable AI (XAI) for mental health detection based on language behavior

Elma Kerz; Sourabh Zanwar; Yu Qiao; Daniel Wiechmann

doi:10.3389/fpsyt.2023.1219479

Frontiers in Psychiatry (Dec 2023)

Toward explainable AI (XAI) for mental health detection based on language behavior

Elma Kerz,
Sourabh Zanwar,
Yu Qiao,
Daniel Wiechmann

Affiliations

Elma Kerz: Department of English and American Studies, RWTH Aachen University, Aachen, North Rhine-Westphalia, Germany
Sourabh Zanwar: Department of English and American Studies, RWTH Aachen University, Aachen, North Rhine-Westphalia, Germany
Yu Qiao: Department of English and American Studies, RWTH Aachen University, Aachen, North Rhine-Westphalia, Germany
Daniel Wiechmann: Institute for Logic, Language and Computation, University of Amsterdam, Amsterdam, Netherlands

DOI: https://doi.org/10.3389/fpsyt.2023.1219479
Journal volume & issue: Vol. 14

Abstract

Read online

Advances in artificial intelligence (AI) in general and Natural Language Processing (NLP) in particular are paving the new way forward for the automated detection and prediction of mental health disorders among the population. Recent research in this area has prioritized predictive accuracy over model interpretability by relying on deep learning methods. However, prioritizing predictive accuracy over model interpretability can result in a lack of transparency in the decision-making process, which is critical in sensitive applications such as healthcare. There is thus a growing need for explainable AI (XAI) approaches to psychiatric diagnosis and prediction. The main aim of this work is to address a gap by conducting a systematic investigation of XAI approaches in the realm of automatic detection of mental disorders from language behavior leveraging textual data from social media. In pursuit of this aim, we perform extensive experiments to evaluate the balance between accuracy and interpretability across predictive mental health models. More specifically, we build BiLSTM models trained on a comprehensive set of human-interpretable features, encompassing syntactic complexity, lexical sophistication, readability, cohesion, stylistics, as well as topics and sentiment/emotions derived from lexicon-based dictionaries to capture multiple dimensions of language production. We conduct extensive feature ablation experiments to determine the most informative feature groups associated with specific mental health conditions. We juxtapose the performance of these models against a “black-box” domain-specific pretrained transformer adapted for mental health applications. To enhance the interpretability of the transformers models, we utilize a multi-task fusion learning framework infusing information from two relevant domains (emotion and personality traits). Moreover, we employ two distinct explanation techniques: the local interpretable model-agnostic explanations (LIME) method and a model-specific self-explaining method (AGRAD). These methods allow us to discern the specific categories of words that the information-infused models rely on when generating predictions. Our proposed approaches are evaluated on two public English benchmark datasets, subsuming five mental health conditions (attention-deficit/hyperactivity disorder, anxiety, bipolar disorder, depression and psychological stress).

Published in Frontiers in Psychiatry

ISSN: 1664-0640 (Online)
Publisher: Frontiers Media S.A.
Country of publisher: Switzerland
LCC subjects: Medicine: Internal medicine: Neurosciences. Biological psychiatry. Neuropsychiatry: Neurology. Diseases of the nervous system: Psychiatry
Website: https://www.frontiersin.org/journals/psychiatry

About the journal

Abstract

Keywords