Structured data vs. unstructured data in machine learning prediction models for suicidal behaviors: A systematic review and meta-analysis

Danielle Hopkins; Debra J. Rickwood; David J. Hallford; Clare Watsford

doi:10.3389/fdgth.2022.945006

Frontiers in Digital Health (Aug 2022)

Structured data vs. unstructured data in machine learning prediction models for suicidal behaviors: A systematic review and meta-analysis

Danielle Hopkins,
Debra J. Rickwood,
David J. Hallford,
Clare Watsford

Affiliations

Danielle Hopkins: Faculty of Health, University of Canberra, Canberra, ACT, Australia
Debra J. Rickwood: Faculty of Health, University of Canberra, Canberra, ACT, Australia
David J. Hallford: Faculty of Health, Deakin University, Melbourne, VIC, Australia
Clare Watsford: Faculty of Health, University of Canberra, Canberra, ACT, Australia

DOI: https://doi.org/10.3389/fdgth.2022.945006
Journal volume & issue: Vol. 4

Abstract

Read online

Suicide remains a leading cause of preventable death worldwide, despite advances in research and decreases in mental health stigma through government health campaigns. Machine learning (ML), a type of artificial intelligence (AI), is the use of algorithms to simulate and imitate human cognition. Given the lack of improvement in clinician-based suicide prediction over time, advancements in technology have allowed for novel approaches to predicting suicide risk. This systematic review and meta-analysis aimed to synthesize current research regarding data sources in ML prediction of suicide risk, incorporating and comparing outcomes between structured data (human interpretable such as psychometric instruments) and unstructured data (only machine interpretable such as electronic health records). Online databases and gray literature were searched for studies relating to ML and suicide risk prediction. There were 31 eligible studies. The outcome for all studies combined was AUC = 0.860, structured data showed AUC = 0.873, and unstructured data was calculated at AUC = 0.866. There was substantial heterogeneity between the studies, the sources of which were unable to be defined. The studies showed good accuracy levels in the prediction of suicide risk behavior overall. Structured data and unstructured data also showed similar outcome accuracy according to meta-analysis, despite different volumes and types of input data.

Published in Frontiers in Digital Health

ISSN: 2673-253X (Online)
Publisher: Frontiers Media S.A.
Country of publisher: Switzerland
LCC subjects: Medicine: Public aspects of medicine; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://www.frontiersin.org/journals/digital-health#

About the journal

Abstract

Keywords