Development and external validation of a pretrained deep learning model for the prediction of non-accidental trauma

David Huang; Steven Cogill; Renee Y. Hsia; Samuel Yang; David Kim

doi:10.1038/s41746-023-00875-y

npj Digital Medicine (Jul 2023)

Development and external validation of a pretrained deep learning model for the prediction of non-accidental trauma

David Huang,
Steven Cogill,
Renee Y. Hsia,
Samuel Yang,
David Kim

Affiliations

David Huang: Department of Computer Science, Stanford University
Steven Cogill: Department of Veterans Affairs
Renee Y. Hsia: Department of Emergency Medicine, UCSF School of Medicine
Samuel Yang: Department of Emergency Medicine, Stanford University
David Kim: Department of Emergency Medicine, Stanford University

DOI: https://doi.org/10.1038/s41746-023-00875-y
Journal volume & issue: Vol. 6, no. 1
pp. 1 – 9

Abstract

Read online

Abstract Non-accidental trauma (NAT) is deadly and difficult to predict. Transformer models pretrained on large datasets have recently produced state of the art performance on diverse prediction tasks, but the optimal pretraining strategies for diagnostic predictions are not known. Here we report the development and external validation of Pretrained and Adapted BERT for Longitudinal Outcomes (PABLO), a transformer-based deep learning model with multitask clinical pretraining, to identify patients who will receive a diagnosis of NAT in the next year. We develop a clinical interface to visualize patient trajectories, model predictions, and individual risk factors. In two comprehensive statewide databases, approximately 1% of patients experience NAT within one year of prediction. PABLO predicts NAT events with area under the receiver operating characteristic curve (AUROC) of 0.844 (95% CI 0.838–0.851) in the California test set, and 0.849 (95% CI 0.846–0.851) on external validation in Florida, outperforming comparator models. Multitask pretraining significantly improves model performance. Attribution analysis shows substance use, psychiatric, and injury diagnoses, in the context of age and racial demographics, as influential predictors of NAT. As a clinical decision support system, PABLO can identify high-risk patients and patient-specific risk factors, which can be used to target secondary screening and preventive interventions at the point-of-care.

Published in npj Digital Medicine

ISSN: 2398-6352 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics
Website: https://www.nature.com/npjdigitalmed/

About the journal