Predicting postoperative delirium assessed by the Nursing Screening Delirium Scale in the recovery room for non-cardiac surgeries without craniotomy: A retrospective study using a machine learning approach.

Niklas Giesa; Stefan Haufe; Mario Menk; Björn Weiß; Claudia D Spies; Sophie K Piper; Felix Balzer; Sebastian D Boie

doi:10.1371/journal.pdig.0000414

PLOS Digital Health (Aug 2024)

Predicting postoperative delirium assessed by the Nursing Screening Delirium Scale in the recovery room for non-cardiac surgeries without craniotomy: A retrospective study using a machine learning approach.

Niklas Giesa,
Stefan Haufe,
Mario Menk,
Björn Weiß,
Claudia D Spies,
Sophie K Piper,
Felix Balzer,
Sebastian D Boie

Affiliations

Niklas Giesa
Stefan Haufe
Mario Menk
Björn Weiß
Claudia D Spies
Sophie K Piper
Felix Balzer
Sebastian D Boie

DOI: https://doi.org/10.1371/journal.pdig.0000414
Journal volume & issue: Vol. 3, no. 8
p. e0000414

Abstract

Read online

Postoperative delirium (POD) contributes to severe outcomes such as death or development of dementia. Thus, it is desirable to identify vulnerable patients in advance during the perioperative phase. Previous studies mainly investigated risk factors for delirium during hospitalization and further used a linear logistic regression (LR) approach with time-invariant data. Studies have not investigated patients' fluctuating conditions to support POD precautions. In this single-center study, we aimed to predict POD in a recovery room setting with a non-linear machine learning (ML) technique using pre-, intra-, and postoperative data. The target variable POD was defined with the Nursing Screening Delirium Scale (Nu-DESC) ≥ 1. Feature selection was conducted based on robust univariate test statistics and L1 regularization. Non-linear multi-layer perceptron (MLP) as well as tree-based models were trained and evaluated-with the receiver operating characteristics curve (AUROC), the area under precision recall curve (AUPRC), and additional metrics-against LR and published models on bootstrapped testing data. The prevalence of POD was 8.2% in a sample of 73,181 surgeries performed between 2017 and 2020. Significant univariate impact factors were the preoperative ASA status (American Society of Anesthesiologists physical status classification system), the intraoperative amount of given remifentanil, and the postoperative Aldrete score. The best model used pre-, intra-, and postoperative data. The non-linear boosted trees model achieved a mean AUROC of 0.854 and a mean AUPRC of 0.418 outperforming linear LR, well as best applied and retrained baseline models. Overall, non-linear machine learning models using data from multiple perioperative time phases were superior to traditional ones in predicting POD in the recovery room. Class imbalance was seen as a main impediment for model application in clinical practice.

Published in PLOS Digital Health

ISSN: 2767-3170 (Online)
Publisher: Public Library of Science (PLoS)
Country of publisher: United States
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics
Website: https://journals.plos.org/digitalhealth/

About the journal