Development and evaluation of uncertainty quantifying machine learning models to predict piperacillin plasma concentrations in critically ill patients

Jarne Verhaeghe; Sofie A. M. Dhaese; Thomas De Corte; David Vander Mijnsbrugge; Heleen Aardema; Jan G. Zijlstra; Alain G. Verstraete; Veronique Stove; Pieter Colin; Femke Ongenae; Jan J. De Waele; Sofie Van Hoecke

doi:10.1186/s12911-022-01970-y

BMC Medical Informatics and Decision Making (Aug 2022)

Development and evaluation of uncertainty quantifying machine learning models to predict piperacillin plasma concentrations in critically ill patients

Jarne Verhaeghe,
Sofie A. M. Dhaese,
Thomas De Corte,
David Vander Mijnsbrugge,
Heleen Aardema,
Jan G. Zijlstra,
Alain G. Verstraete,
Veronique Stove,
Pieter Colin,
Femke Ongenae,
Jan J. De Waele,
Sofie Van Hoecke

Affiliations

Jarne Verhaeghe: IDLab, Department of Information Technology, Ghent University - imec
Sofie A. M. Dhaese: Department of Internal Medicine and Pediatrics, Ghent University
Thomas De Corte: Department of Internal Medicine and Pediatrics, Ghent University
David Vander Mijnsbrugge: IDLab, Department of Information Technology, Ghent University - imec
Heleen Aardema: Department of Critical Care, University Medical Center Groningen
Jan G. Zijlstra: Department of Critical Care, University Medical Center Groningen
Alain G. Verstraete: Department of Diagnostic Sciences, Ghent University
Veronique Stove: Department of Diagnostic Sciences, Ghent University
Pieter Colin: Department of Anesthesiology, University Medical Center Groningen
Femke Ongenae: IDLab, Department of Information Technology, Ghent University - imec
Jan J. De Waele: Department of Critical Care Medicine, Ghent University Hospital
Sofie Van Hoecke: IDLab, Department of Information Technology, Ghent University - imec

DOI: https://doi.org/10.1186/s12911-022-01970-y
Journal volume & issue: Vol. 22, no. 1
pp. 1 – 17

Abstract

Read online

Abstract Background Beta-lactam antimicrobial concentrations are frequently suboptimal in critically ill patients. Population pharmacokinetic (PopPK) modeling is the golden standard to predict drug concentrations. However, currently available PopPK models often lack predictive accuracy, making them less suited to guide dosing regimen adaptations. Furthermore, many currently developed models for clinical applications often lack uncertainty quantification. We, therefore, aimed to develop machine learning (ML) models for the prediction of piperacillin plasma concentrations while also providing uncertainty quantification with the aim of clinical practice. Methods Blood samples for piperacillin analysis were prospectively collected from critically ill patients receiving continuous infusion of piperacillin/tazobactam. Interpretable ML models for the prediction of piperacillin concentrations were designed using CatBoost and Gaussian processes. Distribution-based Uncertainty Quantification was added to the CatBoost model using a proposed Quantile Ensemble method, useable for any model optimizing a quantile function. These models are subsequently evaluated using the distribution coverage error, a proposed interpretable uncertainty quantification calibration metric. Development and internal evaluation of the ML models were performed on the Ghent University Hospital database (752 piperacillin concentrations from 282 patients). Ensuing, ML models were compared with a published PopPK model on a database from the University Medical Centre of Groningen where a different dosing regimen is used (46 piperacillin concentrations from 15 patients.). Results The best performing model was the Catboost model with an RMSE and $$R^2$$ R 2 of 31.94–0.64 and 33.53–0.60 for internal evaluation with and without previous concentration. Furthermore, the results prove the added value of the proposed Quantile Ensemble model in providing clinically useful individualized uncertainty predictions and show the limits of homoscedastic methods like Gaussian Processes in clinical applications. Conclusions Our results show that ML models can consistently estimate piperacillin concentrations with acceptable and high predictive accuracy when identical dosing regimens as in the training data are used while providing highly relevant uncertainty predictions. However, generalization capabilities to other dosing schemes are limited. Notwithstanding, incorporating ML models in therapeutic drug monitoring programs seems definitely promising and the current work provides a basis for validating the model in clinical practice.

Published in BMC Medical Informatics and Decision Making

ISSN: 1472-6947 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics
Website: http://bmcmedinformdecismak.biomedcentral.com

About the journal

Abstract

Keywords