Deep Bayesian Gaussian processes for uncertainty estimation in electronic health records

Yikuan Li; Shishir Rao; Abdelaali Hassaine; Rema Ramakrishnan; Dexter Canoy; Gholamreza Salimi-Khorshidi; Mohammad Mamouei; Thomas Lukasiewicz; Kazem Rahimi

doi:10.1038/s41598-021-00144-6

Scientific Reports (Oct 2021)

Deep Bayesian Gaussian processes for uncertainty estimation in electronic health records

Yikuan Li,
Shishir Rao,
Abdelaali Hassaine,
Rema Ramakrishnan,
Dexter Canoy,
Gholamreza Salimi-Khorshidi,
Mohammad Mamouei,
Thomas Lukasiewicz,
Kazem Rahimi

Affiliations

Yikuan Li: Deep Medicine, Oxford Martin School, University of Oxford
Shishir Rao: Deep Medicine, Oxford Martin School, University of Oxford
Abdelaali Hassaine: Deep Medicine, Oxford Martin School, University of Oxford
Rema Ramakrishnan: Deep Medicine, Oxford Martin School, University of Oxford
Dexter Canoy: Deep Medicine, Oxford Martin School, University of Oxford
Gholamreza Salimi-Khorshidi: Deep Medicine, Oxford Martin School, University of Oxford
Mohammad Mamouei: Deep Medicine, Oxford Martin School, University of Oxford
Thomas Lukasiewicz: Department of Computer Science, University of Oxford
Kazem Rahimi: Deep Medicine, Oxford Martin School, University of Oxford

DOI: https://doi.org/10.1038/s41598-021-00144-6
Journal volume & issue: Vol. 11, no. 1
pp. 1 – 13

Abstract

Read online

Abstract One major impediment to the wider use of deep learning for clinical decision making is the difficulty of assigning a level of confidence to model predictions. Currently, deep Bayesian neural networks and sparse Gaussian processes are the main two scalable uncertainty estimation methods. However, deep Bayesian neural networks suffer from lack of expressiveness, and more expressive models such as deep kernel learning, which is an extension of sparse Gaussian process, captures only the uncertainty from the higher-level latent space. Therefore, the deep learning model under it lacks interpretability and ignores uncertainty from the raw data. In this paper, we merge features of the deep Bayesian learning framework with deep kernel learning to leverage the strengths of both methods for a more comprehensive uncertainty estimation. Through a series of experiments on predicting the first incidence of heart failure, diabetes and depression applied to large-scale electronic medical records, we demonstrate that our method is better at capturing uncertainty than both Gaussian processes and deep Bayesian neural networks in terms of indicating data insufficiency and identifying misclassifications, with a comparable generalization performance. Furthermore, by assessing the accuracy and area under the receiver operating characteristic curve over the predictive probability, we show that our method is less susceptible to making overconfident predictions, especially for the minority class in imbalanced datasets. Finally, we demonstrate how uncertainty information derived by the model can inform risk factor analysis towards model interpretability.

Published in Scientific Reports

ISSN: 2045-2322 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine; Science
Website: https://www.nature.com/srep/

About the journal