Representation learning for clinical time series prediction tasks in electronic health records

Tong Ruan; Liqi Lei; Yangming Zhou; Jie Zhai; Le Zhang; Ping He; Ju Gao

doi:10.1186/s12911-019-0985-7

BMC Medical Informatics and Decision Making (Dec 2019)

Representation learning for clinical time series prediction tasks in electronic health records

Tong Ruan,
Liqi Lei,
Yangming Zhou,
Jie Zhai,
Le Zhang,
Ping He,
Ju Gao

Affiliations

Tong Ruan: School of Information Science and Engineering, East China University of Science and Technology
Liqi Lei: School of Information Science and Engineering, East China University of Science and Technology
Yangming Zhou: School of Information Science and Engineering, East China University of Science and Technology
Jie Zhai: School of Information Science and Engineering, East China University of Science and Technology
Le Zhang: School of Information Science and Engineering, East China University of Science and Technology
Ping He: Shanghai Hospital Development Center
Ju Gao: Shuguang Hospital Affiliated to Shanghai University of Traditional Chinese Medicine

DOI: https://doi.org/10.1186/s12911-019-0985-7
Journal volume & issue: Vol. 19, no. S8
pp. 1 – 14

Abstract

Read online

Abstract Background Electronic health records (EHRs) provide possibilities to improve patient care and facilitate clinical research. However, there are many challenges faced by the applications of EHRs, such as temporality, high dimensionality, sparseness, noise, random error and systematic bias. In particular, temporal information is difficult to effectively use by traditional machine learning methods while the sequential information of EHRs is very useful. Method In this paper, we propose a general-purpose patient representation learning approach to summarize sequential EHRs. Specifically, a recurrent neural network based denoising autoencoder (RNN-DAE) is employed to encode inhospital records of each patient into a low dimensional dense vector. Results Based on EHR data collected from Shuguang Hospital affiliated to Shanghai University of Traditional Chinese Medicine, we experimentally evaluate our proposed RNN-DAE method on both mortality prediction task and comorbidity prediction task. Extensive experimental results show that our proposed RNN-DAE method outperforms existing methods. In addition, we apply the “Deep Feature” represented by our proposed RNN-DAE method to track similar patients with t-SNE, which also achieves some interesting observations. Conclusion We propose an effective unsupervised RNN-DAE method to summarize patient sequential information in EHR data. Our proposed RNN-DAE method is useful on both mortality prediction task and comorbidity prediction task.

Published in BMC Medical Informatics and Decision Making

ISSN: 1472-6947 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics
Website: http://bmcmedinformdecismak.biomedcentral.com

About the journal

Abstract

Keywords