Mathematical Biosciences and Engineering (Sep 2022)

Disease prediction based on multi-type data fusion from Chinese electronic health record

  • Zhaoyu Liang,
  • Zhichang Zhang ,
  • Haoyuan Chen ,
  • Ziqin Zhang

DOI
https://doi.org/10.3934/mbe.2022640
Journal volume & issue
Vol. 19, no. 12
pp. 13732 – 13746

Abstract

Read online

Disease prediction by using a variety of healthcare data to assist doctors in disease diagnosis is becoming a more and more important research topic recently. This paper proposes a disease prediction model that fuses multiple types of encoded representations of Chinese electronic health records (EHRs). The model framework utilizes a multi-head self-attention mechanism, which combines textual and numerical features to enhance text representations. The BiLSTM-CRF and TextCNN models are used, respectively, to extract entities and then obtain the embedding representations of them. The representations of text and entities in it are combined together for formulating representations of EHRs. The experimental results on EHRs data collected from a Three Grade Class B Hospital General in Gansu Province, China, show that our model achieved an F1 score of 91.92%, which outperforms the previous baseline methods.

Keywords