Heliyon (Feb 2024)
A two-stream deep model for automated ICD-9 code prediction in an intensive care unit
Abstract
Assigning medical codes for patients is essential for healthcare organizations, not only for billing purposes but also for maintaining accurate records of patients' medical histories and analyzing the outputs of certain procedures. Due to the abundance of disease codes, it can be laborious and time-consuming for medical specialists to manually assign these codes to each procedure. To address this problem, we discuss the automatic prediction of ICD-9 codes, the most popular and widely accepted system of medical coding. We introduce a two-stream deep learning framework specifically designed to analyze multi-modal data. This framework is applied to the extensive and publicly available MIMIC-III dataset, enabling us to leverage both numerical and text-based data for improved ICD-9 code prediction.Our system uses text representation models to understand the text-based medical records; the Gated Recurrent Unit (GRU) to model the numerical health records; and fuses these two streams to automatically predict the ICD-9 codes used in the intensive care unit. We discuss the preprocessing and classification methods and demonstrate that our proposed two-stream model outperforms other state-of-the-art studies in the literature.