Extreme Multi-Label ICD Classification: Sensitivity to Hospital Service and Time

Alberto Blanco; Alicia Perez; Arantza Casillas

doi:10.1109/ACCESS.2020.3029429

IEEE Access (Jan 2020)

Extreme Multi-Label ICD Classification: Sensitivity to Hospital Service and Time

Alberto Blanco,
Alicia Perez,
Arantza Casillas

Affiliations

Alberto Blanco: ORCiD; HiTZ Center-Ixa, University of the Basque Country (UPV/EHU), Donostia, Spain
Alicia Perez: ORCiD; HiTZ Center-Ixa, University of the Basque Country (UPV/EHU), Donostia, Spain
Arantza Casillas: HiTZ Center-Ixa, University of the Basque Country (UPV/EHU), Donostia, Spain

DOI: https://doi.org/10.1109/ACCESS.2020.3029429
Journal volume & issue: Vol. 8
pp. 183534 – 183545

Abstract

Read online

This work deals with clinical text mining for automatic classification of Electronic Health Records (EHRs) with respect to the International Classification of Diseases (ICD). ICD is the international standard for the identification of diseases and health conditions in EHRs and the foundation for reporting health statistics. Machine learning-based techniques have proven robust to infer classification models from EHRs. Since each EHR tends to involve multiple diseases, multi-label classification is required. The concern in this work is the versatility of the models inferred and their ability to generalise in two ways: as time goes ahead and across hospital services or health specialties. Indeed, in this work, we show the capabilities of a Bidirectional Recurrent Neural Network (RNN) with GRU units and ELMo embeddings on two corpora (a corpus comprising a set of EHRs within the Basque Health System, namely Osakidetza, and the well-known MIMIC-III corpus). To delve into and assess the versatility of the models, we focus on their resilience across hospital admissions taken over two different years and also across six distinct hospital services. In addition, we paid attention to the classification performance to estimate ICD codes of different granularity (e.g. with or without essential modifiers). Our best results are 39.55% and 47.28% F-Score for the Osakidetza and MIMIC-III datasets respectively, with the original main label-sets. Regarding the models evaluated per specialty, the most remarkable results are 57.00% and 72.74% F-Score, in the Cardiology and Nephrology medical services respectively.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords