Effective Natural Language Processing and Interpretable Machine Learning for Structuring CT Liver-Tumor Reports

Yi-Hsuan Chuang; Ja-Hwung Su; Ding-Hong Han; Yi-Wen Liao; Yeong-Chyi Lee; Yu-Fan Cheng; Tzung-Pei Hong; Katherine Shu-Min Li; Hsin-You Ou; Yi Lu; Chih-Chi Wang

doi:10.1109/ACCESS.2022.3218646

IEEE Access (Jan 2022)

Affiliations

Yi-Hsuan Chuang: Liver Transplantation Program, Department of Diagnostic Radiology and Surgery, Kaohsiung Chang Gung Memorial Hospital, Chang Gung University College of Medicine, Niao-Sung, Kaohsiung, Taiwan
Ja-Hwung Su: Department of Computer Science and Information Engineering, National University of Kaohsiung, Kaohsiung, Taiwan
Ding-Hong Han: Department of Computer Science and Engineering, National Sun Yat-sen University, Kaohsiung, Taiwan
Yi-Wen Liao: Department of Intelligent Commerce, National Kaohsiung University of Science and Technology, Kaohsiung, Taiwan
Yeong-Chyi Lee: Department of Information Management, Cheng Shiu University, Kaohsiung, Taiwan
Yu-Fan Cheng: Liver Transplantation Program, Department of Diagnostic Radiology and Surgery, Kaohsiung Chang Gung Memorial Hospital, Chang Gung University College of Medicine, Niao-Sung, Kaohsiung, Taiwan
Tzung-Pei Hong: Department of Computer Science and Information Engineering, National University of Kaohsiung, Kaohsiung, Taiwan
Katherine Shu-Min Li: Department of Computer Science and Engineering, National Sun Yat-sen University, Kaohsiung, Taiwan
Hsin-You Ou: Liver Transplantation Program, Department of Diagnostic Radiology and Surgery, Kaohsiung Chang Gung Memorial Hospital, Chang Gung University College of Medicine, Niao-Sung, Kaohsiung, Taiwan
Yi Lu: Liver Transplantation Program, Department of Diagnostic Radiology and Surgery, Kaohsiung Chang Gung Memorial Hospital, Chang Gung University College of Medicine, Niao-Sung, Kaohsiung, Taiwan
Chih-Chi Wang: Liver Transplantation Center and Department of Surgery, Kaohsiung Chang Gung Memorial Hospital, Niao-Sung, Kaohsiung, Taiwan

Journal volume & issue: Vol. 10
pp. 116273 – 116286

Abstract

In the past, liver tumors were reported manually in an unstructured format. These reports contain much valuable knowledge for disease risk assessment, disease recognition, and treatment recommendation, yet unstructured reports are difficult to read and to mine. Extracting knowledge from such biomedical reports effectively and efficiently has therefore been a challenging issue for decades. Although a number of Natural Language Processing (NLP) techniques have been proposed for biomedical information retrieval, few works have addressed transforming unstructured CT liver-tumor reports into structured ones. To address this issue, we propose a two-stage report structuring method that integrates effective NLP with interpretable machine learning. In the first stage, candidate keywords are extracted from the unstructured reports, and the feature keywords are then determined by a feature-selection technique. In the second stage, well-known multi-classifiers are applied, and the reports are finally labeled in a refined structured format. Further, the factor keywords in the classification model are filtered to interpret the performance. Overall, the proposed report structuring method generates a hierarchical data structure that includes common features at the first level/stage and refined features at the second. To assess the proposed method, a set of evaluations was conducted; the results show that it is more promising than popular neural networks such as BERT (Bidirectional Encoder Representations from Transformers) in terms of effectiveness and efficiency.
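
The two-stage idea in the abstract can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration and is not the authors' implementation: the toy reports, the labels, the stopword list, the use of a chi-square score for feature selection, and the Bernoulli naive Bayes classifier are all stand-ins, since the abstract does not name the specific techniques. The sketch also produces only a single flat label and does not reproduce the hierarchical two-level structure described above.

```python
import math
import re

# Hypothetical toy reports and labels; not data from the paper.
REPORTS = [
    ("hypervascular tumor in segment 8 with washout", "tumor"),
    ("enhancing tumor at right lobe with washout", "tumor"),
    ("simple cyst in left lobe no enhancement", "cyst"),
    ("small cyst at segment 4 no enhancement", "cyst"),
]
STOPWORDS = {"in", "at", "with"}  # assumed stopword list

def candidate_keywords(text):
    """Stage 1a: lowercase word tokens, minus stopwords, as candidate keywords."""
    return set(re.findall(r"[a-z]+", text.lower())) - STOPWORDS

def chi2_score(word, docs, label):
    """Chi-square statistic of the 2x2 table: word presence vs. label membership."""
    a = sum(1 for kws, y in docs if word in kws and y == label)
    b = sum(1 for kws, y in docs if word in kws and y != label)
    c = sum(1 for kws, y in docs if word not in kws and y == label)
    d = sum(1 for kws, y in docs if word not in kws and y != label)
    n = a + b + c + d
    denom = (a + b) * (c + d) * (a + c) * (b + d)
    return n * (a * d - b * c) ** 2 / denom if denom else 0.0

def select_features(docs, k):
    """Stage 1b: keep the k keywords with the highest chi-square score."""
    labels = {y for _, y in docs}
    vocab = set().union(*(kws for kws, _ in docs))
    scores = {w: max(chi2_score(w, docs, y) for y in labels) for w in vocab}
    # Ties are broken alphabetically so the result is deterministic.
    return sorted(vocab, key=lambda w: (-scores[w], w))[:k]

def train_nb(docs, features):
    """Stage 2: Bernoulli naive Bayes over the selected feature keywords."""
    model = {}
    for label in sorted({y for _, y in docs}):
        members = [kws for kws, y in docs if y == label]
        prior = len(members) / len(docs)
        # Laplace-smoothed probability that each feature keyword is present.
        present = {f: (sum(f in kws for kws in members) + 1) / (len(members) + 2)
                   for f in features}
        model[label] = (prior, present)
    return model

def predict(model, features, text):
    """Label a new report with the class of highest log-posterior."""
    kws = candidate_keywords(text)

    def log_post(label):
        prior, present = model[label]
        return math.log(prior) + sum(
            math.log(present[f] if f in kws else 1 - present[f])
            for f in features)

    return max(model, key=log_post)

docs = [(candidate_keywords(text), label) for text, label in REPORTS]
features = select_features(docs, k=5)
model = train_nb(docs, features)
print(features)
print(predict(model, features, "tumor with washout in segment 6"))
```

Because the classifier only sees the selected keywords, the `features` list and the per-keyword probabilities stored in `model` can be inspected directly, which is a rough analogue of the "factor keywords" used for interpretation in the abstract.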

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639
