Data governance and Gensini score automatic calculation for coronary angiography with deep-learning-based natural language extraction

Feng Li; Mingfeng Jiang; Hongzeng Xu; Yi Chen; Feng Chen; Wei Nie; Li Wang

doi:10.3934/mbe.2024180

Mathematical Biosciences and Engineering (Feb 2024)

Affiliations

Feng Li: 1. School of Information and Electronic Engineering, Zhejiang Gongshang University, Hangzhou 310018, China
Mingfeng Jiang: 1. School of Information and Electronic Engineering, Zhejiang Gongshang University, Hangzhou 310018, China
Hongzeng Xu: 2. Department of Cardiology, The People's Hospital of Liaoning Province, Liaoning, Shenyang 110011, China
Yi Chen: 1. School of Information and Electronic Engineering, Zhejiang Gongshang University, Hangzhou 310018, China
Feng Chen: 1. School of Information and Electronic Engineering, Zhejiang Gongshang University, Hangzhou 310018, China
Wei Nie: 1. School of Information and Electronic Engineering, Zhejiang Gongshang University, Hangzhou 310018, China
Li Wang: 3. College of Marine Electrical Engineering, Dalian Maritime University, Dalian 116026, China

DOI: https://doi.org/10.3934/mbe.2024180
Journal volume & issue: Vol. 21, no. 3, pp. 4085–4103

Abstract

With the widespread adoption of electronic health records, the volume of stored medical data continues to grow. Clinical data, often in the form of semi-structured or unstructured electronic medical records (EMRs), contain rich patient information. However, because physicians write these records in natural language, traditional methods such as dictionaries, rule matching, and machine learning fall short of clinical standards when extracting information from the unstructured text. In this paper, a novel deep-learning-based natural language extraction method is proposed to overcome current shortcomings in data governance and automatic Gensini score calculation for coronary angiography. Bidirectional Encoder Representations from Transformers (BERT), a pre-trained model with strong text feature representation capabilities, is employed as the feature representation layer. It is combined with bidirectional long short-term memory (BiLSTM) and conditional random field (CRF) models to extract both global and local features from the text. The model was evaluated on a dataset from a hospital in China and compared with another model to validate its practical advantages. Hence, the BiLSTM-CRF model was employed to automatically extract relevant coronary angiogram information from EMR texts. The achieved F1 score was 91.19, which is approximately 0.87 higher than that of the BERT-BiLSTM-CRF model.
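
The abstract reports an F1 score but does not say how it was computed. The sketch below shows one common convention for sequence-labelling models of this kind: entity-level F1 over BIO tags, where a predicted entity counts only if its type and boundaries both match the gold annotation exactly. The tag names (`VESSEL`, `STENOSIS`) and the function names are hypothetical illustrations, not taken from the paper, and the paper's evaluation may differ.

```python
from typing import List, Set, Tuple

# (entity type, start index, end index exclusive)
Span = Tuple[str, int, int]


def bio_spans(tags: List[str]) -> Set[Span]:
    """Collect entity spans from one BIO tag sequence."""
    spans: Set[Span] = set()
    start, label = None, None
    # A trailing "O" sentinel closes any span still open at the end.
    for i, tag in enumerate(tags + ["O"]):
        continues = tag.startswith("I-") and label == tag[2:]
        if continues:
            continue
        if label is not None:
            spans.add((label, start, i))
        if tag.startswith("B-"):
            start, label = i, tag[2:]
        else:
            # "O", or a stray "I-" with no matching open span, starts nothing.
            start, label = None, None
    return spans


def entity_f1(gold: List[str], pred: List[str]) -> Tuple[float, float, float]:
    """Return (precision, recall, F1) using exact span matching."""
    gold_spans, pred_spans = bio_spans(gold), bio_spans(pred)
    true_pos = len(gold_spans & pred_spans)
    precision = true_pos / len(pred_spans) if pred_spans else 0.0
    recall = true_pos / len(gold_spans) if gold_spans else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


# Hypothetical tags for a five-token phrase from an angiography report.
gold = ["B-VESSEL", "I-VESSEL", "O", "B-STENOSIS", "I-STENOSIS"]
pred = ["B-VESSEL", "I-VESSEL", "O", "B-STENOSIS", "O"]
print(entity_f1(gold, pred))
```

In this example the predicted stenosis entity is one token short, so under exact matching it counts as both a false positive and a false negative, and only the vessel entity is scored as correct.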

Published in Mathematical Biosciences and Engineering

ISSN: 1551-0018 (Online)
Publisher: AIMS Press
Country of publisher: United States
LCC subjects: Technology: Chemical technology: Biotechnology; Science: Mathematics
Website: https://www.aimspress.com/journal/MBE
