Study on Entity Extraction Method for Pharmaceutical Instructions Based on Pretrained Models

CHEN Zhongyong, HUANG Yongsheng, ZHANG Min, JIANG Ming

doi:10.3778/j.issn.1673-9418.2304078

Jisuanji kexue yu tansuo (Jul 2024)

Study on Entity Extraction Method for Pharmaceutical Instructions Based on Pretrained Models

CHEN Zhongyong, HUANG Yongsheng, ZHANG Min, JIANG Ming

Affiliations

CHEN Zhongyong, HUANG Yongsheng, ZHANG Min, JIANG Ming: 1. Zhejiang Pharmaceutical Information Publicity and Development Service Center, Hangzhou 310061, China 2. School of Computer Science, Hangzhou Dianzi University, Hangzhou 310018, China

DOI: https://doi.org/10.3778/j.issn.1673-9418.2304078
Journal volume & issue: Vol. 18, no. 7
pp. 1911 – 1922

Abstract

Read online

The extraction of medical entities from drug instructions provides fundamental data for the intelligent retrieval of medication information and the construction of medical knowledge graphs, with remarkable research significance and practical value. However, the heterogeneity of medical entities in drug instructions for treating different diseases poses challenges in model training, which requires a large number of annotated samples. To address this issue, a “large model + small model” design approach is used in this research. Specifically, this research proposes a part-label named entity recognition model based on a pre-trained model, which first employs a pre-trained language model fine-tuned on a small number of samples to extract partial entities from drug instructions, and then utilizes a Transformer- based part-label model to further optimize the entity extraction results. The part-label model encodes the input text, identified partial entities, and entity labels using a planar lattice structure, extracts feature representations using Transformer, and predicts entity labels through a conditional random fields (CRF) layer. To reduce the need for annotated training data, a sample data augmentation method is proposed using entity masking strategy on labeled samples to train the part-label model. Experimental results validate the feasibility of the “large model + small model” approach in medical entity extraction, with precision (P), recall (R), and F1 score of 85.0%, 86.1%, and 85.6%, respectively, demonstrating superior performance compared with other learning methods.

named entity recognition (ner); pre-trained models; medical entity extraction; transformer

Published in Jisuanji kexue yu tansuo

ISSN: 1673-9418 (Print)
Publisher: Journal of Computer Engineering and Applications Beijing Co., Ltd., Science Press
Country of publisher: China
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: http://fcst.ceaj.org

About the journal

Abstract

Keywords