IEEE Access (Jan 2019)
Context Embedding Based on Bi-LSTM in Semi-Supervised Biomedical Word Sense Disambiguation
Abstract
Word sense disambiguation (WSD) is a fundamental task in natural language processing (NLP) whose purpose is to choose the correct sense of an ambiguous word according to its context. In biomedical WSD, recent research has represented the sense of a context with context embeddings built by concatenating or averaging word embeddings. These simple linear operations over neighboring words ignore word-order information and may weaken the semantic representations such models produce. In this paper, we present a novel language model based on Bi-LSTM that embeds an entire sentential context in continuous space while taking word order into account. We demonstrate that our language model can generate high-quality context representations in an unsupervised manner. Unlike previous work that directly predicts word senses, our model classifies a word in context by building sense embeddings, which helps us set new state-of-the-art results (macro/micro average) on both the MSH and NLM datasets. In addition, with the same language model, we propose a semi-supervised learning method based on label propagation (LP) to reduce the dependence on labeled biomedical data. The results show that this method nearly matches the state-of-the-art results produced by our Bi-LSTM model even when the labeled training data are reduced.
Keywords