BMC Bioinformatics (Dec 2018)
Bidirectional long short-term memory with CRF for detecting biomedical event trigger in FastText semantic space
Abstract
Background: In biomedical information extraction, event extraction plays a crucial role. Biological events describe the dynamic effects of, or relationships between, biological entities such as proteins and genes. Event extraction is generally divided into trigger detection and argument recognition, and the performance of trigger detection directly affects the results of event extraction. Traditional approaches treat trigger detection as a classification task and rely on machine learning or rule-based methods that construct many hand-crafted features to improve classification results. Moreover, such classification models only recognize triggers composed of a single word; for multi-word triggers, the results are unsatisfactory.

Results: Our model is evaluated on the MLEE corpus. Using only the bidirectional LSTM and CRF model without other features, the F-score reaches about 78.08%. Comparing entity features with part-of-speech (POS) features, we find that entity features are more conducive to improving detection performance, with the F-score reaching about 80%. We also experiment on three further corpora (BioNLP 2009, BioNLP 2011, and BioNLP 2013) to verify the generalization of our model; on these corpora the F-scores exceed 60%, which is better than the comparative experiments.

Conclusions: The trigger recognition method based on the sequence annotation model does not require complex initial feature engineering and only needs a simple labeling scheme to complete training; therefore, it generalizes better than traditional models. Secondly, this method can identify multi-word triggers, thereby improving the F-score of trigger recognition. Thirdly, entity information has a crucial impact on trigger detection. Finally, the combination of character-level and word-level word embeddings provides more effective information for the model and is therefore key to the success of the experiment.
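To make the architecture described in the abstract concrete, the following is a minimal sketch (not the authors' code) of trigger detection framed as BIO sequence labeling with a bidirectional LSTM encoder and a CRF output layer, combining word-level embeddings (e.g., FastText-initialized) with character-level features. It assumes PyTorch and the third-party pytorch-crf package; all layer sizes and names are illustrative.

```python
import torch
import torch.nn as nn
from torchcrf import CRF  # pip install pytorch-crf

class BiLSTMCRFTagger(nn.Module):
    def __init__(self, word_vocab, char_vocab, num_tags,
                 word_dim=200, char_dim=30, char_hidden=25, hidden=100):
        super().__init__()
        self.word_emb = nn.Embedding(word_vocab, word_dim)   # could be initialized from FastText vectors
        self.char_emb = nn.Embedding(char_vocab, char_dim)
        self.char_lstm = nn.LSTM(char_dim, char_hidden, bidirectional=True, batch_first=True)
        self.lstm = nn.LSTM(word_dim + 2 * char_hidden, hidden,
                            bidirectional=True, batch_first=True)
        self.proj = nn.Linear(2 * hidden, num_tags)           # emission scores per BIO tag
        self.crf = CRF(num_tags, batch_first=True)            # models tag-transition constraints

    def _char_features(self, chars):
        # chars: (batch, seq_len, max_word_len) -> one character-level vector per token
        b, s, w = chars.shape
        out, _ = self.char_lstm(self.char_emb(chars.view(b * s, w)))
        return out[:, -1, :].view(b, s, -1)                   # last-timestep output per word (simplified)

    def emissions(self, words, chars):
        x = torch.cat([self.word_emb(words), self._char_features(chars)], dim=-1)
        h, _ = self.lstm(x)
        return self.proj(h)

    def loss(self, words, chars, tags, mask):
        return -self.crf(self.emissions(words, chars), tags, mask=mask, reduction='mean')

    def decode(self, words, chars, mask):
        # Viterbi decoding returns whole BIO spans, so multi-word triggers are recovered naturally
        return self.crf.decode(self.emissions(words, chars), mask=mask)
```

Because the CRF decodes an entire tag sequence rather than classifying each token independently, a trigger spanning several words is simply a contiguous B-/I- span in the output, which is the property the abstract highlights over per-token classification.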
Keywords