Transfer Learning for Named Entity Recognition in Financial and Biomedical Documents

Sumam Francis; Jordy Van Landeghem; Marie-Francine Moens

doi:10.3390/info10080248

Information (Jul 2019)

Transfer Learning for Named Entity Recognition in Financial and Biomedical Documents

Sumam Francis,
Jordy Van Landeghem,
Marie-Francine Moens

Affiliations

Sumam Francis: Department of Computer Science, Language Intelligence & Information Retrieval Lab (LIIR), 3000 KU Leuven, Belgium
Jordy Van Landeghem: Contract.fit, 1000 Brussels, Belgium
Marie-Francine Moens: Department of Computer Science, Language Intelligence & Information Retrieval Lab (LIIR), 3000 KU Leuven, Belgium

DOI: https://doi.org/10.3390/info10080248
Journal volume & issue: Vol. 10, no. 8
p. 248

Abstract

Read online

Recent deep learning approaches have shown promising results for named entity recognition (NER). A reasonable assumption for training robust deep learning models is that a sufficient amount of high-quality annotated training data is available. However, in many real-world scenarios, labeled training data is scarcely present. In this paper we consider two use cases: generic entity extraction from financial and from biomedical documents. First, we have developed a character based model for NER in financial documents and a word and character based model with attention for NER in biomedical documents. Further, we have analyzed how transfer learning addresses the problem of limited training data in a target domain. We demonstrate through experiments that NER models trained on labeled data from a source domain can be used as base models and then be fine-tuned with few labeled data for recognition of different named entity classes in a target domain. We also witness an interest in language models to improve NER as a way of coping with limited labeled data. The current most successful language model is BERT. Because of its success in state-of-the-art models we integrate representations based on BERT in our biomedical NER model along with word and character information. The results are compared with a state-of-the-art model applied on a benchmarking biomedical corpus.

Published in Information

ISSN: 2078-2489 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Technology (General): Industrial engineering. Management engineering: Information technology
Website: http://www.mdpi.com/journal/information/

About the journal

Abstract

Keywords