IEEE Access (Jan 2019)

Joint Entity and Relation Extraction Based on Reinforcement Learning

  • Xin Zhou,
  • Luping Liu,
  • Xiaodong Luo,
  • Haiqiang Chen,
  • Linbo Qing,
  • Xiaohai He

DOI
https://doi.org/10.1109/ACCESS.2019.2938986
Journal volume & issue
Vol. 7
pp. 125688 – 125699

Abstract

Read online

Information extraction is a crucial technology to construct a knowledge base. In this paper, a novel model was proposed to extract entities and relations from plain text. This model consists of two components: a joint network and a reinforcement learning agent. The joint network is designed end-to-end, which can extract entities and relations simultaneously. In the joint model, a new tagging scheme was adopted, then the entity and relation extraction can be modeled as a joint sequence tagging problem. To enhance the robustness of the model, we also introduced a reinforcement learning (RL) agent to remove the noisy data from the training dataset. The RL agent aims at determining whether a candidate instance should be removed from the training dataset or not. When the agent completes a selection process, the training dataset will be divided into two parts: clean data and noisy data. Then the joint network can be trained again on the clean dataset to generate a better model. To assess the validity of the model we proposed, extensive experiments were conducted on the New York Times dataset (NYT10 and NYT 11). The experimental results showed that the model we proposed is superior compared with the baselines, achieving the F1 value on NYT10 and NYT11 with 0.612 and 0.549, respectively.

Keywords