IEEE Access (Jan 2021)

Domain Specific Entity Recognition With Semantic-Based Deep Learning Approach

  • Quoc Hung Ngo,
  • Tahar Kechadi,
  • Nhien-An Le-Khac

DOI
https://doi.org/10.1109/ACCESS.2021.3128178
Journal volume & issue
Vol. 9
pp. 152892 – 152902

Abstract

In digital agriculture, agronomists are required to make timely, profitable, precise, and actionable decisions based on knowledge and experience. The input can be cultivation data and related agricultural data, including text data such as news articles, business news, policy documents, or farming notes. To process this kind of data, identifying agricultural entities in the text is necessary for updating agriculture-oriented news. This task is called Agriculture Entity Recognition (AGER), a kind of Named Entity Recognition (NER) task in the agriculture domain. However, there are very few approaches to AGER because of the lack of a consistent tagset and resources. In this study, we develop a new tagset for AGER that covers popular concepts in agriculture, and we propose a two-stage process for the task. In the first stage, we use semantic-based approaches to detect agricultural entities and semi-automatically build an annotated corpus of agricultural entities. In the second stage, we identify agricultural entities in plain text using a deep learning approach trained on the annotated corpus. For evaluation and validation, we build an annotated agriculture corpus and demonstrate the efficiency and robustness of our approach.
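
As a rough illustration of the first stage, the sketch below tags tokens by longest-match lookup in a small lexicon and emits BIO labels, which is one common way to bootstrap an annotated corpus for a sequence tagger. It is only a minimal sketch under stated assumptions: the abstract does not specify the semantic resources or the tagset, so the `LEXICON` entries, the `CROP` and `DISEASE` labels, and the `bio_tag` function are all hypothetical and are not the authors' method.

```python
from typing import Dict, List, Tuple

# Hypothetical lexicon mapping lowercased token sequences to entity labels.
# These labels are illustrative only; they are NOT the tagset from the paper.
LEXICON: Dict[Tuple[str, ...], str] = {
    ("winter", "wheat"): "CROP",
    ("wheat",): "CROP",
    ("powdery", "mildew"): "DISEASE",
}

# Longest phrase in the lexicon, used to bound the match window.
MAX_LEN = max(len(phrase) for phrase in LEXICON)


def bio_tag(tokens: List[str]) -> List[str]:
    """Assign BIO tags by greedy longest-match lookup in LEXICON."""
    tags = ["O"] * len(tokens)
    lowered = [t.lower() for t in tokens]
    i = 0
    while i < len(tokens):
        matched = False
        # Try the longest candidate span first, then shorter ones.
        for n in range(min(MAX_LEN, len(tokens) - i), 0, -1):
            label = LEXICON.get(tuple(lowered[i:i + n]))
            if label is not None:
                tags[i] = "B-" + label
                for j in range(i + 1, i + n):
                    tags[j] = "I-" + label
                i += n
                matched = True
                break
        if not matched:
            i += 1
    return tags


tokens = "Powdery mildew was reported on winter wheat".split()
for token, tag in zip(tokens, bio_tag(tokens)):
    print(f"{token}\t{tag}")
```

Token and tag pairs produced this way could, after manual review, serve as training data for the second-stage deep learning tagger. The review step matters because pure lexicon matching misses unseen entities and ignores context.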

Keywords