IEEE Access (Jan 2020)

A Survey of Named-Entity Recognition Methods for Food Information Extraction

  • Gorjan Popovski,
  • Barbara Korousic Seljak,
  • Tome Eftimov

DOI
https://doi.org/10.1109/ACCESS.2020.2973502
Journal volume & issue
Vol. 8
pp. 31586 – 31594

Abstract

Read online

As great amounts of food-related information is presented in the form of heterogeneous textual data, computer-based methods are useful to automatically extract such information. One way to do this is to utilize Named-Entity Recognition (NER) methods that are broadly used in computer science for information extraction. Despite the existence of numerous and well-versed NER methods in the biomedical domain, the domain of food science still remains scarcely resourced. In this paper, we provide an overview and a comparison of named-entity recognition methods in the food domain, which can be used for automated extraction of food information from text. Four methods are discussed: FoodIE, NCBO (SNOMED CT), NCBO (OntoFood), and NCBO (FoodON). We compare them using a benchmark data set that consists of 1000 manually annotated recipes initially obtained from Allrecipes, which is the largest social network focused on food. After analysing the results from the evaluation, it is evident that FoodIE obtains very promising results compared to the other food named-entity recognition methods taken into consideration.

Keywords