IEEE Access (Jan 2024)

Aspect Extraction in Domain Lexicon Generation: A New Frequency-Based Approach

  • Tasnim M. A. Zayet,
  • Maizatul Akmar Ismail,
  • Kasturi Dewi Varathan

DOI
https://doi.org/10.1109/ACCESS.2024.3442930
Journal volume & issue
Vol. 12
pp. 138972 – 138984

Abstract

Read online

Domain sentimental lexicon building become an attractive field in recent years. This is due to the increased number of users’ generated data through the internet besides the different sentiments of opinion words in different contexts. Domain lexicons mainly consist of opinion pairs and their associated sentiment. Any opinion pair is formed by a domain word and one of its associated opinion words. Therefore, to generate a domain lexicon from a domain corpus, domain word extraction is needed with their associated opinion words. One of the traditional approaches is frequency-based approaches. However, the ambiguity problem is a big concern of these approaches. This paper introduced a frequency-based equation that considers the context of the words for domain word extraction. The equation was tested on five Amazon reviews datasets and it proved its efficiency over other used frequency-based equations in terms of recall and precision. Therefore, more related lexicons to the domains were generated.

Keywords