Applied Sciences (Sep 2020)

Unsupervised Predominant Sense Detection and Its Application to Text Classification

  • Attaporn Wangpoonsarp,
  • Kazuya Shimura,
  • Fumiyo Fukumoto

DOI
https://doi.org/10.3390/app10176052
Journal volume & issue
Vol. 10, no. 17
p. 6052

Abstract

Read online

This paper focuses on the domain-specific senses of words and proposes a method for detecting predominant sense depending on each domain. Our Domain-Specific Senses (DSS) model is an unsupervised manner and detects predominant senses in each domain. We apply a simple Markov Random Walk (MRW) model to ranking senses for each domain. It decides the importance of a sense within a graph by using the similarity of senses. The similarity of senses is obtained by using distributional representations of words from gloss texts in the thesaurus. It can capture large semantic context and thus does not require manual annotation of sense-tagged data. We used the Reuters corpus and the WordNet in the experiments. We applied the results of domain-specific senses to text classification and examined how DSS affects the overall performance of the text classification task. We compared our DSS model with one of the word sense disambiguation techniques (WSD), Context2vec, and the results demonstrate our domain-specific sense approach gains 0.053 F1 improvement on average over the WSD approach.

Keywords