Telfor Journal (Jun 2014)

Evaluation and Classification of Syntax Usage in Determining Short-Text Semantic Similarity

  • V. Batanović
  • D. Bojić

Journal volume & issue
Vol. 6, no. 1
pp. 64 – 68

Abstract

This paper outlines and categorizes ways of using syntactic information in a number of algorithms for determining the semantic similarity of short texts. We consider the use of word order information, part-of-speech tagging, parsing, and semantic role labeling. We analyze and evaluate the effects of syntax usage on algorithm performance using the results of a paraphrase detection test on the Microsoft Research Paraphrase Corpus. We also propose a new classification of algorithms based on their applicability to languages with scarce natural language processing tools.

Keywords