Pamukkale University Journal of Engineering Sciences (Apr 2021)

Certainty factor model in paraphrase detection

  • Katira Soleymanzadeh,
  • Tarık Kışla,
  • Bahar Karaoğlan,
  • Senem Kumova Metin

Journal volume & issue
Vol. 27, no. 2
pp. 139 – 150

Abstract

Read online

In this paper, we address the problem of uncertainty management in identification of paraphrase sentence pairs. Paraphrase sentences are simply sets/pairs of sentences that express the same facts and/or opinions using different words or order of words. We propose the use of certainty factor (CF) model in paraphrase detection. A set of succeeding paraphrase detection features (generic and distance based features) is built by filtering and this set is used as evidences in CF model. The CF model is evaluated by F1 and accuracy measures on Microsoft Research Paraphrase corpus. The results are compared to the well-known Bayesian reasoning. The experimental results showed that CF model is an alternating paraphrase detection method to Bayes model.

Keywords