Journal of Applied Mathematics (Jan 2021)

Document Plagiarism Detection Using a New Concept Similarity in Formal Concept Analysis

  • Jirapond Muangprathub,
  • Siriwan Kajornkasirat,
  • Apirat Wanichsombat

DOI
https://doi.org/10.1155/2021/6662984
Journal volume & issue
Vol. 2021

Abstract

Read online

This paper proposes an algorithm for document plagiarism detection using the provided incremental knowledge construction with formal concept analysis (FCA). The incremental knowledge construction is presented to support document matching between the source document in storage and the suspect document. Thus, a new concept similarity measure is also proposed for retrieving formal concepts in the knowledge construction. The presented concept similarity employs appearance frequencies in the obtained knowledge construction. Our approach can be applied to retrieve relevant information because the obtained structure uses FCA in concept form that is definable by a conjunction of properties. This measure is mathematically proven to be a formal similarity metric. The performance of the proposed similarity measure is demonstrated in document plagiarism detection. Moreover, this paper provides an algorithm to build the information structure for document plagiarism detection. Thai text test collections are used for performance evaluation of the implemented web application.