MATEC Web of Conferences (Jan 2018)

The implementation of web service based text preprocessing to measure Indonesian student thesis similarity level

  • Watequlis Syaifudin Yan,
  • Saputra Pramana Yoga,
  • Puspitasari Dwi

DOI
https://doi.org/10.1051/matecconf/201819703019
Journal volume & issue
Vol. 197
p. 03019

Abstract

Read online

The plagiarism of scientific work, especially undergraduate thesis, mostly happened in the college. In this research we used text mining, a new method which can be used to do the checking procedure, to obtain specific pattern of the document. After obtaining the document pattern, we compare the pattern with another document pattern. If the level of pattern similarity is high, it can be suspected as plagiarism. This paper will explain the development of the text preprocessing, a part of text mining. We choosed Nazief and Adriani Algorithm as a text preprocessing algorithm for this research. This research will result a text preprocessing web service. The web service is expected to be used for further development of text mining.