Proceedings of the XXth Conference of Open Innovations Association FRUCT (May 2021)

Incoherent Sentence Detection in Scientific Articles in Russian and English

  • Mark Zaslavskiy,
  • Quang Huy Nguyen

DOI
https://doi.org/10.23919/FRUCT52173.2021.9435478
Journal volume & issue
Vol. 29, no. 1
pp. 267 – 273

Abstract

Read online

Text coherence is an important factor that often gets overlooked by novice writers. Incoherence in academic writing directly affects both the reading experience and the comprehensibility of the articles. This paper introduces and describes a method for detecting incoherence in academic writing. The method utilized a fine-tuned BERT model in conjunction with graph clustering algorithm. We benchmarked the method against baseline models on Discordant Sentence Detection using Timetravel dataset, and the results showed that the proposed method outperformed baseline models in terms of F1-score. Afterward, the method was tested on a corpora of Russian and English scientific articles in order to assess its proficiency in Narrative Incoherence Detection when applied on the papers main research subject: academic writing. The papers proposed method achieved a decent F1 in Discordant Sentence Detection. For future work, our biggest goal is to further refine the method and be able to effectively deploy it on existing systems used for reviewing academic corpora.

Keywords