Proceedings of the XXth Conference of Open Innovations Association FRUCT (May 2023)

Web Tool for Automated Document Formatting Verification

  • Andrei Berezhkov,
  • Viacheslav Martsinkevich

DOI
https://doi.org/10.5281/zenodo.8005373
Journal volume & issue
Vol. 33, no. 2
pp. 375 – 381

Abstract

Read online

The article discusses the algorithms for document compilation check in compliance with standards and regulatory documents. The paper presents various approaches and methods of extraction of structural elements and their properties from PDF, ODT, DOCX documents; reveals machine-learning opportunities in terms of class extraction and further structural elements classification. The machine-learning methods are also used to provide recommendations on possible errors. The article introduces automated document formatting verification service architecture.

Keywords