Communications in Science and Technology (Jul 2020)

Comparison of text-image fusion models for high school diploma certificate classification

  • Chandra Ramadhan Atmaja Perdana,
  • Hanung Adi Nugroho,
  • Igi Ardiyanto

DOI
https://doi.org/10.21924/cst.5.1.2020.172
Journal volume & issue
Vol. 5, no. 1
pp. 5 – 9

Abstract

Read online

File scanned documents are commonly used in this digital era. Text and image extraction of scanned documents play an important role in acquiring information. A document may contain both texts and images. A combination of text-image classification has been previously investigated. The dataset used for those research works the text were digitally provided. In this research, we used a dataset of high school diploma certificate, which the text must be acquired using optical character recognition (OCR) method. There were two categories for this high school diploma certificate, each category has three classes. We used convolutional neural network for both text and image classifications. We then combined those two models by using adaptive fusion model and weight fusion model to find the best fusion model. We come into conclusion that the performance of weight fusion model which is 0.927 is better than that of adaptive fusion model with 0.892.

Keywords