End-To-End Evaluation of Deep Learning Architectures for Off-Line Handwriting Writer Identification: A Comparative Study

Wirmanto Suteddy; Devi Aprianti Rimadhani Agustini; Anugrah Adiwilaga; Dastin Aryo Atmanto

doi:10.30630/joiv.7.1.1293

JOIV: International Journal on Informatics Visualization (Feb 2023)

Affiliations

Wirmanto Suteddy: Universitas Pendidikan Indonesia, Bandung, 40625, Indonesia
Devi Aprianti Rimadhani Agustini: Universitas Pendidikan Indonesia, Bandung, 40625, Indonesia
Anugrah Adiwilaga: Universitas Pendidikan Indonesia, Bandung, 40625, Indonesia
Dastin Aryo Atmanto: Universitas Pendidikan Indonesia, Bandung, 40625, Indonesia

DOI: https://doi.org/10.30630/joiv.7.1.1293
Journal volume & issue: Vol. 7, no. 1
pp. 178 – 185

Abstract

Identifying writers from their handwriting is particularly challenging for a machine, even though a person’s writing can serve as a distinguishing characteristic. Identification using handcrafted features has shown promising results, but handling the intra-class variability of authors’ handwriting still needs further development. Deep learning (DL) is now used in almost all computer vision tasks, and as a result, researchers have developed many DL architectures, each with its own methods. In addition, feature extraction, usually accomplished using handcrafted algorithms, can now be performed automatically by convolutional neural networks. Given the variety of DL methods, it is necessary to evaluate which architecture suits the problem at hand, namely classification for writer identification. This comparative study evaluated several DL architectures (VGG16, ResNet50, MobileNet, Xception, and EfficientNet) end-to-end to examine their advantages for offline handwriting writer identification on the IAM and CVL databases. The architectures were compared on training and validation accuracy, and ResNet50 had the highest training accuracy, 98.86%. However, Xception performed slightly better than all the other architectures in terms of the convergence gap for validation accuracy, which was 21.79% and 15.12% for IAM and CVL, respectively. Also, the smallest gaps between training and validation accuracy for the IAM and CVL datasets were 19.13% and 16.49%, respectively. These findings serve as a basis for DL architecture selection and point to overfitting as a problem for future work.
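The abstract compares architectures by the gap between training and validation accuracy. A minimal sketch of that comparison follows, assuming the gap is simply training accuracy minus validation accuracy in percentage points. The function names and the accuracy values are hypothetical placeholders for illustration and are not taken from the study.

```python
from typing import Dict, Tuple


def convergence_gap(train_acc: float, val_acc: float) -> float:
    """Return the train/validation accuracy gap in percentage points."""
    return round(train_acc - val_acc, 2)


def smallest_gap(results: Dict[str, Tuple[float, float]]) -> Tuple[str, float]:
    """Return the model with the smallest gap and the size of that gap.

    `results` maps a model name to (training accuracy, validation accuracy),
    both given in percent.
    """
    gaps = {name: convergence_gap(t, v) for name, (t, v) in results.items()}
    best = min(gaps, key=gaps.get)
    return best, gaps[best]


# Placeholder accuracies in percent. These are NOT results from the paper.
example = {
    "model_a": (99.0, 75.0),
    "model_b": (95.0, 80.0),
    "model_c": (97.0, 70.0),
}

print(smallest_gap(example))  # ('model_b', 15.0)
```

A large gap on this measure suggests overfitting: the model with the highest training accuracy is not necessarily the one that generalizes best, which is why a ranking by gap can differ from a ranking by training accuracy.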

Published in JOIV: International Journal on Informatics Visualization

ISSN: 2549-9610 (Print); 2549-9904 (Online)
Publisher: Politeknik Negeri Padang
Country of publisher: Indonesia
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science: Computer software
Website: http://joiv.org
