Self-Writer: Clusterable Embedding Based Self-Supervised Writer Recognition from Unlabeled Data

Zabir Mohammad; Muhammad Mohsin Kabir; Muhammad Mostafa Monowar; Md Abdul Hamid; Muhammad Firoz Mridha

doi:10.3390/math10244796

Mathematics (Dec 2022)

Self-Writer: Clusterable Embedding Based Self-Supervised Writer Recognition from Unlabeled Data

Zabir Mohammad,
Muhammad Mohsin Kabir,
Muhammad Mostafa Monowar,
Md Abdul Hamid,
Muhammad Firoz Mridha

Affiliations

Zabir Mohammad: Department of Computer Science and Engineering, Bangladesh University of Business and Technology, Dhaka 1216, Bangladesh
Muhammad Mohsin Kabir: Department of Computer Science and Engineering, Bangladesh University of Business and Technology, Dhaka 1216, Bangladesh
Muhammad Mostafa Monowar: Department of Information Technology, Faculty of Computing and Information Technology, King AbdulAziz University, Jeddah 21589, Saudi Arabia
Md Abdul Hamid: Department of Information Technology, Faculty of Computing and Information Technology, King AbdulAziz University, Jeddah 21589, Saudi Arabia
Muhammad Firoz Mridha: Department of Computer Science, American International University-Bangladesh (AIUB), Dhaka 1229, Bangladesh

DOI: https://doi.org/10.3390/math10244796
Journal volume & issue: Vol. 10, no. 24
p. 4796

Abstract

Read online

Writer recognition based on a small amount of handwritten text is one of the most challenging deep learning problems because of the implicit characteristics of handwriting styles. In a deep convolutional neural network, writer recognition based on supervised learning has shown great success. These supervised methods typically require a lot of annotated data. However, collecting annotated data is expensive. Although unsupervised writer recognition methods may address data annotation issues significantly, they often fail to capture sufficient feature relationships and usually perform less efficiently than supervised learning methods. Self-supervised learning may solve the unlabeled dataset issue and train the unsupervised datasets in a supervised manner. This paper introduces Self-Writer, a self-supervised writer recognition approach dealing with unlabeled data. The proposed scheme generates clusterable embeddings from a small fixed-length image frame such as a text block. The training strategy presumes that a small image frame of handwritten text should include the writer’s handwriting characteristics. We construct pairwise constraints and nongenerative augmentation to train Siamese architecture to generate embeddings depending on such an assumption. Self-Writer is evaluated on the two most widely used datasets, IAM and CVL, on pairwise and triplet architecture. We find Self-Writer to be convincing in achieving satisfactory performance using pairwise architectures.

Published in Mathematics

ISSN: 2227-7390 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Science: Mathematics
Website: http://www.mdpi.com/journal/mathematics

About the journal

Abstract

Keywords