A Lightweight Visual Font Style Recognition With Quantized Convolutional Autoencoder

Moshiur Rahman Tonmoy; Abdul Fattah Rakib; Rashik Rahman; Md. Akhtaruzzaman Adnan; M. F. Mridha; Jie Huang; Jungpil Shin

doi:10.1109/OJCS.2024.3378709

IEEE Open Journal of the Computer Society (Jan 2024)

A Lightweight Visual Font Style Recognition With Quantized Convolutional Autoencoder

Moshiur Rahman Tonmoy,
Abdul Fattah Rakib,
Rashik Rahman,
Md. Akhtaruzzaman Adnan,
M. F. Mridha,
Jie Huang,
Jungpil Shin

Affiliations

Moshiur Rahman Tonmoy: ORCiD; Advanced Machine Intelligence Research Lab, Dhaka, Bangladesh
Abdul Fattah Rakib: ORCiD; Department of Computer Science and Engineering, University of Asia Pacific, Dhaka, Bangladesh
Rashik Rahman: ORCiD; Department of Computer Science and Engineering, University of Asia Pacific, Dhaka, Bangladesh
Md. Akhtaruzzaman Adnan: ORCiD; Department of Computer Science and Engineering, University of Asia Pacific, Dhaka, Bangladesh
M. F. Mridha: ORCiD; Department of Computer Science, American International University-Bangladesh, Dhaka, Bangladesh
Jie Huang: School of Computer Science and Engineering, The University of Aizu, Aizuwakamatsu, Japan
Jungpil Shin: ORCiD; School of Computer Science and Engineering, The University of Aizu, Aizuwakamatsu, Japan

DOI: https://doi.org/10.1109/OJCS.2024.3378709
Journal volume & issue: Vol. 5
pp. 120 – 130

Abstract

Read online

Font style recognition plays a vital role in the field of computer vision, particularly in document and pattern analysis, and image processing. In the industry context, this recognition of font styles holds immense importance for professionals such as graphic designers, front-end developers, and UI-UX developers. In recent times, font style recognition using Computer Vision has made significant progress, especially in English. Very few works have been done for other languages as well. However, the existing models are computationally costly, time-consuming, and not diversified. In this work, we propose a state-of-the-art model to recognize Bangla fonts from images using a quantized Convolutional Autoencoder (Q-CAE) approach. The compressed model takes around 58 KB of memory only which makes it suitable for not only high-end but also low-end computational edge devices. We have also created a synthetic data set consisting of 10 distinct Bangla font styles and a total of 60,000 images for conducting this study as no dedicated dataset is available publicly. Experimental outcomes demonstrate that the proposed method can perform better than existing methods, gaining an overall accuracy of 99.95% without quantization and 99.85% after quantization.

Published in IEEE Open Journal of the Computer Society

ISSN: 2644-1268 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science; Technology: Technology (General): Industrial engineering. Management engineering: Information technology
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=8782664

About the journal

Abstract

Keywords