DKNet: Deep Kuzushiji Characters Recognition Network

Dilbag Singh; C. V. Aravinda; Manjit Kaur; Meng Lin; Jyothi Shetty; Vikram Raju Reddicherla; Heung-No Lee

doi:10.1109/ACCESS.2022.3191429

IEEE Access (Jan 2022)

DKNet: Deep Kuzushiji Characters Recognition Network

Dilbag Singh,
C. V. Aravinda,
Manjit Kaur,
Meng Lin,
Jyothi Shetty,
Vikram Raju Reddicherla,
Heung-No Lee

Affiliations

Dilbag Singh: ORCiD; School of Electrical Engineering and Computer Science, Gwangju Institute of Science and Technology, Gwangju, South Korea
C. V. Aravinda: ORCiD; N.M.A.M. Institute of Technology, Nitte, Karkala, India
Manjit Kaur: ORCiD; School of Electrical Engineering and Computer Science, Gwangju Institute of Science and Technology, Gwangju, South Korea
Meng Lin: ORCiD; Department of Electronic and Computer Engineering, The Graduate School of Science and Engineering, Ritsumeikan University, Kusatsu, Shiga, Japan
Jyothi Shetty: ORCiD; N.M.A.M. Institute of Technology, Nitte, Karkala, India
Vikram Raju Reddicherla: ORCiD; N.M.A.M. Institute of Technology, Nitte, Karkala, India
Heung-No Lee: ORCiD; School of Electrical Engineering and Computer Science, Gwangju Institute of Science and Technology, Gwangju, South Korea

DOI: https://doi.org/10.1109/ACCESS.2022.3191429
Journal volume & issue: Vol. 10
pp. 75872 – 75883

Abstract

Read online

Kuzushiji, a cursive writing style, had been extensively utilized in Japan for over a thousand years starting from the $8^{th}$ century. In 1900, Kuzushiji was not included in regular school curricula due to the change in the Japanese writing system. Nowadays Japanese natives are unable to read historical books that were written using Kuzushiji language. Therefore, libraries and museums have decided to build digital copies of the documents and books that were written in Kuzushiji language. Due to a limited number of trained experts, researchers have utilized machine and deep learning models to convert historical documents and books into a modern script that can be easily read by human beings. However, the existing deep learning techniques suffer from over-fitting and gradient vanishing problems. To overcome these problems, an efficient deep Kuzushiji characters recognition network (DKNet) is proposed. Initially, to remove noise from the training images, a trilateral joint filter is applied. Contrast limited adaptive histogram equalization (CLAHE) is then applied to enhance the visibility of filtered images. Thereafter, a pre-trained MobileNet is utilized to extract the features of Kuzushiji characters. MobileNet’s final layers are removed, including the fully connected layer and softmax. The flatten layer is then applied to the input. A fully connected classification layer is then used with Rectified linear units (ReLUs) and dropouts. Dropouts are used to generalize the model, thus preventing the over-fitting problem. Finally, the softmax activation function is employed to provide the recognition results. To test the proposed model, actual documents are first segmented by using the proposed Maximally stable extremal regions (MSERs) and convexhull-based segmentation approach. Segmented characters are then recognized using the trained DKNet. Extensive comparative analyses reveal that DKNet achieves better performance than the competitive models in terms of various performance metrics. An efficient Application Programming Interface (API) is also designed for Japanese Kuzushiji ancient heritage character recognition to help the end-users.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords