Deep learning for ancient scripts recognition: A CapsNet-LSTM based approach

Aditi Moudgil; Saravjeet Singh; Shalli Rani; Mohammad Shabaz; Shtwai Alsubai

Alexandria Engineering Journal (Sep 2024)

Deep learning for ancient scripts recognition: A CapsNet-LSTM based approach

Aditi Moudgil,
Saravjeet Singh,
Shalli Rani,
Mohammad Shabaz,
Shtwai Alsubai

Affiliations

Aditi Moudgil: Chitkara University Institute of Engineering and Technology, Chitkara University, Rajpura, 140401, Punjab, India
Saravjeet Singh: Chitkara University Institute of Engineering and Technology, Chitkara University, Rajpura, 140401, Punjab, India
Shalli Rani: Chitkara University Institute of Engineering and Technology, Chitkara University, Rajpura, 140401, Punjab, India; Corresponding author.
Mohammad Shabaz: Model Institute of Engineering and Technology, Jammu, J&K, India
Shtwai Alsubai: Department of Computer Science, College of Computer Engineering and Sciences in Al-Kharj, Prince Sattam bin Abdulaziz University, P.O. Box 151 Al-Kharj 11942, Saudi Arabia

Journal volume & issue: Vol. 103
pp. 169 – 179

Abstract

Read online

Efficient character recognition in ancient handwritten Devanagari documents is crucial for societal advancements. Challenges such as overlapping characters, missing headlines, and over-inked stains further complicate the recognition process. In response, we propose a Capsule Network (CapsNet) with LSTM to address hierarchical temporal dependencies in Devanagari scripts, following initial implementation of a simple CNN. We also explored a combined CNN+LSTM architecture for character recognition, leveraging CNN’s feature extraction capabilities with LSTM’s sequential modeling to handle temporal dependencies in Devanagari scripts. Our experimentation involved a dataset of 10,825 characters from ancient Devanagari manuscripts, encompassing basic characters, modifiers, and conjuncts, classified into 399 classes. Testing various training–testing ratios (9:1, 8:2, and 7:3), we visually and statistically evaluated the experimental data, demonstrating the superiority of CapsNet and LSTM in handling these challenges. We calculated recognition accuracy, precision, and recall values, with CapsNet achieving a maximum accuracy of 95.98% after 30 epochs. This research underscores the effectiveness of CapsNet and LSTM in advancing character recognition for ancient Devanagari manuscripts.

Published in Alexandria Engineering Journal

ISSN: 1110-0168 (Print); 2090-2670 (Online)
Publisher: Elsevier
Country of publisher: Egypt
LCC subjects: Technology: Engineering (General). Civil engineering (General)
Website: http://www.journals.elsevier.com/alexandria-engineering-journal/

About the journal

Abstract

Keywords