Recognition of printed Urdu script in Nastaleeq font by using CNN-BiGRU-GRU Based Encoder-Decoder Framework

Sohail Zia; Muhammad Azhar; Bumshik Lee; Adnan Tahir; Javed Ferzund; Fozia Murtaza; Moazam Ali

Intelligent Systems with Applications (May 2023)

Recognition of printed Urdu script in Nastaleeq font by using CNN-BiGRU-GRU Based Encoder-Decoder Framework

Sohail Zia,
Muhammad Azhar,
Bumshik Lee,
Adnan Tahir,
Javed Ferzund,
Fozia Murtaza,
Moazam Ali

Affiliations

Sohail Zia: Department of Computer Science, Comsats University Islamabad, Sahiwal Campus, Sahiwal 57000, Pakistan
Muhammad Azhar: Department of Computer Science, Comsats University Islamabad, Sahiwal Campus, Sahiwal 57000, Pakistan; Department of Information and Communications Engineering, Chosun University, Gwangju 61452, Republic of Korea; Corresponding author
Bumshik Lee: Department of Information and Communications Engineering, Chosun University, Gwangju 61452, Republic of Korea
Adnan Tahir: Department of Computer Science & IT, The Islamia University of Bahalwapur, Rahim Yar Khan Campus, 64200, Punjab, Pakistan
Javed Ferzund: Department of Computer Science, Comsats University Islamabad, Sahiwal Campus, Sahiwal 57000, Pakistan
Fozia Murtaza: Department of Computer Science, Comsats University Islamabad, Sahiwal Campus, Sahiwal 57000, Pakistan
Moazam Ali: Department of Computer Science, Comsats University Islamabad, Sahiwal Campus, Sahiwal 57000, Pakistan

Journal volume & issue: Vol. 18
p. 200194

Abstract

Read online

RNN based Deep learning models has shown tremendous success in sequential and temporal data where the order is critical to achieve higher accuracy in context understanding. RNN family like LSTM, BLSTM, GRU, BiGRU etc. are the mainly used models in these kind of sequential tasks. RNN family based Encoder-decoder frameworks are widely used for the recognition of various languages scripts. However, in Urdu, very less research has been done especially with the deep learning models. The existing research work for printed Urdu recognition have shown that the current models only work for very basic sentences of Urdu but in case of complex words and sentences, these algorithms totally fail in terms of accuracy and the time complexity in identification of the Nastaleeq font writing. To identify printed Urdu text in images, we have proposed an encoder-decoder based hybrid deep learning approach with Convolutional Neural Network (CNN) for feature extraction part, bi-directional Gated Recurrent Unit network (BiGRU) as encoder and Gated Recurrent Unit network (GRU) as decoder. The CNN layer of the algorithm is used to obtain ligature features in Urdu, which are subsequently utilized by encoder (BiGRU) and decoder (GRU) to recognize the sentences by accurately distinguishing the characters and joiners. Experimental results have shown that our proposed CNN-BiGRU-GRU hybrid technique with specific hyper-parameter tuning performs well as compared to other state-of-the-art algorithms in terms of epochs (70 epochs as compared to 100 with BLSTM-LSTM based encoder decoder), 6 percent increase of Character Recognition Accuracy (86.95 percent as compared to 81.08 percent of BLSTM-LSTM), 10 percent increase of Word Recognition Accuracy (WRA) (89.48% as compared to 79.06 percent of BLSTM-LSTM) and less time complexity (18 seconds less than BLSTM-LSTM with same system configuration).

Published in Intelligent Systems with Applications

ISSN: 2667-3053 (Online)
Publisher: Elsevier
Country of publisher: United Kingdom
LCC subjects: Science: Science (General): Cybernetics; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://www.journals.elsevier.com/intelligent-systems-with-applications

About the journal

Abstract

Keywords