ET-Network: A novel efficient transformer deep learning model for automated Urdu handwritten text recognition.

Ameer Hamza; Shengbing Ren; Usman Saeed

doi:10.1371/journal.pone.0302590

PLoS ONE (Jan 2024)

ET-Network: A novel efficient transformer deep learning model for automated Urdu handwritten text recognition.

Ameer Hamza,
Shengbing Ren,
Usman Saeed

Affiliations

Ameer Hamza
Shengbing Ren
Usman Saeed

DOI: https://doi.org/10.1371/journal.pone.0302590
Journal volume & issue: Vol. 19, no. 5
p. e0302590

Abstract

Read online

Automatic Urdu handwritten text recognition is a challenging task in the OCR industry. Unlike printed text, Urdu handwriting lacks a uniform font and structure. This lack of uniformity causes data inconsistencies and recognition issues. Different writing styles, cursive scripts, and limited data make Urdu text recognition a complicated task. Major languages, such as English, have experienced advances in automated recognition, whereas low-resource languages, such as Urdu, still lag. Transformer-based models are promising for automated recognition in high- and low-resource languages such as Urdu. This paper presents a transformer-based method called ET-Network that integrates self-attention into EfficientNet for feature extraction and a transformer for language modeling. The use of self-attention layers in EfficientNet helps to extract global and local features that capture long-range dependencies. These features proceeded into a vanilla transformer to generate text, and a prefix beam search is used for the finest outcome. NUST-UHWR, UPTI2.0, and MMU-OCR-21 are three datasets used to train and test the ET Network for a handwritten Urdu script. The ET-Network improved the character error rate by 4% and the word error rate by 1.55%, while establishing a new state-of-the-art character error rate of 5.27% and a word error rate of 19.09% for Urdu handwritten text.

Published in PLoS ONE

ISSN: 1932-6203 (Online)
Publisher: Public Library of Science (PLoS)
Country of publisher: United States
LCC subjects: Medicine; Science
Website: https://journals.plos.org/plosone/

About the journal