ADOCRNet: A Deep Learning OCR for Arabic Documents Recognition

Lamia Mosbah; Ikram Moalla; Tarek M. Hamdani; Bilel Neji; Taha Beyrouthy; Adel M. Alimi

doi:10.1109/access.2024.3379530

IEEE Access (Jan 2024)

ADOCRNet: A Deep Learning OCR for Arabic Documents Recognition

Lamia Mosbah,
Ikram Moalla,
Tarek M. Hamdani,
Bilel Neji,
Taha Beyrouthy,
Adel M. Alimi

Affiliations

Lamia Mosbah: ORCiD; REsearch Groups in Intelligent Machines (ReGIM-Lab), National Engineering School of Sfax (ENIS), University of Sfax, Sfax, Tunisia
Ikram Moalla: ORCiD; REsearch Groups in Intelligent Machines (ReGIM-Lab), National Engineering School of Sfax (ENIS), University of Sfax, Sfax, Tunisia
Tarek M. Hamdani: ORCiD; REsearch Groups in Intelligent Machines (ReGIM-Lab), National Engineering School of Sfax (ENIS), University of Sfax, Sfax, Tunisia
Bilel Neji: ORCiD; College of Engineering and Technology, American University of the Middle East, Egaila, Kuwait
Taha Beyrouthy: ORCiD; College of Engineering and Technology, American University of the Middle East, Egaila, Kuwait
Adel M. Alimi: REsearch Groups in Intelligent Machines (ReGIM-Lab), National Engineering School of Sfax (ENIS), University of Sfax, Sfax, Tunisia

DOI: https://doi.org/10.1109/access.2024.3379530
Journal volume & issue: Vol. 12
pp. 55620 – 55631

Abstract

Read online

In recent years, Optical character recognition (OCR) has experienced a resurgence of interest especially for contemporary Arabic data. In fact, OCR development for printed and handwritten Arabic script is still a challenging task. These challenges are due to the specific characteristics of the Arabic script. In this work, we attempt to address these challenges by creating a deep learning OCR for Arabic document recognition called ADOCRNet. It is a novel deep learning framework whose architecture is built of layers of Convolutional Neural Networks (CNNs) and Bidirectional Long Short-Term Memory (BLSTM) trained using Connectionist Temporal Classification (CTC) algorithm. In order to assess the performance of our OCR, the proposed system is performed on two printed text datasets which are P-KHATT (text line images) and APTI (word images). It’s also evaluated on a handwritten Arabic text dataset IFN/ENIT (word images). According to the practical tests, the conceived model achieves strength recognition rates on the three datasets. ADOCRNet reaches a Character Error Rate (CER) of 0.01% on the P-KHATT dataset, 0.03% on the APTI dataset and a Word Error Rate (WER) of 1.09% on the IFN/ENIT dataset, which significantly outperforms the outcomes of the current systems.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords