Open Datasets and Tools for Arabic Text Detection and Recognition in News Video Frames

Oussama Zayene; Sameh Masmoudi Touj; Jean Hennebert; Rolf Ingold; Najoua Essoukri Ben Amara

doi:10.3390/jimaging4020032

Journal of Imaging (Jan 2018)

Open Datasets and Tools for Arabic Text Detection and Recognition in News Video Frames

Oussama Zayene,
Sameh Masmoudi Touj,
Jean Hennebert,
Rolf Ingold,
Najoua Essoukri Ben Amara

Affiliations

Oussama Zayene: LATIS Lab, National Engineering School of Sousse (Eniso), University of Sousse, Sousse 4054, Tunisia
Sameh Masmoudi Touj: LATIS Lab, National Engineering School of Sousse (Eniso), University of Sousse, Sousse 4054, Tunisia
Jean Hennebert: ICoSys Institute, HES-SO, University of Applied Sciences, Fribourg 1705, Switzerland
Rolf Ingold: DIVA Group, Department of Informatics, University of Fribourg (Unifr), Fribourg 1700, Switzerland
Najoua Essoukri Ben Amara: LATIS Lab, National Engineering School of Sousse (Eniso), University of Sousse, Sousse 4054, Tunisia

DOI: https://doi.org/10.3390/jimaging4020032
Journal volume & issue: Vol. 4, no. 2
p. 32

Abstract

Read online

Recognizing texts in video is more complex than in other environments such as scanned documents. Video texts appear in various colors, unknown fonts and sizes, often affected by compression artifacts and low quality. In contrast to Latin texts, there are no publicly available datasets which cover all aspects of the Arabic Video OCR domain. This paper describes a new well-defined and annotated Arabic-Text-in-Video dataset called AcTiV 2.0. The dataset is dedicated especially to building and evaluating Arabic video text detection and recognition systems. AcTiV 2.0 contains 189 video clips serving as a raw material for creating 4063 key frames for the detection task and 10,415 cropped text images for the recognition task. AcTiV 2.0 is also distributed with its annotation and evaluation tools that are made open-source for standardization and validation purposes. This paper also reports on the evaluation of several systems tested under the proposed detection and recognition protocols.

Published in Journal of Imaging

ISSN: 2313-433X (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Photography; Medicine: Medicine (General): Computer applications to medicine. Medical informatics; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: http://www.mdpi.com/journal/jimaging

About the journal

Abstract

Keywords