IEEE Access (Jan 2022)

Recognition of Visual Arabic Scripting News Ticker From Broadcast Stream

  • Moeen Tayyab,
  • Ayyaz Hussain,
  • Mohammed Ali Alshara,
  • Shakir Khan,
  • Reemiah Muneer Alotaibi,
  • Abdul Rauf Baig

DOI
https://doi.org/10.1109/ACCESS.2022.3179366
Journal volume & issue
Vol. 10
pp. 59189 – 59204

Abstract

News ticker recognition is a vital area of research because of its applications, such as information analysis, opinion mining, and language translation for media regulatory authorities. Without automated systems, analyzing tickers manually is difficult. In this paper, we focus on an automatic recognition system for Arabic and Urdu news tickers. It consists mainly of ticker segmentation and text recognition, which together generate textual data for various online services. Our work investigates character-wise explicit segmentation and syntactical models with Kufi and Nastaleeq fonts. Many existing network models learn deep representations by treating all classes uniformly, regardless of inter-symbol correlations and linguistic taxonomy. The proposed learning model incorporates fairness by maximizing the balance among sensitive features of characters in a unified manner. We demonstrate the efficiency of the proposed model through experiments on customized news ticker datasets with accurate character-level and component-level labeling. Our method is also evaluated on the challenging Urdu Printed Text Images (UPTI) dataset, which provides only ligature-based annotations. The proposed method attains 98.36%, outperforming the current state-of-the-art method. Ablation studies show that our technique improves performance on character classes with low symbol frequencies.

Keywords