Rectification and Super-Resolution Enhancements for Forensic Text Recognition

Pablo Blanco-Medina; Eduardo Fidalgo; Enrique Alegre; Rocío Alaiz-Rodríguez; Francisco Jáñez-Martino; Alexandra Bonnici

doi:10.3390/s20205850

Sensors (Oct 2020)

Rectification and Super-Resolution Enhancements for Forensic Text Recognition

Pablo Blanco-Medina,
Eduardo Fidalgo,
Enrique Alegre,
Rocío Alaiz-Rodríguez,
Francisco Jáñez-Martino,
Alexandra Bonnici

Affiliations

Pablo Blanco-Medina: Department of Electrical, Systems and Automation, Universidad de León, 24007 León, Spain
Eduardo Fidalgo: Department of Electrical, Systems and Automation, Universidad de León, 24007 León, Spain
Enrique Alegre: Department of Electrical, Systems and Automation, Universidad de León, 24007 León, Spain
Rocío Alaiz-Rodríguez: Department of Electrical, Systems and Automation, Universidad de León, 24007 León, Spain
Francisco Jáñez-Martino: Department of Electrical, Systems and Automation, Universidad de León, 24007 León, Spain
Alexandra Bonnici: Faculty of Engineering, University of Malta, MSD2080 Msida, Malta

DOI: https://doi.org/10.3390/s20205850
Journal volume & issue: Vol. 20, no. 20
p. 5850

Abstract

Read online

Retrieving text embedded within images is a challenging task in real-world settings. Multiple problems such as low-resolution and the orientation of the text can hinder the extraction of information. These problems are common in environments such as Tor Darknet and Child Sexual Abuse images, where text extraction is crucial in the prevention of illegal activities. In this work, we evaluate eight text recognizers and, to increase the performance of text transcription, we combine these recognizers with rectification networks and super-resolution algorithms. We test our approach on four state-of-the-art and two custom datasets (TOICO-1K and Child Sexual Abuse (CSA)-text, based on text retrieved from Tor Darknet and Child Sexual Exploitation Material, respectively). We obtained a 0.3170 score of correctly recognized words in the TOICO-1K dataset when we combined Deep Convolutional Neural Networks (CNN) and rectification-based recognizers. For the CSA-text dataset, applying resolution enhancements achieved a final score of 0.6960. The highest performance increase was achieved on the ICDAR 2015 dataset, with an improvement of 4.83% when combining the MORAN recognizer and the Residual Dense resolution approach. We conclude that rectification outperforms super-resolution when applied separately, while their combination achieves the best average improvements in the chosen datasets.

Published in Sensors

ISSN: 1424-8220 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Chemical technology
Website: http://www.mdpi.com/journal/sensors

About the journal

Abstract

Keywords