Learning‐free handwritten word spotting method for historical handwritten documents

Hanadi Hassen Mohammed; Nandhini Subramanian; Somaya Al‐Madeed

doi:10.1049/ipr2.12216

IET Image Processing (Aug 2021)

Learning‐free handwritten word spotting method for historical handwritten documents

Hanadi Hassen Mohammed,
Nandhini Subramanian,
Somaya Al‐Madeed

Affiliations

Hanadi Hassen Mohammed: Department of Computer Science and Engineering Qatar University Doha Qatar
Nandhini Subramanian: Department of Computer Science and Engineering Qatar University Doha Qatar
Somaya Al‐Madeed: Department of Computer Science and Engineering Qatar University Doha Qatar

DOI: https://doi.org/10.1049/ipr2.12216
Journal volume & issue: Vol. 15, no. 10
pp. 2332 – 2341

Abstract

Read online

Abstract Word spotting on degraded and noisy historical documents can become a challenging task considering the computational time and memory usage required to scan the entire document image. This paper proposes a new effective technique for multi‐language word spotting using a two different feature extraction techniques, Histogram of Oriented Gradients (HOG) and Speeded Up Robust Features (SURF) features. First, regions of interest (ROIs) are extracted using a cross‐correlation measure, and the extracted ROIs are re‐ranked using feature extraction and matching methods. The algorithm handles two types of scenarios: Segmentation‐based and segmentation‐free. It also facilitates the search for words that occur once as well as multiple times in the image. Evaluations were conducted on the George Washington and HADARA datasets using a standard evaluation method. The proposed methodology shows improved performance over contemporary technologies currently being used in the word spotting research field.

Published in IET Image Processing

ISSN: 1751-9659 (Print); 1751-9667 (Online)
Publisher: Wiley
Country of publisher: United Kingdom
LCC subjects: Technology: Photography; Science: Mathematics: Instruments and machines: Electronic computers. Computer science: Computer software
Website: https://ietresearch.onlinelibrary.wiley.com/journal/17519667

About the journal

Abstract

Keywords