Applied Sciences (Nov 2020)

Learning-Free Text Line Segmentation for Historical Handwritten Documents

  • Berat Kurar Barakat,
  • Rafi Cohen,
  • Ahmad Droby,
  • Irina Rabaev,
  • Jihad El-Sana

DOI
https://doi.org/10.3390/app10228276
Journal volume & issue
Vol. 10, no. 22
p. 8276

Abstract

Read online

We present a learning-free method for text line segmentation of historical handwritten document images. This method relies on automatic scale selection together with second derivative of anisotropic Gaussian filters to detect the blob lines that strike through the text lines. Detected blob lines guide an energy minimization procedure to extract the text lines. Historical handwritten documents contain noise, heterogeneous text line heights, skews and touching characters among text lines. Automatic scale selection allows for automatic adaption to the heterogeneous nature of handwritten text lines in case the character height range is correctly estimated. In the extraction phase, the method can accurately split the touching characters among the text lines. We provide results investigating various settings and compare the model with recent learning-free and learning-based methods on the cBAD competition dataset.

Keywords