Signals (Oct 2024)

Curved Text Line Rectification via Bresenham’s Algorithm and Generalized Additive Models

  • Thomas Stogiannopoulos,
  • Ilias Theodorakopoulos

DOI
https://doi.org/10.3390/signals5040039
Journal volume & issue
Vol. 5, no. 4
pp. 705 – 720

Abstract

Read online

This paper presents a methodology for rectifying curved text lines, a crucial process in optical character recognition (OCR) and computer vision. Utilizing generalized additive models (GAMs), the proposed method accurately estimates text curvature and rectifies it into a straight format for improved text recognition. The process includes image binarization techniques like Otsu’s thresholding, morphological operations, curve estimation, and the Bresenham line drawing algorithm. The results show significant improvements in OCR accuracy among different challenging distortion scenarios. The implementation, written in Python, demonstrates the potential for enhancing text alignment and rectification in scanned text line images utilizing a flexible, robust, and customizable framework.

Keywords