Signals (Oct 2024)
Curved Text Line Rectification via Bresenham’s Algorithm and Generalized Additive Models
Abstract
This paper presents a methodology for rectifying curved text lines, a crucial process in optical character recognition (OCR) and computer vision. Utilizing generalized additive models (GAMs), the proposed method accurately estimates text curvature and rectifies it into a straight format for improved text recognition. The process includes image binarization techniques like Otsu’s thresholding, morphological operations, curve estimation, and the Bresenham line drawing algorithm. The results show significant improvements in OCR accuracy among different challenging distortion scenarios. The implementation, written in Python, demonstrates the potential for enhancing text alignment and rectification in scanned text line images utilizing a flexible, robust, and customizable framework.
Keywords