EURASIP Journal on Advances in Signal Processing (Oct 2021)

PSENet-based efficient scene text detection

  • Guanglong Liao,
  • Zhongjie Zhu,
  • Yongqiang Bai,
  • Tingna Liu,
  • Zhibo Xie

DOI
https://doi.org/10.1186/s13634-021-00808-5
Journal volume & issue
Vol. 2021, no. 1
pp. 1 – 13

Abstract

Read online

Abstract Text detection is a key technique and plays an important role in computer vision applications, but efficient and precise text detection is still challenging. In this paper, an efficient scene text detection scheme is proposed based on the Progressive Scale Expansion Network (PSENet). A Mixed Pooling Module (MPM) is designed to effectively capture the dependence of text information at different distances, where different pooling operations are employed to better extract information of text shape. The backbone network is optimized by combining two extensions of the Residual Network (ResNet), i.e., ResNeXt and Res2Net, to enhance feature extraction effectiveness. Experimental results show that the precision of our scheme is improved more than by 5% compared with the original PSENet.

Keywords