Computational Visual Media (Dec 2021)

Scene text removal via cascaded text stroke detection and erasing

  • Xuewei Bian,
  • Chaoqun Wang,
  • Weize Quan,
  • Juntao Ye,
  • Xiaopeng Zhang,
  • Dong-Ming Yan

DOI
https://doi.org/10.1007/s41095-021-0242-8
Journal volume & issue
Vol. 8, no. 2
pp. 273 – 287

Abstract

Recent learning-based approaches show promising performance improvements on the scene text removal task, but they usually leave remnants of text and produce visually unpleasant results. In this work, a novel end-to-end framework is proposed based on accurate text stroke detection. Specifically, the text removal problem is decoupled into text stroke detection and stroke removal; we design separate networks to solve these two subproblems, the latter being a generative network. These two networks are combined as a processing unit, which is cascaded to obtain our final model for text removal. Experimental results demonstrate that the proposed method substantially outperforms the state of the art in locating and erasing scene text. A new large-scale real-world dataset with 12,120 images has been constructed and is being made available to facilitate research, as current publicly available datasets are mainly synthetic and so cannot properly measure the performance of different methods.

Keywords