Arbitrary-Shaped Text Detection With Adaptive Text Region Representation

Xiufeng Jiang; Shugong Xu; Shunqing Zhang; Shan Cao

doi:10.1109/ACCESS.2020.2999069

IEEE Access (Jan 2020)

Arbitrary-Shaped Text Detection With Adaptive Text Region Representation

Xiufeng Jiang,
Shugong Xu,
Shunqing Zhang,
Shan Cao

Affiliations

Xiufeng Jiang: ORCiD; Shanghai Institute for Advanced Communication and Data Science, Shanghai University, Shanghai, China
Shugong Xu: ORCiD; Shanghai Institute for Advanced Communication and Data Science, Shanghai University, Shanghai, China
Shunqing Zhang: ORCiD; Shanghai Institute for Advanced Communication and Data Science, Shanghai University, Shanghai, China
Shan Cao: ORCiD; Shanghai Institute for Advanced Communication and Data Science, Shanghai University, Shanghai, China

DOI: https://doi.org/10.1109/ACCESS.2020.2999069
Journal volume & issue: Vol. 8
pp. 102106 – 102118

Abstract

Read online

Text detection/localization, as an important task in computer vision, has witnessed substantial advancements in methodology and performance with convolutional neural networks. However, the vast majority of popular methods use rectangles or quadrangles to describe text regions. These representations have inherent drawbacks, especially relating to dense adjacent text and loose regional text boundaries, which usually cause difficulty detecting arbitrarily shaped text. In this paper, we propose a novel text region representation method, with a robust pipeline, which can precisely detect dense adjacent text instances with arbitrary shapes. We consider a text instance to be composed of an adaptive central text region mask and a corresponding expanding ratio between the central text region and the full text region. More specifically, our pipeline generates adaptive central text regions and corresponding expanding ratios with a proposed training strategy, followed by a new proposed post-processing algorithm which expands central text regions to the complete text instance with the corresponding expanding ratios. We demonstrated that our new text region representation is effective, and that the pipeline can precisely detect closely adjacent text instances of arbitrary shapes. Experimental results on common datasets demonstrate superior performance of our work.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords