Pavement crack detection from CCD images with a locally enhanced transformer network

Zhengsen Xu; Haiyan Guan; Jian Kang; Xiangda Lei; Lingfei Ma; Yongtao Yu; Yiping Chen; Jonathan Li

International Journal of Applied Earth Observations and Geoinformation (Jun 2022)

Affiliations

Zhengsen Xu: School of Remote Sensing and Geomatics Engineering, Nanjing University of Information Science and Technology, Nanjing 210044, China
Haiyan Guan: School of Remote Sensing and Geomatics Engineering, Nanjing University of Information Science and Technology, Nanjing 210044, China; Corresponding author.
Jian Kang: School of Remote Sensing and Geomatics Engineering, Nanjing University of Information Science and Technology, Nanjing 210044, China
Xiangda Lei: School of Remote Sensing and Geomatics Engineering, Nanjing University of Information Science and Technology, Nanjing 210044, China
Lingfei Ma: School of Statistics and Mathematics, Central University of Finance and Economics, Beijing 102206, China; Corresponding author.
Yongtao Yu: Faculty of Computer and Software Engineering, Huaiyin Institute of Technology, Huaian 223003, China
Yiping Chen: Department of Computer Sciences, School of Informatics, Xiamen University, Xiamen 361000, China
Jonathan Li: Department of Geography and Environmental Management and Department of Systems Design Engineering, University of Waterloo, Waterloo ON N2L 3G1, Canada

Journal volume & issue: Vol. 110, p. 102825

Abstract

Precisely identifying pavement cracks in high-resolution images captured by charge-coupled devices (CCDs) faces many challenges. Although convolutional neural networks (CNNs) have achieved impressive performance in this task, stacked convolutional layers fail to extract long-range contextual features and impose high computational costs. We therefore propose a locally enhanced Transformer network (LETNet) to detect pavement cracks completely and efficiently. In LETNet, a Transformer is employed to model long-range dependencies. A convolution stem and a local enhancement module supplement it with both low-level and high-level local features. To take advantage of these rich features, a skip connection strategy and an efficient upsampling module are built to restore detailed information. In addition, a defect rectification module is developed to reinforce the network for hard sample recognition. The quantitative comparison demonstrates that the proposed LETNet outperforms four advanced deep learning-based models in both efficiency and effectiveness. Specifically, the average precision, recall, ODS, IoU, and frames per second (FPS) of LETNet on three testing datasets are approximately 93.04%, 92.85%, 92.94%, 94.07%, and 30.80 FPS, respectively. We also built a comprehensive pavement crack dataset containing 156 high-resolution, manually annotated CCD images and made it publicly available on Zenodo.
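
The abstract reports precision, recall, ODS, and IoU without spelling out how they are computed. As a rough illustration only, the following sketch computes the usual pixel-wise versions of precision, recall, F-measure, and crack-class IoU for a pair of binary masks. The function name `crack_metrics` and the toy masks are hypothetical and not taken from the paper. ODS is commonly reported as the best F-measure under a single dataset-wide threshold, and the paper's own protocol (for example tolerance margins or averaging over classes) may differ from this sketch.

```python
def crack_metrics(pred, truth):
    """Pixel-wise precision, recall, F-measure and IoU for binary masks.

    pred and truth are equally sized 2D lists of 0/1 values (1 = crack pixel).
    Hypothetical helper for illustration; not the paper's evaluation code.
    """
    tp = fp = fn = 0
    for pred_row, truth_row in zip(pred, truth):
        for p, t in zip(pred_row, truth_row):
            if p and t:
                tp += 1          # crack pixel correctly detected
            elif p and not t:
                fp += 1          # background pixel flagged as crack
            elif t and not p:
                fn += 1          # crack pixel missed

    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f_measure = (2 * precision * recall / (precision + recall)
                 if precision + recall else 0.0)
    iou = tp / (tp + fp + fn) if tp + fp + fn else 0.0
    return {"precision": precision, "recall": recall,
            "f_measure": f_measure, "iou": iou}


# Toy 3x4 masks, invented for illustration only.
truth = [[0, 1, 1, 0],
         [0, 1, 0, 0],
         [0, 1, 0, 0]]
pred = [[0, 1, 0, 0],
        [0, 1, 1, 0],
        [0, 1, 0, 0]]

metrics = crack_metrics(pred, truth)
print(metrics)
```

With these toy masks there are three true positives, one false positive, and one false negative, so precision, recall, and F-measure are each 0.75 and the crack-class IoU is 0.6.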

Published in International Journal of Applied Earth Observations and Geoinformation

ISSN: 1569-8432 (Print); 1872-826X (Online)
Publisher: Elsevier
Country of publisher: Netherlands
LCC subjects: Geography. Anthropology. Recreation: Physical geography; Geography. Anthropology. Recreation: Environmental sciences
Website: https://www.journals.elsevier.com/international-journal-of-applied-earth-observation-and-geoinformation