International Journal of Applied Earth Observations and Geoinformation (Nov 2023)

DPENet: Dual-path extraction network based on CNN and transformer for accurate building and road extraction

  • Ziyi Chen,
  • Yuhua Luo,
  • Jing Wang,
  • Jonathan Li,
  • Cheng Wang,
  • Dilong Li

DOI
https://doi.org/10.1016/j.jag.2023.103510
Journal volume & issue
Vol. 124
p. 103510

Abstract

Read online

The acceleration of urbanization and the increasing demand for precise city planning have made the extraction of buildings and roads from remote sensing images crucial. Deep learning-based methods have propelled the progress of object extraction technology, but there are still challenges such as the missing and incomplete extraction of buildings and roads for small objects and occlusions. To address this issue, we propose a dual-path extraction network based on CNN and Transformer, combining local and global features to fully extract the semantic information of objects. To further enhance the semantic reconstruction capability of features, this paper introduces a multi-scale upsampling mechanism, thereby expanding the visual range of reconstruction. Finally, we adopt a deep supervision strategy to improve the reconstruction accuracy of objects at different resolutions. Our method has been tested on four remote sensing image datasets and has achieved excellent IoU scores on all datasets (Massachusetts Building and Roads Dataset: 76.69% and 66.41%, LRSNY and CHN6-CUG Roads Dataset: 88.96% and 61.99%). Furthermore, our method demonstrates superior performance compared to other mainstream image segmentation algorithms, fully demonstrating the effectiveness of our approach.

Keywords