IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing (Jan 2024)

DCTC: Fast and Accurate Contour-Based Instance Segmentation With DCT Encoding for High-Resolution Remote Sensing Images

  • Zhong Chen,
  • Tianhang Liu,
  • Xueru Xu,
  • Junsong Leng,
  • Zhenxue Chen

DOI
https://doi.org/10.1109/JSTARS.2024.3386754
Journal volume & issue
Vol. 17
pp. 8697 – 8709

Abstract

Read online

Instance segmentation in remote sensing images (RSI) poses significant challenges due to the diverse scales of targets, scene complexity, and a high number of targets, making most methods struggle with suboptimal performance and time-consuming computations. To solve those problems, a fast and accurate RSI instance segmentation model (named DCTC) is designed in this article. DCTC transforms classification problem into regression problem to improve the reference speed. DCTC contains two parallel branches. The contour branch performs iterative regression on contours, extracting precise contour information to improve boundary accuracy. Meanwhile, the discrete cosine transformation (DCT) branch refines mask predictions and supplements instance context information, which particularly benefits the segmentation of small targets. DCT encoding is employed in the DCT branch to convert the mask representation into DCT format, aligning the outputs of the contour and DCT branches. Three innovative modules are introduced in the DCT branch: the coarse result generation (CRG) module, iteratively deform and regression (IDR) module, and contour and DCT fusion module (CDF). The CRG module generates coarse DCT vectors and contour coordinates, facilitating information exchange between the contour and DCT branches. The IDR module iteratively refines DCT vectors, enabling DCTC to focus more on small targets and instance details. The CDF module merges DCT vectors and contour coordinates, ensuring effective interaction between boundary and context information, thereby enhancing performance. Extensive experiments demonstrate the superiority of DCTC, which achieves 67.7, 36.3, 67.4, and 55.1AP on NWPU VHR-10, iSAID, synthetic aperture radar (SAR) ship detection dataset, and high-resolution SAR images dataset, respectively, and ranks first among state-of-the-art methods while maintaining real-time processing capability. Furthermore, DCTC exhibits strong performance on both optical and SAR images, and the designed DCT branch can be simply plug into any contour-based method to improve the network performance.

Keywords