GLTF-Net: Deep-Learning Network for Thick Cloud Removal of Remote Sensing Images via Global–Local Temporality and Features

Junhao Jia; Mingzhong Pan; Yaowei Li; Yanchao Yin; Shengmei Chen; Hongjia Qu; Xiaoxuan Chen; Bo Jiang

doi:10.3390/rs15215145

Remote Sensing (Oct 2023)

Affiliations

Junhao Jia: School of Information Science and Technology, Northwest University, Xi’an 710127, China
Mingzhong Pan: School of Physics and Photoelectric Engineering, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou 310012, China
Yaowei Li: Department of Computer Science and Engineering, Xi’an University of Technology, Xi’an 710048, China
Yanchao Yin: School of Information Science and Technology, Northwest University, Xi’an 710127, China
Shengmei Chen: School of Information Science and Technology, Northwest University, Xi’an 710127, China
Hongjia Qu: School of Information Science and Technology, Northwest University, Xi’an 710127, China
Xiaoxuan Chen: School of Information Science and Technology, Northwest University, Xi’an 710127, China
Bo Jiang: School of Information Science and Technology, Northwest University, Xi’an 710127, China

DOI: https://doi.org/10.3390/rs15215145
Journal volume & issue: Vol. 15, no. 21, p. 5145

Abstract

Remote sensing images are highly vulnerable to cloud interference during imaging. Cloud occlusion, especially thick cloud occlusion, significantly reduces image quality and causes ground information to be lost, which in turn affects a variety of downstream tasks that use the images. To address this problem, a thick cloud removal method based on a temporal global–local structure is first proposed. The method comprises two stages: the global multi-temporal feature fusion (GMFF) stage and the local single-temporal information restoration (LSIR) stage. It uses the fused global multi-temporal features to restore the information occluded by thick cloud in each local single-temporal image. Then, a feature global–local structure is built in both stages, combining the global feature capture ability of the Transformer with the local feature extraction ability of the CNN, with the goal of effectively retaining the detailed information of the remote sensing images. Finally, the local feature extraction (LFE) module and the global–local feature extraction (GLFE) module are designed according to these global–local characteristics, with module details that differ between the two stages. Experimental results indicate that the proposed method performs significantly better than the compared methods on the established data set for multi-temporal thick cloud removal. Across the four scenes, compared with the best competing method, CMSN, the peak signal-to-noise ratio (PSNR) improved by 2.675, 5.2255, and 4.9823 dB in the first, second, and third temporal images, respectively, an average improvement of 9.65% over the three temporal images. The correlation coefficient (CC) improved by 0.016, 0.0658, and 0.0145 in the first, second, and third temporal images, respectively, an average improvement of 3.35%. Structural similarity (SSIM) and root mean square error (RMSE) improved by 0.33% and 34.29%, respectively. Consequently, in the field of multi-temporal cloud removal, the proposed method makes better use of multi-temporal information and restores thick-cloud-occluded regions more effectively.
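The abstract reports PSNR, CC, and RMSE without stating how they are computed. As a rough illustration only, the sketch below uses the standard textbook definitions of these three metrics on flattened pixel sequences. The function names, the `max_value` parameter (the assumed peak pixel value, here 1.0 for normalized images), and the sample values are hypothetical and are not taken from the paper, whose exact evaluation protocol may differ.

```python
import math


def rmse(ref, est):
    """Root mean square error between two equal-length pixel sequences."""
    if len(ref) != len(est) or not ref:
        raise ValueError("inputs must be non-empty and of equal length")
    return math.sqrt(sum((r - e) ** 2 for r, e in zip(ref, est)) / len(ref))


def psnr(ref, est, max_value=1.0):
    """Peak signal-to-noise ratio in dB.

    max_value is an assumption: the largest possible pixel value
    (1.0 for normalized images, 255 for 8-bit images).
    """
    err = rmse(ref, est)
    if err == 0:
        return math.inf  # identical images
    return 20 * math.log10(max_value / err)


def correlation_coefficient(ref, est):
    """Pearson correlation coefficient between two pixel sequences."""
    if len(ref) != len(est) or not ref:
        raise ValueError("inputs must be non-empty and of equal length")
    n = len(ref)
    mean_r = sum(ref) / n
    mean_e = sum(est) / n
    cov = sum((r - mean_r) * (e - mean_e) for r, e in zip(ref, est))
    var_r = sum((r - mean_r) ** 2 for r in ref)
    var_e = sum((e - mean_e) ** 2 for e in est)
    if var_r == 0 or var_e == 0:
        raise ValueError("correlation is undefined for a constant image")
    return cov / math.sqrt(var_r * var_e)


# Hypothetical flattened cloud-free reference and restored image.
reference = [0.0, 0.5, 1.0, 0.5]
restored = [0.1, 0.5, 0.9, 0.5]

print("RMSE:", rmse(reference, restored))
print("PSNR (dB):", psnr(reference, restored))
print("CC:", correlation_coefficient(reference, restored))
```

Higher PSNR and CC indicate a closer match to the reference, while lower RMSE is better, which is why the abstract reports the RMSE change as a percentage improvement rather than an increase.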

Published in Remote Sensing

ISSN: 2072-4292 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Science
Website: http://www.mdpi.com/journal/remotesensing/
