TCPSNet: Transformer and Cross-Pseudo-Siamese Learning Network for Classification of Multi-Source Remote Sensing Images

Yongduo Zhou; Cheng Wang; Hebing Zhang; Hongtao Wang; Xiaohuan Xi; Zhou Yang; Meng Du

doi:10.3390/rs16173120

Remote Sensing (Aug 2024)

Affiliations

Yongduo Zhou: School of Surveying and Land Information Engineering, Henan Polytechnic University, Jiaozuo 454000, China
Cheng Wang: School of Surveying and Land Information Engineering, Henan Polytechnic University, Jiaozuo 454000, China
Hebing Zhang: School of Surveying and Land Information Engineering, Henan Polytechnic University, Jiaozuo 454000, China
Hongtao Wang: School of Surveying and Land Information Engineering, Henan Polytechnic University, Jiaozuo 454000, China
Xiaohuan Xi: Key Laboratory of Digital Earth Science, Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100094, China
Zhou Yang: Key Laboratory of Digital Earth Science, Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100094, China
Meng Du: Key Laboratory of Digital Earth Science, Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100094, China

DOI: https://doi.org/10.3390/rs16173120
Journal volume & issue: Vol. 16, no. 17
p. 3120

Abstract

The integration of multi-source remote sensing data, bolstered by advancements in deep learning, has emerged as a pivotal strategy for enhancing land use and land cover (LULC) classification accuracy. However, current methods often fail to consider the abundant prior knowledge contained in remote sensing images and the characteristics of heterogeneous remote sensing data. As a result, information is lost between modalities and a significant amount of useful information is discarded, which lowers classification accuracy. To tackle these challenges, this paper proposes a LULC classification method for multi-source remote sensing data that combines a Transformer with a cross-pseudo-siamese learning deep neural network (TCPSNet). It first extracts shallow features in a dynamic multi-scale manner, fully leveraging the prior information of remote sensing data. It then models deep features through the multimodal cross-attention module (MCAM) and the cross-pseudo-siamese learning module (CPSLM). Finally, it fuses local and global features comprehensively by combining feature-level and decision-level fusion. Extensive experiments on the Trento, Houston 2013, Augsburg, MUUFL and Berlin datasets demonstrate the superior performance of the proposed TCPSNet. The overall accuracy (OA) of the network on the Trento, Houston 2013, Augsburg, MUUFL and Berlin datasets is 99.76%, 99.92%, 97.41%, 87.97% and 97.96%, respectively.
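The abstract mentions decision-level fusion and reports overall accuracy (OA) without giving details. The sketch below illustrates both ideas in a generic form only: it assumes two branches that each output per-sample class probabilities and fuses them by a weighted average, which is one common decision-level rule and not necessarily the one used in TCPSNet. The function names, the `weight_a` parameter and the toy numbers are hypothetical.

```python
from typing import List, Sequence


def fuse_decisions(prob_a: Sequence[Sequence[float]],
                   prob_b: Sequence[Sequence[float]],
                   weight_a: float = 0.5) -> List[int]:
    """Fuse two branches' class probabilities and return predicted labels.

    Assumption: a weighted average followed by argmax. The paper's actual
    fusion rule is not stated in the abstract.
    """
    labels = []
    for pa, pb in zip(prob_a, prob_b):
        fused = [weight_a * a + (1.0 - weight_a) * b for a, b in zip(pa, pb)]
        # argmax over the fused class scores
        labels.append(max(range(len(fused)), key=fused.__getitem__))
    return labels


def overall_accuracy(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """OA = number of correctly classified samples / total number of samples."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("y_true and y_pred must be non-empty and equal in length")
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


# Toy data (hypothetical): 4 samples, 3 classes, two modality branches.
branch_a = [[0.7, 0.2, 0.1], [0.1, 0.6, 0.3], [0.4, 0.5, 0.1], [0.2, 0.2, 0.6]]
branch_b = [[0.6, 0.3, 0.1], [0.2, 0.3, 0.5], [0.7, 0.2, 0.1], [0.1, 0.1, 0.8]]
truth = [0, 1, 1, 2]

predicted = fuse_decisions(branch_a, branch_b)
print(predicted)                           # fused labels per sample
print(overall_accuracy(truth, predicted))  # fraction of correct labels
```

In the toy data the two branches disagree on the second and third samples, so the fused label depends on how confident each branch is; OA then simply counts how many fused labels match the reference labels.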

Published in Remote Sensing

ISSN: 2072-4292 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Science
Website: http://www.mdpi.com/journal/remotesensing/
