A 3-D Convolutional Vision Transformer for PolSAR Image Classification and Change Detection

Lei Wang; Rong Gui; Hanyu Hong; Jun Hu; Lei Ma; Yu Shi

doi:10.1109/JSTARS.2024.3409775

IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing (Jan 2024)

A 3-D Convolutional Vision Transformer for PolSAR Image Classification and Change Detection

Lei Wang,
Rong Gui,
Hanyu Hong,
Jun Hu,
Lei Ma,
Yu Shi

Affiliations

Lei Wang: ORCiD; Hubei Key Laboratory of Optical Information and Pattern Recognition, School of Electrical and Information Engineering, Wuhan Institute of Technology, Wuhan, China
Rong Gui: ORCiD; School of Geosciences and Info-physics, Central South University, Changsha, China
Hanyu Hong: ORCiD; Hubei Key Laboratory of Optical Information and Pattern Recognition, School of Electrical and Information Engineering, Wuhan Institute of Technology, Wuhan, China
Jun Hu: ORCiD; School of Geosciences and Info-physics, Central South University, Changsha, China
Lei Ma: ORCiD; Hubei Key Laboratory of Optical Information and Pattern Recognition, School of Electrical and Information Engineering, Wuhan Institute of Technology, Wuhan, China
Yu Shi: ORCiD; Hubei Key Laboratory of Optical Information and Pattern Recognition, School of Electrical and Information Engineering, Wuhan Institute of Technology, Wuhan, China

DOI: https://doi.org/10.1109/JSTARS.2024.3409775
Journal volume & issue: Vol. 17
pp. 11503 – 11520

Abstract

Read online

The scattering properties of targets in polarimetric synthetic aperture radar (PolSAR) images are directly influenced by the targets' orientations, as the scattering properties from the same target with different orientations can be very different. This interpretation diversity caused by the target orientations is one of the primary technical bottlenecks in PolSAR image interpretation. In this article, a 3-D convolutional vision transformer (3-D-Conv-ViT) is proposed to describe the relationship between polarimetric coherent matrices with different polarization orientation angles (POAs) for PolSAR image classification and change detection. First, 3-D convolutional neural networks are used to capture the high-level feature representations of the polarimetric coherent matrix sequence. Second, a new Rotation-3-D-ViT block is proposed to learn the local and global representations of the high-level feature maps. The self-attention mechanism in the ViT can express the regularity of polarimetric coherent matrices with different POAs and improve the PolSAR image interpretation performance. Third, combined with different classifiers, the proposed 3-D-Conv-ViT can be applied to both PolSAR image classification and change detection. Experiments on real PolSAR image datasets demonstrate that the proposed method can overcome the problem of the interpretation ambiguity caused by the target orientation. The classification accuracies of the proposed method can reach 94.01%–99.48%, and the change detection accuracies can reach 93.84%–96.86%.

Published in IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing

ISSN: 1939-1404 (Print); 2151-1535 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Ocean engineering; Science: Physics: Geophysics. Cosmic physics
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=4609443

About the journal

Abstract

Keywords