IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing (Jan 2024)
Dynamic Convolution Covariance Network Using Multiscale Feature Fusion for Remote Sensing Scene Image Classification
Abstract
The rapid increase in the spatial resolution of remote sensing scene images (RSIs) has brought a corresponding increase in the complexity of the spatial contextual information they contain. The coexistence of numerous small-scale objects makes these features difficult to locate and mine accurately, which in turn hinders accurate interpretation. To address these issues, this article proposes a dynamic convolution covariance network (ODFMN) based on omni-dimensional dynamic convolution, which extracts multidimensional, multiscale features from RSIs and represents the feature information with higher-order statistics. First, to fully exploit the complex spatial context of RSIs and overcome the limitations of a single static convolution kernel for feature extraction, we construct an omni-dimensional feature extraction module based on dynamic convolution, which fully exploits information along all four dimensions of the convolution kernel space. Then, to make full use of the full-dimensional features extracted at each level of the network, a multiscale feature fusion module establishes relationships from local to global, enriching the feature representation. Finally, higher-order statistical information is employed to overcome the difficulty of representing small object features with first-order statistics alone. Experiments on three publicly available datasets demonstrate that the method achieves high classification accuracies of 99.04%, 95.34%, and 92.50%, respectively. Furthermore, feature visualization verifies that the method accurately captures object contours, shapes, and spatial context.
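For readers unfamiliar with the two core operations the abstract names, the sketch below illustrates omni-dimensional dynamic convolution (per-sample attention over the spatial, input-channel, output-channel, and kernel-number dimensions of a candidate kernel bank, in the spirit of the ODConv formulation the paper builds on) and covariance pooling as a higher-order statistical descriptor. This is a minimal PyTorch illustration, not the authors' ODFMN implementation; the class name, hyperparameters (num_kernels, reduction), and initialization choices are assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class ODConv2d(nn.Module):
    """Sketch of omni-dimensional dynamic convolution: four attention
    branches (spatial, input-channel, output-channel, kernel-number)
    modulate a bank of candidate kernels, which are aggregated into one
    kernel per input sample before a standard convolution is applied."""

    def __init__(self, in_ch, out_ch, k=3, num_kernels=4, reduction=16):
        super().__init__()
        self.k, self.num_kernels = k, num_kernels
        # bank of num_kernels candidate kernels (illustrative init scale)
        self.weight = nn.Parameter(
            torch.randn(num_kernels, out_ch, in_ch, k, k) * 0.02)
        hidden = max(in_ch // reduction, 4)
        self.gap = nn.AdaptiveAvgPool2d(1)
        self.fc = nn.Sequential(nn.Conv2d(in_ch, hidden, 1), nn.ReLU())
        # one attention head per dimension of the kernel space
        self.attn_spatial = nn.Conv2d(hidden, k * k, 1)
        self.attn_in = nn.Conv2d(hidden, in_ch, 1)
        self.attn_out = nn.Conv2d(hidden, out_ch, 1)
        self.attn_kernel = nn.Conv2d(hidden, num_kernels, 1)

    def forward(self, x):
        b, c, h, w = x.shape
        ctx = self.fc(self.gap(x))  # global context, (b, hidden, 1, 1)
        a_s = torch.sigmoid(self.attn_spatial(ctx)).view(b, 1, 1, 1, self.k, self.k)
        a_i = torch.sigmoid(self.attn_in(ctx)).view(b, 1, 1, c, 1, 1)
        a_o = torch.sigmoid(self.attn_out(ctx)).view(b, 1, -1, 1, 1, 1)
        a_k = torch.softmax(self.attn_kernel(ctx).view(b, -1), dim=1) \
                   .view(b, self.num_kernels, 1, 1, 1, 1)
        # modulate the kernel bank along all four dimensions, then
        # aggregate over the kernel-number axis -> (b, out_ch, in_ch, k, k)
        w_dyn = (a_k * a_o * a_i * a_s * self.weight.unsqueeze(0)).sum(dim=1)
        # per-sample kernels via the grouped-convolution trick
        x = x.reshape(1, b * c, h, w)
        w_dyn = w_dyn.reshape(b * w_dyn.size(1), c, self.k, self.k)
        out = F.conv2d(x, w_dyn, padding=self.k // 2, groups=b)
        return out.reshape(b, -1, h, w)


def covariance_pooling(feat):
    """Second-order (covariance) pooling: returns the upper triangle of
    the channel covariance matrix as a global higher-order descriptor."""
    b, c, h, w = feat.shape
    f = feat.flatten(2)                        # (b, c, h*w)
    f = f - f.mean(dim=2, keepdim=True)        # center each channel
    cov = f @ f.transpose(1, 2) / (h * w - 1)  # (b, c, c)
    idx = torch.triu_indices(c, c)
    return cov[:, idx[0], idx[1]]              # (b, c*(c+1)/2)


x = torch.randn(2, 64, 32, 32)
feats = ODConv2d(64, 128)(x)      # (2, 128, 32, 32)
desc = covariance_pooling(feats)  # (2, 8256) second-order descriptor
```

In this reading, the dynamic kernel aggregation supplies the adaptive, full-dimensional feature extraction, while the covariance descriptor supplies the higher-order statistics that first-order pooling misses for small objects; the paper's multiscale fusion module would combine such features across network levels before the statistical representation.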
Keywords