Local feature acquisition and global context understanding network for very high-resolution land cover classification

Zhengpeng Li; Jun Hu; Kunyang Wu; Jiawei Miao; Zixue Zhao; Jiansheng Wu

doi:10.1038/s41598-024-63363-7

Scientific Reports (Jun 2024)

Local feature acquisition and global context understanding network for very high-resolution land cover classification

Zhengpeng Li,
Jun Hu,
Kunyang Wu,
Jiawei Miao,
Zixue Zhao,
Jiansheng Wu

Affiliations

Zhengpeng Li: School of Electronic and Information Engineering, University of Science and Technology Liaoning
Jun Hu: School of Electronic and Information Engineering, University of Science and Technology Liaoning
Kunyang Wu: College of Instrumentation and Electrical Engineering, Jilin University
Jiawei Miao: School of Electronic and Information Engineering, University of Science and Technology Liaoning
Zixue Zhao: School of Computer Science and Software Engineering, University of Science and Technology Liaoning
Jiansheng Wu: School of Computer Science and Software Engineering, University of Science and Technology Liaoning

DOI: https://doi.org/10.1038/s41598-024-63363-7
Journal volume & issue: Vol. 14, no. 1
pp. 1 – 20

Abstract

Read online

Abstract Very high-resolution remote sensing images hold promising applications in ground observation tasks, paving the way for highly competitive solutions using image processing techniques for land cover classification. To address the challenges faced by convolutional neural network (CNNs) in exploring contextual information in remote sensing image land cover classification and the limitations of vision transformer (ViT) series in effectively capturing local details and spatial information, we propose a local feature acquisition and global context understanding network (LFAGCU). Specifically, we design a multidimensional and multichannel convolutional module to construct a local feature extractor aimed at capturing local information and spatial relationships within images. Simultaneously, we introduce a global feature learning module that utilizes multiple sets of multi-head attention mechanisms for modeling global semantic information, abstracting the overall feature representation of remote sensing images. Validation, comparative analyses, and ablation experiments conducted on three different scales of publicly available datasets demonstrate the effectiveness and generalization capability of the LFAGCU method. Results show its effectiveness in locating category attribute information related to remote sensing areas and its exceptional generalization capability. Code is available at https://github.com/lzp-lkd/LFAGCU .

Published in Scientific Reports

ISSN: 2045-2322 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine; Science
Website: https://www.nature.com/srep/

About the journal

Abstract

Keywords