Exploring region features in remote sensing image captioning

Kai Zhao; Wei Xiong

International Journal of Applied Earth Observations and Geoinformation (Mar 2024)

Exploring region features in remote sensing image captioning

Kai Zhao,
Wei Xiong

Affiliations

Kai Zhao: Space Engineering University, Beijing, 101400, China; Corresponding author.
Wei Xiong: Space Engineering University, Science and Technology on Complex Electronic System Simulation Laboratory, Beijing, 101400, China

Journal volume & issue: Vol. 127
p. 103672

Abstract

Read online

Remote sensing image captioning (RSIC), an emerging field of cross-modal tasks, has become a popular research topic in recent years. Feature extraction underlies all RSIC tasks, with current tasks using grid features. Compared with grid features, region features provide object-level location-related information; however, these features have not been considered in the RSIC tasks. Therefore, this study examined the performance of region features on RSIC tasks. We generated region annotations based on published RSIC datasets to address the need for region-related datasets. We extracted region features according to the labeled data and proposed a Region Attention Transformer model. To solve the information loss problem owing to the region of interest pooling during region feature extraction, we proposed region-grid features and used geometry relationships for estimating correlations between different region features. We compared the performances of the models using grid and region features. The results showed that region features performed well in RSIC tasks, and region features forced the model to pay more attention to object regions when generating object-related words. This study describes a novel method of using features in RSIC tasks. Our region annotations are available at https://github.com/zk-1019/exploring.

Published in International Journal of Applied Earth Observations and Geoinformation

ISSN: 1569-8432 (Print); 1872-826X (Online)
Publisher: Elsevier
Country of publisher: Netherlands
LCC subjects: Geography. Anthropology. Recreation: Physical geography; Geography. Anthropology. Recreation: Environmental sciences
Website: https://www.journals.elsevier.com/international-journal-of-applied-earth-observation-and-geoinformation

About the journal

Abstract

Keywords