IEEE Access (Jan 2021)

A Systematic Survey of Remote Sensing Image Captioning

  • Beigeng Zhao

DOI
https://doi.org/10.1109/ACCESS.2021.3128140
Journal volume & issue
Vol. 9
pp. 154086 – 154111

Abstract

Read online

Image captioning is a cross-disciplinary task to automatically generate textural descriptions for a given image using computer vision and natural language processing techniques. Remote sensing image captioning refers to the application of this task to remote sensing images taken from high altitude by satellites, aircraft or drones. This interesting and valuable topic has only emerged in recent years and attracted considerable research attention. There has been extensive related work in the literature, with considerable results and an independent body of research, and various issues must be addressed in future work. However, to the best of our knowledge, there has been no review study in this area that can provide researchers with systematic reference information, which is the motivation of this study. To achieve this goal, 30 relevant articles were conditionally filtered and obtained for the review study. We analyzed and summarized the existing work from various perspectives, including technical solutions, data, evaluation metrics, and the experimental results of state-of-the-art methods. Based on this summary, the trends, pros and cons of the existing studies, issues to be addressed and valuable research directions in future work are discussed. The results of this paper can provide valuable reference information for researchers in related fields.

Keywords