IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing (Jan 2023)
Brain-Inspired Remote Sensing Foundation Models and Open Problems: A Comprehensive Survey
- Licheng Jiao,
- Zhongjian Huang,
- Xiaoqiang Lu,
- Xu Liu,
- Yuting Yang,
- Jiaxuan Zhao,
- Jinyue Zhang,
- Biao Hou,
- Shuyuan Yang,
- Fang Liu,
- Wenping Ma,
- Lingling Li,
- Xiangrong Zhang,
- Puhua Chen,
- Zhixi Feng,
- Xu Tang,
- Yuwei Guo,
- Dou Quan,
- Shuang Wang,
- Weibin Li,
- Jing Bai,
- Yangyang Li,
- Ronghua Shang,
- Jie Feng
Affiliations
- Licheng Jiao
- ORCiD
- Key Laboratory of Intelligent Perception and Image Understanding, Ministry of Education of China, International Research Center of Intelligent Perception and Computation, School of Artificial Intelligence, Xidian University, Xi'an, China
- Zhongjian Huang
- ORCiD
- Key Laboratory of Intelligent Perception and Image Understanding, Ministry of Education of China, International Research Center of Intelligent Perception and Computation, School of Artificial Intelligence, Xidian University, Xi'an, China
- Xiaoqiang Lu
- ORCiD
- Key Laboratory of Intelligent Perception and Image Understanding, Ministry of Education of China, International Research Center of Intelligent Perception and Computation, School of Artificial Intelligence, Xidian University, Xi'an, China
- Xu Liu
- ORCiD
- Key Laboratory of Intelligent Perception and Image Understanding, Ministry of Education of China, International Research Center of Intelligent Perception and Computation, School of Artificial Intelligence, Xidian University, Xi'an, China
- Yuting Yang
- ORCiD
- Key Laboratory of Intelligent Perception and Image Understanding, Ministry of Education of China, International Research Center of Intelligent Perception and Computation, School of Artificial Intelligence, Xidian University, Xi'an, China
- Jiaxuan Zhao
- ORCiD
- Key Laboratory of Intelligent Perception and Image Understanding, Ministry of Education of China, International Research Center of Intelligent Perception and Computation, School of Artificial Intelligence, Xidian University, Xi'an, China
- Jinyue Zhang
- ORCiD
- Key Laboratory of Intelligent Perception and Image Understanding, Ministry of Education of China, International Research Center of Intelligent Perception and Computation, School of Artificial Intelligence, Xidian University, Xi'an, China
- Biao Hou
- ORCiD
- Key Laboratory of Intelligent Perception and Image Understanding, Ministry of Education of China, International Research Center of Intelligent Perception and Computation, School of Artificial Intelligence, Xidian University, Xi'an, China
- Shuyuan Yang
- ORCiD
- Key Laboratory of Intelligent Perception and Image Understanding, Ministry of Education of China, International Research Center of Intelligent Perception and Computation, School of Artificial Intelligence, Xidian University, Xi'an, China
- Fang Liu
- ORCiD
- Key Laboratory of Intelligent Perception and Image Understanding, Ministry of Education of China, International Research Center of Intelligent Perception and Computation, School of Artificial Intelligence, Xidian University, Xi'an, China
- Wenping Ma
- ORCiD
- Key Laboratory of Intelligent Perception and Image Understanding, Ministry of Education of China, International Research Center of Intelligent Perception and Computation, School of Artificial Intelligence, Xidian University, Xi'an, China
- Lingling Li
- ORCiD
- Key Laboratory of Intelligent Perception and Image Understanding, Ministry of Education of China, International Research Center of Intelligent Perception and Computation, School of Artificial Intelligence, Xidian University, Xi'an, China
- Xiangrong Zhang
- ORCiD
- Key Laboratory of Intelligent Perception and Image Understanding, Ministry of Education of China, International Research Center of Intelligent Perception and Computation, School of Artificial Intelligence, Xidian University, Xi'an, China
- Puhua Chen
- ORCiD
- Key Laboratory of Intelligent Perception and Image Understanding, Ministry of Education of China, International Research Center of Intelligent Perception and Computation, School of Artificial Intelligence, Xidian University, Xi'an, China
- Zhixi Feng
- ORCiD
- Key Laboratory of Intelligent Perception and Image Understanding, Ministry of Education of China, International Research Center of Intelligent Perception and Computation, School of Artificial Intelligence, Xidian University, Xi'an, China
- Xu Tang
- ORCiD
- Key Laboratory of Intelligent Perception and Image Understanding, Ministry of Education of China, International Research Center of Intelligent Perception and Computation, School of Artificial Intelligence, Xidian University, Xi'an, China
- Yuwei Guo
- ORCiD
- Key Laboratory of Intelligent Perception and Image Understanding, Ministry of Education of China, International Research Center of Intelligent Perception and Computation, School of Artificial Intelligence, Xidian University, Xi'an, China
- Dou Quan
- ORCiD
- Key Laboratory of Intelligent Perception and Image Understanding, Ministry of Education of China, International Research Center of Intelligent Perception and Computation, School of Artificial Intelligence, Xidian University, Xi'an, China
- Shuang Wang
- ORCiD
- Key Laboratory of Intelligent Perception and Image Understanding, Ministry of Education of China, International Research Center of Intelligent Perception and Computation, School of Artificial Intelligence, Xidian University, Xi'an, China
- Weibin Li
- ORCiD
- Key Laboratory of Intelligent Perception and Image Understanding, Ministry of Education of China, International Research Center of Intelligent Perception and Computation, School of Artificial Intelligence, Xidian University, Xi'an, China
- Jing Bai
- ORCiD
- Key Laboratory of Intelligent Perception and Image Understanding, Ministry of Education of China, International Research Center of Intelligent Perception and Computation, School of Artificial Intelligence, Xidian University, Xi'an, China
- Yangyang Li
- ORCiD
- Key Laboratory of Intelligent Perception and Image Understanding, Ministry of Education of China, International Research Center of Intelligent Perception and Computation, School of Artificial Intelligence, Xidian University, Xi'an, China
- Ronghua Shang
- ORCiD
- Key Laboratory of Intelligent Perception and Image Understanding, Ministry of Education of China, International Research Center of Intelligent Perception and Computation, School of Artificial Intelligence, Xidian University, Xi'an, China
- Jie Feng
- ORCiD
- Key Laboratory of Intelligent Perception and Image Understanding, Ministry of Education of China, International Research Center of Intelligent Perception and Computation, School of Artificial Intelligence, Xidian University, Xi'an, China
- DOI
- https://doi.org/10.1109/JSTARS.2023.3316302
- Journal volume & issue
-
Vol. 16
pp. 10084 – 10120
Abstract
The foundation model (FM) has garnered significant attention for its remarkable transfer performance in downstream tasks. Typically, it undergoes task-agnostic pretraining on a large dataset and can be efficiently adapted to various downstream applications through fine-tuning. While FMs have been extensively explored in language and other domains, their potential in remote sensing has also begun to attract scholarly interest. However, comprehensive investigations and performance comparisons of these models on remote sensing tasks are currently lacking. In this survey, we provide essential background knowledge by introducing key technologies and recent developments in FMs. Subsequently, we explore essential downstream applications in remote sensing, covering classification, localization, and understanding. Our analysis encompasses over 30 FMs in both natural and remote sensing fields, and we conduct extensive experiments on more than 10 datasets, evaluating global feature representation, local feature representation, and target localization. Through quantitative assessments, we highlight the distinctions among various FMs and confirm that pretrained large-scale natural FMs can also deliver outstanding performance in remote sensing tasks. After that, we systematically presented a brain-inspired framework for remote sensing foundation models (RSFMs). We delve into the brain-inspired characteristics in this framework, including structure, perception, learning, and cognition. To conclude, we summarize 12 open problems in RSFMs, providing potential research directions. Our survey offers valuable insights into the burgeoning field of RSFMs and aims to foster further advancements in this exciting area.
Keywords