The illusion of success: Test set disproportion causes inflated accuracy in remote sensing mapping research

Yuanjun Xiao; Zhen Zhao; Jingfeng Huang; Ran Huang; Wei Weng; Gerui Liang; Chang Zhou; Qi Shao; Qiyu Tian

International Journal of Applied Earth Observations and Geoinformation (Dec 2024)

The illusion of success: Test set disproportion causes inflated accuracy in remote sensing mapping research

Yuanjun Xiao,
Zhen Zhao,
Jingfeng Huang,
Ran Huang,
Wei Weng,
Gerui Liang,
Chang Zhou,
Qi Shao,
Qiyu Tian

Affiliations

Yuanjun Xiao: Key Laboratory of Environmental Remediation and Ecological Health, Ministry of Education, Zhejiang University, Hangzhou, 310058, China; Institute of Applied Remote Sensing and Information Technology, Zhejiang University, Hangzhou, 310058, China; Key Laboratory of Agricultural Remote Sensing and Information Systems, Hangzhou, 310058, China
Zhen Zhao: Key Laboratory of Environmental Remediation and Ecological Health, Ministry of Education, Zhejiang University, Hangzhou, 310058, China; Institute of Applied Remote Sensing and Information Technology, Zhejiang University, Hangzhou, 310058, China; Key Laboratory of Agricultural Remote Sensing and Information Systems, Hangzhou, 310058, China
Jingfeng Huang: Key Laboratory of Environmental Remediation and Ecological Health, Ministry of Education, Zhejiang University, Hangzhou, 310058, China; Institute of Applied Remote Sensing and Information Technology, Zhejiang University, Hangzhou, 310058, China; Key Laboratory of Agricultural Remote Sensing and Information Systems, Hangzhou, 310058, China; Corresponding author at: Key Laboratory of Environmental Remediation and Ecological Health, Ministry of Education, Zhejiang University, Hangzhou, 310058, China.
Ran Huang: School of Automation, Hangzhou Dianzi University, Xiasha Higher Education Zone, Hangzhou, 310058, China
Wei Weng: Key Laboratory of Environmental Remediation and Ecological Health, Ministry of Education, Zhejiang University, Hangzhou, 310058, China; Institute of Applied Remote Sensing and Information Technology, Zhejiang University, Hangzhou, 310058, China; Key Laboratory of Agricultural Remote Sensing and Information Systems, Hangzhou, 310058, China
Gerui Liang: College of Life Science, Zhejiang University, Hangzhou, 310058, China
Chang Zhou: Key Laboratory of Environmental Remediation and Ecological Health, Ministry of Education, Zhejiang University, Hangzhou, 310058, China; Institute of Applied Remote Sensing and Information Technology, Zhejiang University, Hangzhou, 310058, China; Key Laboratory of Agricultural Remote Sensing and Information Systems, Hangzhou, 310058, China
Qi Shao: Key Laboratory of Environmental Remediation and Ecological Health, Ministry of Education, Zhejiang University, Hangzhou, 310058, China; Institute of Applied Remote Sensing and Information Technology, Zhejiang University, Hangzhou, 310058, China; Key Laboratory of Agricultural Remote Sensing and Information Systems, Hangzhou, 310058, China
Qiyu Tian: College of Biosystems Engineering and Food Science, Zhejiang University, Hangzhou, 310058, China

Journal volume & issue: Vol. 135
p. 104256

Abstract

Read online

In remote sensing mapping studies, selecting an appropriate test set to accurately evaluate the results is critical. An imprecise accuracy assessment can be misleading and fail to validate the applicability of mapping products. Commencing with the WHU-Hi-HanChuan dataset, this paper revealed the impact of sample size ratios in test sets on accuracy metrics by generating a series of test sets with varying ratios of positive and negative sample size to evaluate the same map. A rigorous approach for accuracy assessment was suggested, and an example of tea plantations mapping is used to demonstrate the process and analyse potential issues in traditional approaches. A scale factor (λ) was constructed to measure the discrepancy in sample size ratios between test sets and actual conditions. Accuracy adjustment formulas were developed and applied to adjust the accuracy of 42 previous maps based on the λ. Results showed a higher ratio of positive to negative sample size in test set led to inflated user’s accuracy (UA), F1-score (F1) and overall accuracy (OA), but had little impact on producer’s accuracy. When the ratio aligned with that in the target area, the UA, F1, and OA closely matched the true values, indicating the proportion of positive and negative samples in test set should be consistent with that in actual situation. The accuracies reported by the traditional approaches including test set sampling from labelled data and 5-fold cross validation were far from the true accuracy and could not reflect the performance of the map. Among 42 previous maps, nearly 60% of the maps had UAs overestimated by 10%, and 9.5% of the maps had UAs and F1s deviations of more than 25%. The conclusions of this study provide a clear caution for future mapping research and assist in producing and identifying truly excellent maps.

Published in International Journal of Applied Earth Observations and Geoinformation

ISSN: 1569-8432 (Print); 1872-826X (Online)
Publisher: Elsevier
Country of publisher: Netherlands
LCC subjects: Geography. Anthropology. Recreation: Physical geography; Geography. Anthropology. Recreation: Environmental sciences
Website: https://www.journals.elsevier.com/international-journal-of-applied-earth-observation-and-geoinformation

About the journal

Abstract

Keywords