Geo-spatial Information Science (Sep 2024)
Modeling information flow from multispectral remote sensing images to land use and land cover maps for understanding classification mechanism
Abstract
Information on Land Use and Land Cover Map (LULCM) is essential for environment and socioeconomic applications. Such maps are generally derived from Multispectral Remote Sensing Images (MRSI) via classification. The classification process can be described as information flow from images to maps through a trained classifier. Characterizing the information flow is essential for understanding the classification mechanism, providing solutions that address such theoretical issues as “what is the maximum number of classes that can be classified from a given MRSI?” and “how much information gain can be obtained?” Consequently, two interesting questions naturally arise, i.e. (i) How can we characterize the information flow? and (ii) What is the mathematical form of the information flow? To answer these two questions, this study first hypothesizes that thermodynamic entropy is the appropriate measure of information for both MRSI and LULCM. This hypothesis is then supported by kinetic-theory-based experiments. Thereafter, upon such an entropy, a generalized Jarzynski equation is formulated to mathematically model the information flow, which contains such parameters as thermodynamic entropy of MRSI, thermodynamic entropy of LULCM, weighted F1-score (classification accuracy), and total number of classes. This generalized Jarzynski equation has been successfully validated by hypothesis-driven experiments where 694 Sentinel-2 images are classified into 10 classes by four classical classifiers. This study provides a way for linking thermodynamic laws and concepts to the characterization and understanding of information flow in land cover classification, opening a new door for constructing domain knowledge.
Keywords