IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing (Jan 2024)
Instance-Dependent Multilabel Noise Generation for Multilabel Remote Sensing Image Classification
Abstract
Multilabel remote sensing image classification is a fundamental task that classifies multiple objects and land covers within an image. However, training deep learning models for this task requires a considerable cost of labeling. While several efforts to reduce labeling costs have been made, they often result in decreased label quality and the inclusion of incorrect (i.e., noisy) labels. To tackle this issue, algorithms for training deep learning models robust to multilabel noise have been proposed in the literature. Nonetheless, the efficacy of these algorithms has been evaluated only under instance-independent multilabel noise, where noise is generated regardless of the individual characteristics and features of each remote sensing image. In this article, we introduce generating instance-dependent multilabel noise into multilabel remote sensing image datasets for the first time. We leverage a vision-language model with zero-shot prediction capabilities to compute categorywise prediction scores for each image, based on which we generate multilabel noise in an instance-dependent manner. We demonstrate that the proposed instance-dependent multilabel noise is more feasibly generated with respect to individual images compared to traditional instance-independent multilabel noise. We also demonstrate that a more challenging noise scenario is generated, which leads to a more complex decision boundary and stronger overfitting during deep learning model training. Finally, we re-evaluate existing noise-robust training algorithms under the generated instance-dependent multilabel noise and observe that several algorithms exhibit limited robustness against instance-dependent multilabel noise.
Keywords