CRAS: Curriculum Regularization and Adaptive Semi-Supervised Learning with Noisy Labels

Ryota Higashimoto; Soh Yoshida; Mitsuji Muneyasu

doi:10.3390/app14031208

Applied Sciences (Jan 2024)

CRAS: Curriculum Regularization and Adaptive Semi-Supervised Learning with Noisy Labels

Ryota Higashimoto,
Soh Yoshida,
Mitsuji Muneyasu

Affiliations

Ryota Higashimoto: Graduate School of Science and Engineering, Kansai University, 3-3-35 Yamate-cho, Suita-shi 564-8680, Osaka, Japan
Soh Yoshida: Faculty of Engineering Science, Kansai University, Suita-shi 564-8680, Osaka, Japan
Mitsuji Muneyasu: Faculty of Engineering Science, Kansai University, Suita-shi 564-8680, Osaka, Japan

DOI: https://doi.org/10.3390/app14031208
Journal volume & issue: Vol. 14, no. 3
p. 1208

Abstract

Read online

This paper addresses the performance degradation of deep neural networks caused by learning with noisy labels. Recent research on this topic has exploited the memorization effect: networks fit data with clean labels during the early stages of learning and eventually memorize data with noisy labels. This property allows for the separation of clean and noisy samples from a loss distribution. In recent years, semi-supervised learning, which divides training data into a set of labeled clean samples and a set of unlabeled noisy samples, has achieved impressive results. However, this strategy has two significant problems: (1) the accuracy of dividing the data into clean and noisy samples depends strongly on the network’s performance, and (2) if the divided data are biased towards the unlabeled samples, there are few labeled samples, causing the network to overfit to the labels and leading to a poor generalization performance. To solve these problems, we propose the curriculum regularization and adaptive semi-supervised learning (CRAS) method. Its key ideas are (1) to train the network with robust regularization techniques as a warm-up before dividing the data, and (2) to control the strength of the regularization using loss weights that adaptively respond to data bias, which varies with each split at each training epoch. We evaluated the performance of CRAS on benchmark image classification datasets, CIFAR-10 and CIFAR-100, and real-world datasets, mini-WebVision and Clothing1M. The findings demonstrate that CRAS excels in handling noisy labels, resulting in a superior generalization and robustness to a range of noise rates, compared with the existing method.

Published in Applied Sciences

ISSN: 2076-3417 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Engineering (General). Civil engineering (General); Science: Biology (General); Science: Physics; Science: Chemistry
Website: http://www.mdpi.com/journal/applsci

About the journal

Abstract

Keywords