Applied Sciences (Nov 2021)

Leveraging Expert Knowledge for Label Noise Mitigation in Machine Learning

  • Quoc Nguyen,
  • Tomoaki Shikina,
  • Daichi Teruya,
  • Seiji Hotta,
  • Huy-Dung Han,
  • Hironori Nakajo

DOI
https://doi.org/10.3390/app112211040
Journal volume & issue
Vol. 11, no. 22
p. 11040

Abstract

Read online

In training-based Machine Learning applications, the training data are frequently labeled by non-experts and expose substantial label noise which greatly alters the training models. In this work, a novel method for reducing the effect of label noise is introduced. The rules are created from expert knowledge to identify the incorrect non-expert training data. Using the gradient descent algorithm, the violating data samples are weighted less to mitigate their effects during model training. The proposed method is applied to the image classification problem using Manga109 and CIFAR-10 dataset. The experiments show that when the noise level is up to 50% our proposed method significantly increases the accuracy of the model compared to conventional learning methods.

Keywords