Jisuanji kexue yu tansuo (Jun 2022)

Survey on Pseudo-Labeling Methods in Deep Semi-supervised Learning

  • LIU Yafen, ZHENG Yifeng, JIANG Lingyi, LI Guohe, ZHANG Wenjie

DOI
https://doi.org/10.3778/j.issn.1673-9418.2111144
Journal volume & issue
Vol. 16, no. 6
pp. 1279 – 1290

Abstract

Read online

With the development of intelligent technology, deep learning has become a hot topic in machine learning. It is playing a more and more important role in various fields. Deep learning requires a lot of labeled data to imp-rove model performance. Therefore, researchers effectively combine semi-supervised learning with deep learning to solve the labeled data problem. It utilizes a small amount of labeled data and a large amount of unlabeled data to build the model simultaneously. It can help to expand the sample space. In view of its theoretical significance and practical application value, this paper focuses on the pseudo-labeling methods as the starting point. Firstly, deep semi-supervised learning is introduced and the advantage of pseudo-labeling methods is pointed out. Secondly, the pseudo-labeling methods are described from self-training and multi-view training and the existing model is comprehensively analyzed. And then, the label propagation method based on graph and pseudo-labeling is introduced. Furthermore, the existing pseudo-labeling methods are analyzed and compared. Finally, the problems and future research direction of pseudo-labeling methods are summarized from the utility of unlabeled data, noise data, rationality, and the combi-nation of pseudo-labeling methods.

Keywords