IEEE Access (Jan 2022)

How Much a Model be Trained by Passive Learning Before Active Learning?

  • Dae Ung Jo,
  • Sangdoo Yun,
  • Jin Young Choi

DOI
https://doi.org/10.1109/ACCESS.2022.3162253
Journal volume & issue
Vol. 10
pp. 34677 – 34689

Abstract

Most studies of pool-based active learning have focused on the query strategy. In this paper, through an empirical analysis of the effect of passive learning performed before active learning begins, we show that the amount of data acquired by passive learning significantly affects the performance of active learning algorithms. We also confirm that the best amount of data to acquire by passive learning depends on the given setting: network complexity, query strategy, and dataset. Motivated by these observations, we propose a method that automatically determines the starting point of active learning for a given setting. To this end, we propose the entropy of sample-uncertainty as a measure of how far the target model has been trained, and we develop three empirical formulas for determining the entropy of sample-uncertainty that passive learning should reach before active learning starts. The effectiveness of the proposed method is validated by extensive experiments on popular image classification benchmarks and query strategies.
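
The abstract does not define the entropy of sample-uncertainty or the three empirical formulas, so the following Python sketch is only one plausible reading, not the authors' implementation. It assumes that a sample's uncertainty is the Shannon entropy of the model's softmax output, that the entropy of sample-uncertainty is the entropy of a histogram of those values over the unlabeled pool, and that passive learning means labeling randomly selected pool samples. The names `predictive_entropy`, `entropy_of_sample_uncertainty`, `query`, and the `num_bins` parameter are hypothetical, and the pool is assumed to have at least two classes.

```python
import math
import random


def predictive_entropy(probs):
    """Shannon entropy (nats) of one softmax output, used as sample uncertainty."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def entropy_of_sample_uncertainty(pool_probs, num_bins=10):
    """Entropy of the histogram of per-sample uncertainties over the pool.

    Assumption (not from the paper): uncertainties are binned uniformly
    over [0, log(num_classes)], the range of the predictive entropy.
    """
    num_classes = len(pool_probs[0])
    max_u = math.log(num_classes)
    counts = [0] * num_bins
    for probs in pool_probs:
        u = predictive_entropy(probs)
        # Clamp so that the maximum uncertainty falls in the last bin.
        idx = min(int(u / max_u * num_bins), num_bins - 1)
        counts[idx] += 1
    total = len(pool_probs)
    return -sum((c / total) * math.log(c / total) for c in counts if c > 0)


def query(pool_probs, batch_size, active, rng=random):
    """Pick pool indices to label: random (passive) or most uncertain (active)."""
    indices = list(range(len(pool_probs)))
    if not active:
        return rng.sample(indices, batch_size)
    indices.sort(key=lambda i: predictive_entropy(pool_probs[i]), reverse=True)
    return indices[:batch_size]
```

In a training loop, one would call `query(..., active=False)` each round until `entropy_of_sample_uncertainty` reaches a target value and then switch to `active=True`. How that target is chosen is what the paper's empirical formulas address, and it is deliberately left out of this sketch.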

Keywords