IEEE Access (Jan 2024)
Overcoming Overconfidence for Active Learning
Abstract
Recent advances in artificial intelligence depend heavily on vast amounts of high-quality data. However, a persistent challenge is the limited budget allocated for data labeling. Active learning is a prominent and efficient strategy for addressing this: a model iteratively selects valuable data for labeling and is then updated with the newly labeled data. However, the limited data available in each iteration renders the model susceptible to bias, resulting in overconfident predictions. To mitigate this issue, we propose the Overcoming Overconfidence for Active Learning (OO4AL) framework. It comprises two components: Cross-Mix-and-Mix, an augmentation strategy that broadens the training distribution to calibrate the model, and Ranked Margin Sampling, a selection strategy that prevents the selection of overconfidence-inducing data by evaluating the model's predictions. Through comprehensive experiments and analyses, we demonstrate that our framework enables efficient data selection by reducing overconfidence while remaining simple to implement.
Keywords