Learning From Imbalanced Data Using Triplet Adversarial Samples

Jaesub Yun; Jong-Seok Lee

doi:10.1109/ACCESS.2023.3262604

IEEE Access (Jan 2023)

Learning From Imbalanced Data Using Triplet Adversarial Samples

Jaesub Yun,
Jong-Seok Lee

Affiliations

Jaesub Yun: Department of Industrial Engineering, Sungkyunkwan University, Suwon, Republic of Korea
Jong-Seok Lee: ORCiD; Department of Industrial Engineering, Sungkyunkwan University, Suwon, Republic of Korea

DOI: https://doi.org/10.1109/ACCESS.2023.3262604
Journal volume & issue: Vol. 11
pp. 31467 – 31478

Abstract

Read online

The imbalance of classes in real-world datasets poses a major challenge in machine learning and classification, and traditional synthetic data generation methods often fail to address this problem effectively. A major limitation of these methods is that they tend to separate the process of generating synthetic samples from the training process, resulting in synthetic data that lack the necessary informative characteristics for proper model training. We present a new synthetic data generation method that addresses this issue by combining adversarial sample generation with a triplet loss method. This approach focuses on increasing the diversity in the minority class while preserving the integrity of the decision boundary. Furthermore, we show that reducing triplet loss is equivalent to maximizing the area under the receiver operating characteristic curve under specific conditions, providing a theoretical basis for the effectiveness of our method. In addition, we present a model training approach to further improve the generalization of the model to small classes by providing a diverse set of synthetic samples optimized using our proposed loss function. We evaluated our method on several imbalanced benchmark tasks and compared it to state-of-the-art techniques, demonstrating that our method can deliver even better performance, making it an effective solution to the class imbalance problem.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords