Journal of Big Data (Apr 2023)

Evaluating classifier performance with highly imbalanced Big Data

  • John T. Hancock,
  • Taghi M. Khoshgoftaar,
  • Justin M. Johnson

DOI
https://doi.org/10.1186/s40537-023-00724-5
Journal volume & issue
Vol. 10, no. 1
pp. 1 – 31

Abstract

Using the wrong metrics to gauge classification of highly imbalanced Big Data may hide important information in experimental results. However, we find that analysis of performance evaluation metrics, and of what they can hide or reveal, is rarely covered in related works. Therefore, we address that gap by analyzing multiple popular performance metrics on three Big Data classification tasks. To the best of our knowledge, we are the first to utilize three new Medicare insurance claims datasets that became publicly available in 2021. These datasets are all highly imbalanced. Furthermore, the datasets are composed of completely different data. We evaluate the performance of five ensemble learners in the Machine Learning task of Medicare fraud detection. Random Undersampling (RUS) is applied to induce five class ratios. The classifiers are evaluated with both the Area Under the Receiver Operating Characteristic Curve (AUC) and the Area Under the Precision-Recall Curve (AUPRC) metrics. We show that AUPRC provides better insight into classification performance. Our findings reveal that the AUC metric hides the performance impact of RUS, whereas classification results in terms of AUPRC show that RUS has a detrimental effect. We show that, for highly imbalanced Big Data, the AUC metric fails to capture information about precision scores and false positive counts that the AUPRC metric reveals. Our contribution is to show that AUPRC is a more effective metric for evaluating classifier performance when working with highly imbalanced Big Data.
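As an illustration of the comparison the abstract describes, the minimal sketch below applies Random Undersampling to an imbalanced training set and reports both AUC and AUPRC (via average precision) on a held-out test set. It is not the paper's experiment: the synthetic dataset, the RandomForestClassifier, and the 1:1 sampling ratio are hypothetical stand-ins for the Medicare claims data, the five ensemble learners, and the five class ratios used in the study.

```python
# Illustrative sketch only (not the paper's code): contrast AUC and AUPRC
# on a highly imbalanced binary task after Random Undersampling (RUS).
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import roc_auc_score, average_precision_score
from sklearn.model_selection import train_test_split
from imblearn.under_sampling import RandomUnderSampler

# Simulate a highly imbalanced problem (~0.1% positive class).
X, y = make_classification(n_samples=100_000, n_features=20,
                           weights=[0.999], flip_y=0.01, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, stratify=y, test_size=0.3, random_state=42)

# Apply RUS to the training set only, inducing a 1:1 class ratio
# (a hypothetical choice; the study evaluates five ratios).
rus = RandomUnderSampler(sampling_strategy=1.0, random_state=42)
X_rus, y_rus = rus.fit_resample(X_train, y_train)

# A stand-in ensemble learner for the classifiers used in the study.
clf = RandomForestClassifier(n_estimators=100, random_state=42)
clf.fit(X_rus, y_rus)
scores = clf.predict_proba(X_test)[:, 1]

# AUC can remain high even when the positive-class precision is poor;
# AUPRC reflects the precision / false-positive trade-off directly.
print("AUC  :", roc_auc_score(y_test, scores))
print("AUPRC:", average_precision_score(y_test, scores))
```

On data this skewed, the two printed values typically diverge sharply, which is the kind of gap between AUC and AUPRC the abstract refers to.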

Keywords