Investigating Credit Card Payment Fraud with Detection Methods Using Advanced Machine Learning

Victor Chang; Basit Ali; Lewis Golightly; Meghana Ashok Ganatra; Muhidin Mohamed

doi:10.3390/info15080478

Information (Aug 2024)

Investigating Credit Card Payment Fraud with Detection Methods Using Advanced Machine Learning

Victor Chang,
Basit Ali,
Lewis Golightly,
Meghana Ashok Ganatra,
Muhidin Mohamed

Affiliations

Victor Chang: Department of Operations and Information Management, Aston University, Birmingham B4 7ET, UK
Basit Ali: Department of Operations and Information Management, Aston University, Birmingham B4 7ET, UK
Lewis Golightly: Department of Computing and Games, Teesside University, Middlesbrough TS1 3BX, UK
Meghana Ashok Ganatra: Department of Operations and Information Management, Aston University, Birmingham B4 7ET, UK
Muhidin Mohamed: Department of Operations and Information Management, Aston University, Birmingham B4 7ET, UK

DOI: https://doi.org/10.3390/info15080478
Journal volume & issue: Vol. 15, no. 8
p. 478

Abstract

Read online

In the cybersecurity industry, where legitimate transactions far outnumber fraudulent ones, detecting fraud is of paramount significance. In order to evaluate the accuracy of detecting fraudulent transactions in imbalanced real datasets, this study compares the efficacy of two approaches, random under-sampling and oversampling, using the synthetic minority over-sampling technique (SMOTE). Random under-sampling aims for fairness by excluding examples from the majority class, but this compromises precision in favor of recall. To strike a balance and ensure statistical significance, SMOTE was used instead to produce artificial examples of the minority class. Based on the data obtained, it is clear that random under-sampling achieves high recall (92.86%) at the expense of low precision, whereas SMOTE achieves a higher accuracy (86.75%) and a more even F1 score (73.47%) at the expense of a slightly lower recall. As true fraudulent transactions require at least two methods for verification, we investigated different machine learning methods and made suitable balances between accuracy, F1 score, and recall. Our comparison sheds light on the subtleties and ramifications of each approach, allowing professionals in the field of cybersecurity to better choose the approach that best meets the needs of their own firm. This research highlights the need to resolve class imbalances for effective fraud detection in cybersecurity, as well as the need for constant monitoring and the investigation of new approaches to increase applicability.

Published in Information

ISSN: 2078-2489 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Technology (General): Industrial engineering. Management engineering: Information technology
Website: http://www.mdpi.com/journal/information/

About the journal

Abstract

Keywords