Unbalanced Web Phishing Classification through Deep Reinforcement Learning

Antonio Maci; Alessandro Santorsola; Antonio Coscia; Andrea Iannacone

doi:10.3390/computers12060118

Computers (Jun 2023)

Affiliations

Antonio Maci: Cybersecurity Laboratory, BV TECH S.p.A., 20123 Milan, Italy
Alessandro Santorsola: Cybersecurity Laboratory, BV TECH S.p.A., 20123 Milan, Italy
Antonio Coscia: Cybersecurity Laboratory, BV TECH S.p.A., 20123 Milan, Italy
Andrea Iannacone: Cybersecurity Laboratory, BV TECH S.p.A., 20123 Milan, Italy

DOI: https://doi.org/10.3390/computers12060118
Journal volume & issue: Vol. 12, no. 6, p. 118

Abstract

Web phishing is a form of cybercrime that tricks people into visiting malicious URLs in order to exfiltrate sensitive data. Since the structure of a malicious URL evolves over time, phishing detection mechanisms that can adapt to such variations are paramount. Furthermore, web phishing detection is an unbalanced classification task, as legitimate URLs outnumber malicious ones in real-life cases. Deep learning (DL) has emerged as a promising technique for coping with concept drift and thereby enhancing web phishing detection. Deep reinforcement learning (DRL) combines DL with reinforcement learning (RL), a sequential decision-making paradigm in which the problem to be addressed is expressed as a Markov decision process (MDP). Recent studies have proposed an ad hoc MDP formulation for unbalanced classification tasks, called the imbalanced classification Markov decision process (ICMDP). In this paper, we exploit the ICMDP to present a double deep Q-Network (DDQN)-based classifier that addresses the unbalanced web phishing classification problem. The proposed algorithm is evaluated on a Mendeley web phishing dataset, from which three different data imbalance scenarios are generated. Despite a significant training time, it achieves a better geometric mean, index of balanced accuracy, F1 score, and area under the ROC curve than other DL-based classifiers combined with data-level sampling techniques in all test cases.
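
To make the ICMDP idea more concrete, the following is a minimal sketch of the kind of reward scheme commonly associated with that formulation, together with the geometric mean used as an evaluation metric. It is not taken from the paper: the label encoding, the function names icmdp_reward and geometric_mean, the reward magnitudes, and the rule that an episode ends when a minority (phishing) sample is misclassified are all assumptions made for illustration. The parameter lam stands for a scaling factor that shrinks the reward earned on the majority (legitimate) class.

```python
import math

# Assumed label encoding (hypothetical, not specified in the abstract).
PHISHING = 1    # minority class
LEGITIMATE = 0  # majority class


def icmdp_reward(action, label, lam):
    """Return (reward, terminal) for one classification step.

    Assumption: minority samples yield +1 / -1, majority samples yield
    +lam / -lam, and misclassifying a minority sample ends the episode.
    """
    correct = action == label
    if label == PHISHING:
        return (1.0 if correct else -1.0), (not correct)
    return (lam if correct else -lam), False


def geometric_mean(y_true, y_pred):
    """Square root of (true positive rate * true negative rate)."""
    pairs = list(zip(y_true, y_pred))
    pos = sum(1 for t, _ in pairs if t == PHISHING)
    neg = sum(1 for t, _ in pairs if t == LEGITIMATE)
    if pos == 0 or neg == 0:
        return 0.0  # metric is undefined without both classes; report 0
    tp = sum(1 for t, p in pairs if t == PHISHING and p == PHISHING)
    tn = sum(1 for t, p in pairs if t == LEGITIMATE and p == LEGITIMATE)
    return math.sqrt((tp / pos) * (tn / neg))
```

Under these assumptions, an error on a phishing URL costs the agent more than an error on a legitimate one whenever lam is below 1, which is how a reward-based formulation can counteract class imbalance without resampling the data.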

Published in Computers

ISSN: 2073-431X (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: http://www.mdpi.com/journal/computers
