Deep Learning for Phishing Detection: Taxonomy, Current Challenges and Future Directions

Nguyet Quang Do; Ali Selamat; Ondrej Krejcar; Enrique Herrera-Viedma; Hamido Fujita

doi:10.1109/ACCESS.2022.3151903

IEEE Access (Jan 2022)

Deep Learning for Phishing Detection: Taxonomy, Current Challenges and Future Directions

Nguyet Quang Do,
Ali Selamat,
Ondrej Krejcar,
Enrique Herrera-Viedma,
Hamido Fujita

Affiliations

Nguyet Quang Do: ORCiD; Malaysia–Japan International Institute of Technology, Universiti Teknologi Malaysia, Kuala Lumpur, Wilayah Persekutuan Kuala Lumpur, Kuala Lumpur, Malaysia
Ali Selamat: ORCiD; Malaysia–Japan International Institute of Technology, Universiti Teknologi Malaysia, Kuala Lumpur, Wilayah Persekutuan Kuala Lumpur, Kuala Lumpur, Malaysia
Ondrej Krejcar: ORCiD; Malaysia–Japan International Institute of Technology, Universiti Teknologi Malaysia, Kuala Lumpur, Wilayah Persekutuan Kuala Lumpur, Kuala Lumpur, Malaysia
Enrique Herrera-Viedma: ORCiD; Andalusian Research Institute in Data Science and Computational Intelligence (DaSCI), University of Granada, Granada, Spain
Hamido Fujita: ORCiD; Malaysia–Japan International Institute of Technology, Universiti Teknologi Malaysia, Kuala Lumpur, Wilayah Persekutuan Kuala Lumpur, Kuala Lumpur, Malaysia

DOI: https://doi.org/10.1109/ACCESS.2022.3151903
Journal volume & issue: Vol. 10
pp. 36429 – 36463

Abstract

Read online

Phishing has become an increasing concern and captured the attention of end-users as well as security experts. Existing phishing detection techniques still suffer from the deficiency in performance accuracy and inability to detect unknown attacks despite decades of development and improvement. Motivated to solve these problems, many researchers in the cybersecurity domain have shifted their attention to phishing detection that capitalizes on machine learning techniques. Deep learning has emerged as a branch of machine learning that becomes a promising solution for phishing detection in recent years. As a result, this study proposes a taxonomy of deep learning algorithm for phishing detection by examining 81 selected papers using a systematic literature review approach. The paper first introduces the concept of phishing and deep learning in the context of cybersecurity. Then, taxonomies of phishing detection and deep learning algorithm are provided to classify the existing literature into various categories. Next, taking the proposed taxonomy as a baseline, this study comprehensively reviews the state-of-the-art deep learning techniques and analyzes their advantages as well as disadvantages. Subsequently, the paper discusses various issues that deep learning faces in phishing detection and proposes future research directions to overcome these challenges. Finally, an empirical analysis is conducted to evaluate the performance of various deep learning techniques in a practical context, and to highlight the related issues that motivate researchers in their future works. The results obtained from the empirical experiment showed that the common issues among most of the state-of-the-art deep learning algorithms are manual parameter-tuning, long training time, and deficient detection accuracy.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords