Supervised methods of machine learning for email classification: a literature survey

Muath AlShaikh; Yasser Alrajeh; Sultan Alamri; Suhib Melhem; Ahmed Abu-Khadrah

doi:10.1080/21642583.2025.2474450

Systems Science & Control Engineering (Dec 2025)

Affiliations

Muath AlShaikh: Cybersecurity Department, College of Engineering, Al Ain University, Abu Dhabi, UAE
Yasser Alrajeh: Department of Computer Science, College of Computing and Informatics, Saudi Electronic University, Riyadh, Kingdom of Saudi Arabia
Sultan Alamri: Department of Computer Science, College of Computing and Informatics, Saudi Electronic University, Riyadh, Kingdom of Saudi Arabia
Suhib Melhem: Cybersecurity Department, College of Engineering, Al Ain University, Abu Dhabi, UAE
Ahmed Abu-Khadrah: Department of Electrical Engineering, Al-Balqa Applied University Faculty of Engineering, As-Salt, Jordan

DOI: https://doi.org/10.1080/21642583.2025.2474450
Journal volume & issue: Vol. 13, no. 1

Abstract

Email is a critical channel for global data exchange. As data volumes surge, attackers exploit user identities and misuse data, using phishing and spam to carry out security breaches. Machine learning counters these attacks with a range of techniques and has shown significant efficiency in identifying phishing emails. Machine learning can be divided into two broad types: supervised and unsupervised. Supervised learning trains a model on labelled datasets and covers both classification and regression. Supervised methods such as support vector machines (SVMs), naive Bayes, decision trees, neural networks, random forests, and deep learning have been applied to spam filtering. This review examines spam filtering and email classification with supervised machine learning techniques, offering a comprehensive evaluation of strategies, methods, performance indicators, and the benefits and drawbacks reported in different studies. This information allows researchers to assess the efficiency and effectiveness of supervised learning algorithms, laying the foundation for advanced email categorization techniques.
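As a rough illustration of what training on a labelled dataset means for spam filtering, the following is a minimal sketch of a multinomial naive Bayes classifier, one of the supervised methods the abstract names. It is not taken from the survey: the toy emails, the whitespace tokenizer, the add-one (Laplace) smoothing, and the function names `train_naive_bayes` and `classify` are all assumptions made for this example.

```python
import math
from collections import Counter

def tokenize(text):
    # Assumption: lowercase whitespace tokenization is enough for a toy example.
    return text.lower().split()

def train_naive_bayes(labelled_emails):
    """Count documents and words per label from (text, label) pairs."""
    doc_counts = Counter()   # number of emails seen per label
    word_counts = {}         # label -> Counter of token frequencies
    vocabulary = set()       # every token seen during training
    for text, label in labelled_emails:
        doc_counts[label] += 1
        counts = word_counts.setdefault(label, Counter())
        for token in tokenize(text):
            counts[token] += 1
            vocabulary.add(token)
    return doc_counts, word_counts, vocabulary

def classify(model, text):
    """Return the label with the highest log posterior score."""
    doc_counts, word_counts, vocabulary = model
    total_docs = sum(doc_counts.values())
    best_label, best_score = None, -math.inf
    for label in doc_counts:
        # Log prior: fraction of training emails carrying this label.
        score = math.log(doc_counts[label] / total_docs)
        total_words = sum(word_counts[label].values())
        for token in tokenize(text):
            if token not in vocabulary:
                continue  # ignore tokens never seen in training
            # Add-one smoothing keeps unseen (label, token) pairs non-zero.
            likelihood = (word_counts[label][token] + 1) / (total_words + len(vocabulary))
            score += math.log(likelihood)
        if score > best_score:
            best_label, best_score = label, score
    return best_label

# Hypothetical labelled dataset, invented for illustration only.
TOY_EMAILS = [
    ("win a free prize now", "spam"),
    ("free money claim your prize", "spam"),
    ("meeting agenda for monday", "ham"),
    ("please review the project report", "ham"),
]

model = train_naive_bayes(TOY_EMAILS)
print(classify(model, "claim your free prize"))
print(classify(model, "project meeting on monday"))
```

On this toy data the first message is scored as spam and the second as ham. A real filter would need a far larger labelled corpus, a more careful tokenizer, and evaluation with performance indicators such as precision and recall.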

Published in Systems Science & Control Engineering

ISSN: 2164-2583 (Online)
Publisher: Taylor & Francis Group
Country of publisher: United Kingdom
LCC subjects: Technology: Mechanical engineering and machinery: Control engineering systems. Automatic machinery (General); Technology: Engineering (General). Civil engineering (General): Systems engineering
Website: https://www.tandfonline.com/journals/tssc
