IEEE Access (Jan 2024)

Email Spam: A Comprehensive Review of Optimize Detection Methods, Challenges, and Open Research Problems

  • Ekramul Haque Tusher,
  • Mohd Arfian Ismail,
  • Md Arafatur Rahman,
  • Ali H. Alenezi,
  • Mueen Uddin

DOI
https://doi.org/10.1109/ACCESS.2024.3467996
Journal volume & issue
Vol. 12
pp. 143627 – 143657

Abstract

Read online

Nowadays, emails are used across almost every field, spanning from business to education. Broadly, emails can be categorized as either ham or spam. Email spam, also known as junk emails or unwanted emails, can harm users by wasting time and computing resources, along with stealing valuable information. The volume of spam emails is rising rapidly day by day. Detecting and filtering spam presents significant and complex challenges for email systems. Traditional identification techniques like blocklists, real-time blackhole listing, and content-based methods have limitations. These limitations have led to the advancement of more sophisticated machine learning (ML) and deep learning (DL) methods for enhanced spam detection accuracy. In recent years, considerable attention has focused on the potential of ML and DL methods to improve email spam detection. A comprehensive literature review is therefore imperative for developing an updated, evidence-based understanding of contemporary research on employing these methods against this persistent problem. The review aims to systematically identify various ML and DL methods applied for spam detection, evaluate their effectiveness, and highlight promising future research directions considering gaps. By combining and analyzing findings across studies, it will obtain the strengths and weaknesses of existing methods. This review seeks to advance knowledge on reliable and efficient integration of state-of-the-art ML and DL into identifying email spam.

Keywords