Revista Brasileira de Computação Aplicada (Apr 2011)

Dimensionality Reduction Applied to Spam Filtering using Bayesian Classifiers

  • Tiago A. Almeida,
  • Akebo Yamakami

DOI
https://doi.org/10.5335/rbca.2013.1317
Journal volume & issue
Vol. 3, no. 1
pp. 16 – 29

Abstract

Read online

In recent years, e-mail spam has become an increasingly important problem with a big economic impact in society. Fortunately, there are different approaches able to automatically detect and remove most of these messages, and the best-known ones are based on Bayesian decision theory. However, the most of these probabilistic approaches have the same difficulty: the high dimensionality of the feature space. Many term selection methods have been proposed in the literature. In this paper, we revise the most popular methods used as term selection techniques with seven different versions of Naive Bayes spam filters.

Keywords