Sistemas de Informação (Jun 2010)

Classification in Text Mining

  • BEZERRA, E.,
  • GOLDSCHMIDT, R.

Journal volume & issue
no. 5
pp. 42 – 62

Abstract

Read online

Classification means assigning each object in a collection or dataset to a category or class. This mapping can also be called classification model or classifier. When using textual data, the objecto to be classified can be either documents in a collection or words or sentences that belong to those documents. This tutorial consists on an introduction to the task of document classification, one of the best known problems in Text Mining. Some popular algorithms that are used throughout the literature to classify document are presented here. Besides, we also present some techniques whose goal is to evaluate the quality of a classification model.

Keywords