Sistemas de Informação (Jun 2013)

A New Approach of Parallelism and Load Balance for the Apriori Algorithm

  • BOLINA, A. C.,
  • PEREIRA, D. A.,
  • ESMIN, A. A. A.,
  • PEREIRA, M. R.

Journal volume & issue
no. 11
pp. 33 – 41

Abstract

Read online

The main goal of data mining is to discover relevant information on digital content. The Apriori algorithm is widely used to this objective, but its sequential version has a low performance when execu- ted over large volumes of data. Among the solutions for this problem is the parallel implementation of the algorithm, and among the parallel implementations presented in the literature that based on Apriori, it highlights the DPA (Distributed Parallel Apriori) [10]. This paper presents the DMTA (Distributed Multithread Apriori) algorithm, which is based on DPA and exploits the parallelism level of threads in order to increase the performance. Besides, DMTA can be executed over heterogeneous hardware platform, using different number of cores. The results showed that DMTA outperforms DPA, presents load balance among processes and threads, and it is effective in current multicore architectures.

Keywords