New and Efficient Algorithms for Producing Frequent Itemsets with the Map-Reduce Framework

Yaron Gonen; Ehud Gudes; Kirill Kandalov

doi:10.3390/a11120194

Algorithms (Nov 2018)

New and Efficient Algorithms for Producing Frequent Itemsets with the Map-Reduce Framework

Yaron Gonen,
Ehud Gudes,
Kirill Kandalov

Affiliations

Yaron Gonen: Department of Computer Science, Ben-Gurion University, Beer-Sheva 8410501, Israel
Ehud Gudes: Department of Computer Science, Ben-Gurion University, Beer-Sheva 8410501, Israel
Kirill Kandalov: Department of Computer Science, Open University, Ra’anana 4353701, Israel

DOI: https://doi.org/10.3390/a11120194
Journal volume & issue: Vol. 11, no. 12
p. 194

Abstract

Read online

The Map-Reduce (MR) framework has become a popular framework for developing new parallel algorithms for Big Data. Efficient algorithms for data mining of big data and distributed databases has become an important problem. In this paper we focus on algorithms producing association rules and frequent itemsets. After reviewing the most recent algorithms that perform this task within the MR framework, we present two new algorithms: one algorithm for producing closed frequent itemsets, and the second one for producing frequent itemsets when the database is updated and new data is added to the old database. Both algorithms include novel optimizations which are suitable to the MR framework, as well as to other parallel architectures. A detailed experimental evaluation shows the effectiveness and advantages of the algorithms over existing methods when it comes to large distributed databases.

Published in Algorithms

ISSN: 1999-4893 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Technology (General): Industrial engineering. Management engineering; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://www.mdpi.com/journal/algorithms

About the journal

Abstract

Keywords