Journal of Big Data (Mar 2022)

A scalable association rule learning and recommendation algorithm for large-scale microarray datasets

  • Haosong Li,
  • Phillip C.-Y. Sheu

DOI
https://doi.org/10.1186/s40537-022-00577-4
Journal volume & issue
Vol. 9, no. 1
pp. 1 – 25

Abstract

Association rule learning algorithms have been applied to microarray datasets to find association rules among genes. With the development of microarray technology, larger datasets have been generated recently that challenge the current association rule learning algorithms. Specifically, the large number of items per transaction significantly increases the running time and memory consumption of such tasks. In this paper, we propose the Scalable Association Rule Learning (SARL) heuristic that efficiently learns gene–disease association rules and gene–gene association rules from large-scale microarray datasets. The rules are ranked based on their importance. Our experiments show the SARL algorithm outperforms the Apriori algorithm by one to three orders of magnitude.

Keywords