Exploiting multi–core and many–core parallelism for subspace clustering

Datta Amitava; Kaur Amardeep; Lauer Tobias; Chabbouh Sami

doi:10.2478/amcs-2019-0006

International Journal of Applied Mathematics and Computer Science (Mar 2019)

Exploiting multi–core and many–core parallelism for subspace clustering

Datta Amitava,
Kaur Amardeep,
Lauer Tobias,
Chabbouh Sami

Affiliations

Datta Amitava: School of Computer Science and Software Engineering, University of Western Australia, 35 Stirling Highway, Crawley, Perth, WA 6009, Australia
Kaur Amardeep: School of Computer Science and Software Engineering, University of Western Australia, 35 Stirling Highway, Crawley, Perth, WA 6009, Australia
Lauer Tobias: Department of Electrical Engineering and Information Technology, Offenburg University of Applied Sciences, Badstr. 24, 77652Offenburg, Germany
Chabbouh Sami: Department of Electrical Engineering and Information Technology, Offenburg University of Applied Sciences, Badstr. 24, 77652Offenburg, Germany

DOI: https://doi.org/10.2478/amcs-2019-0006
Journal volume & issue: Vol. 29, no. 1
pp. 81 – 91

Abstract

Read online

Finding clusters in high dimensional data is a challenging research problem. Subspace clustering algorithms aim to find clusters in all possible subspaces of the dataset, where a subspace is a subset of dimensions of the data. But the exponential increase in the number of subspaces with the dimensionality of data renders most of the algorithms inefficient as well as ineffective. Moreover, these algorithms have ingrained data dependency in the clustering process, which means that parallelization becomes difficult and inefficient. SUBSCALE is a recent subspace clustering algorithm which is scalable with the dimensions and contains independent processing steps which can be exploited through parallelism. In this paper, we aim to leverage the computational power of widely available multi-core processors to improve the runtime performance of the SUBSCALE algorithm. The experimental evaluation shows linear speedup. Moreover, we develop an approach using graphics processing units (GPUs) for fine-grained data parallelism to accelerate the computation further. First tests of the GPU implementation show very promising results.

Published in International Journal of Applied Mathematics and Computer Science

ISSN: 2083-8492 (Online)
Publisher: Sciendo
Country of publisher: Poland
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://www.amcs.uz.zgora.pl/

About the journal

Abstract

Keywords