Clustering graph data: the roadmap to spectral techniques

Rahul Mondal; Evelina Ignatova; Daniel Walke; David Broneske; Gunter Saake; Robert Heyer

doi:10.1007/s44163-024-00102-x

Discover Artificial Intelligence (Jan 2024)

Affiliations

Rahul Mondal: Faculty of Computer Science, Otto-von-Guericke-University
Evelina Ignatova: Faculty of Computer Science, Otto-von-Guericke-University
Daniel Walke: Faculty of Computer Science, Otto-von-Guericke-University
David Broneske: German Center for Higher Education Research and Science Studies (DZHW)
Gunter Saake: Faculty of Computer Science, Otto-von-Guericke-University
Robert Heyer: Faculty of Technology, Bielefeld University

DOI: https://doi.org/10.1007/s44163-024-00102-x
Journal volume & issue: Vol. 4, no. 1
pp. 1 – 22

Abstract

Graph data models enable efficient storage, visualization, and analysis of highly interlinked data by providing horizontal scalability and high query performance. Clustering techniques such as K-means and hierarchical clustering are valuable tools in data mining and machine learning for finding meaningful similarities and differences between data points. Recent developments in graph data models, as well as in clustering algorithms for graph data, have shown promising results in applications such as image segmentation and gene data analysis. This progress has been achieved primarily through research on algorithms in the field of spectral theory, which led to the conception of spectral clustering algorithms. Spectral clustering algorithms have been among the most effective at grouping similar data points in graph data models. In this paper, we compile 16 spectral clustering algorithms and compare their computational complexities, following an overview of graph data models and graph database models. Furthermore, we provide a broad taxonomy to classify most existing clustering algorithms and discuss the taxonomy in detail.

Published in Discover Artificial Intelligence

ISSN: 2731-0809 (Online)
Publisher: Springer
Country of publisher: Switzerland
LCC subjects: Language and Literature: Philology. Linguistics: Computational linguistics. Natural language processing; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://www.springer.com/journal/44163
