Incorporating Topological Priors Into Low-Dimensional Visualizations Through Topological Regularization

Edith Heiter; Robin Vandaele; Tijl de Bie; Yvan Saeys; Jefrey Lijffijt

doi:10.1109/ACCESS.2024.3456474

IEEE Access (Jan 2024)

Incorporating Topological Priors Into Low-Dimensional Visualizations Through Topological Regularization

Edith Heiter,
Robin Vandaele,
Tijl de Bie,
Yvan Saeys,
Jefrey Lijffijt

Affiliations

Edith Heiter: ORCiD; Department of Electronics and Information Systems, IDLab, Ghent University, Ghent, Belgium
Robin Vandaele: ORCiD; Department of Electronics and Information Systems, IDLab, Ghent University, Ghent, Belgium
Tijl de Bie: ORCiD; Department of Electronics and Information Systems, IDLab, Ghent University, Ghent, Belgium
Yvan Saeys: ORCiD; Department of Applied Mathematics, Computer Science and Statistics, Ghent University, Ghent, Belgium
Jefrey Lijffijt: ORCiD; Department of Electronics and Information Systems, IDLab, Ghent University, Ghent, Belgium

DOI: https://doi.org/10.1109/ACCESS.2024.3456474
Journal volume & issue: Vol. 12
pp. 129541 – 129573

Abstract

Read online

Unsupervised representation learning techniques are commonly employed to analyze high-dimensional or unstructured data. In some cases, users may have prior knowledge of the topology of the data, such as a known cluster structure or the fact that it follows a tree- or graph-based structure. However, generic methods for ensuring this inherent structure is evident in low-dimensional representations are lacking and it is unknown how imposing topological constraints affects downstream learning tasks. To fill this gap, we propose topological regularization - a generic approach based on algebraic topology to incorporate topological prior knowledge into low-dimensional representations. We introduce a class of topological loss functions and demonstrate that optimizing an embedding loss together with one of these loss functions as a regularizer results in embeddings that consider not only local proximities but also the desired topological structure. We provide a self-contained introduction to essential concepts in algebraic topology and offer intuitive guidance for designing topological loss functions suitable for a variety of data shapes, such as clusters, cycles, or bifurcations. We empirically assess the efficiency, robustness, and versatility of the proposed method when combined with linear and non-linear dimensionality reduction techniques, as well as graph embedding methods.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords