IEEE Access (Jan 2024)

Incorporating Topological Priors Into Low-Dimensional Visualizations Through Topological Regularization

  • Edith Heiter,
  • Robin Vandaele,
  • Tijl de Bie,
  • Yvan Saeys,
  • Jefrey Lijffijt

DOI
https://doi.org/10.1109/ACCESS.2024.3456474
Journal volume & issue
Vol. 12
pp. 129541 – 129573

Abstract

Read online

Unsupervised representation learning techniques are commonly employed to analyze high-dimensional or unstructured data. In some cases, users may have prior knowledge of the topology of the data, such as a known cluster structure or the fact that it follows a tree- or graph-based structure. However, generic methods for ensuring this inherent structure is evident in low-dimensional representations are lacking and it is unknown how imposing topological constraints affects downstream learning tasks. To fill this gap, we propose topological regularization - a generic approach based on algebraic topology to incorporate topological prior knowledge into low-dimensional representations. We introduce a class of topological loss functions and demonstrate that optimizing an embedding loss together with one of these loss functions as a regularizer results in embeddings that consider not only local proximities but also the desired topological structure. We provide a self-contained introduction to essential concepts in algebraic topology and offer intuitive guidance for designing topological loss functions suitable for a variety of data shapes, such as clusters, cycles, or bifurcations. We empirically assess the efficiency, robustness, and versatility of the proposed method when combined with linear and non-linear dimensionality reduction techniques, as well as graph embedding methods.

Keywords