Advanced Intelligent Systems (Nov 2024)

Dimension Reduction of Multidimensional Structured and Unstructured Datasets through Ensemble Learning of Neural Embeddings

  • Juan Carlos Alvarado‐Pérez,
  • Miguel Angel Garcia,
  • Domenec Puig

DOI
https://doi.org/10.1002/aisy.202400178
Journal volume & issue
Vol. 6, no. 11
pp. n/a – n/a

Abstract

Read online

Dimension reduction aims to project a high‐dimensional dataset into a low‐dimensional space. It tries to preserve the topological relationships among the original data points and/or induce clusters. NetDRm, an online dimensionality reduction method based on neural ensemble learning that integrates different dimension reduction methods in a synergistic way, is introduced. NetDRm is designed for datasets of multidimensional points that can be either structured (e.g., images) or unstructured (e.g., point clouds, tabular data). It starts by training a collection of deep residual encoders that learn the embeddings induced by multiple dimension reduction methods applied to the input dataset. Subsequently, a dense neural network integrates the generated encoders by emphasizing topological preservation or cluster induction. Experiments conducted on widely used multidimensional datasets (point‐cloud manifolds, image datasets, tabular record datasets) show that the proposed method yields better results in terms of topological preservation (RNX curves), cluster induction (V measure), and classification accuracy than the most relevant dimension reduction methods.

Keywords