Algorithms (Jul 2024)

Hardness and Approximability of Dimension Reduction on the Probability Simplex

  • Roberto Bruno

DOI
https://doi.org/10.3390/a17070296
Journal volume & issue
Vol. 17, no. 7
p. 296

Abstract

Read online

Dimension reduction is a technique used to transform data from a high-dimensional space into a lower-dimensional space, aiming to retain as much of the original information as possible. This approach is crucial in many disciplines like engineering, biology, astronomy, and economics. In this paper, we consider the following dimensionality reduction instance: Given an n-dimensional probability distribution p and an integer mn, we aim to find the m-dimensional probability distribution q that is the closest to p, using the Kullback–Leibler divergence as the measure of closeness. We prove that the problem is strongly NP-hard, and we present an approximation algorithm for it.

Keywords