Entropy (Nov 2016)

Efficient Multi-Label Feature Selection Using Entropy-Based Label Selection

  • Jaesung Lee,
  • Dae-Won Kim

DOI
https://doi.org/10.3390/e18110405
Journal volume & issue
Vol. 18, no. 11
p. 405

Abstract

Read online

Multi-label feature selection is designed to select a subset of features according to their importance to multiple labels. This task can be achieved by ranking the dependencies of features and selecting the features with the highest rankings. In a multi-label feature selection problem, the algorithm may be faced with a dataset containing a large number of labels. Because the computational cost of multi-label feature selection increases according to the number of labels, the algorithm may suffer from a degradation in performance when processing very large datasets. In this study, we propose an efficient multi-label feature selection method based on an information-theoretic label selection strategy. By identifying a subset of labels that significantly influence the importance of features, the proposed method efficiently outputs a feature subset. Experimental results demonstrate that the proposed method can identify a feature subset much faster than conventional multi-label feature selection methods for large multi-label datasets.

Keywords