Future Internet (Mar 2020)

Feature Selection Algorithms as One of the Python Data Analytical Tools

  • Nikita Pilnenskiy,
  • Ivan Smetannikov

DOI
https://doi.org/10.3390/fi12030054
Journal volume & issue
Vol. 12, no. 3
p. 54

Abstract

As Python's popularity for machine learning applications grows rapidly, the gap between the needs of machine learning engineers and the existing Python tools widens. The gap is especially noticeable in more classical machine learning fields, feature selection in particular, because community attention over the last decade has shifted mainly to neural networks. This paper has two main purposes. First, we survey existing open-source Python and Python-compatible feature selection libraries, describe their problems, if any, and demonstrate the gap between these libraries and the current state of the feature selection field. Second, we present a new open-source, scikit-learn-compatible library, ITMO FS (Information Technologies, Mechanics and Optics University feature selection), which is currently under development. We explain how its architecture covers modern views on feature selection, provide code examples showing how to use it with Python, and compare its performance with that of other Python feature selection libraries.

Keywords