IEEE Access (Jan 2022)

A Comprehensive Survey on the Process, Methods, Evaluation, and Challenges of Feature Selection

  • Md Rashedul Islam,
  • Aklima Akter Lima,
  • Sujoy Chandra Das,
  • M. F. Mridha,
  • Akibur Rahman Prodeep,
  • Yutaka Watanobe

DOI
https://doi.org/10.1109/ACCESS.2022.3205618
Journal volume & issue
Vol. 10
pp. 99595 – 99632

Abstract

Read online

Feature selection is employed to reduce the feature dimensions and computational complexity by eliminating irrelevant and redundant features. A vast amount of increasing data and its processing generates many feature sets, which are reduced by the feature selection process to improve the performance in all types of classification, regression, clustering models. This study performs a detailed analysis of motivation and concentrates on the fundamental architecture of feature selection. This study aims to establish a structured formation related to popular methods such as filters, wrappers and, embedded into search strategies, evaluation criteria, and learning methods. Different methods organize a comparison of the benefits and drawbacks followed by multiple classification algorithms and standard validation measures. The diversity of applications in multiple domains such as data retrieval, prediction analysis, and medical, intrusion, and industrial applications is efficiently highlighted. This study focuses on some additional feature selection methods for handling big data. Nonetheless, new challenges have surfaced in the analysis of such data, which were also addressed in this study. Reflecting on commonly encountered challenges and clarifying how to obtain the absolute feature selection method are the significant components of this study.

Keywords